Euclidean distance

In mathematics, the Euclidean distance or Euclidean metric is the "ordinary" straight-line distance between two points in Euclidean space. With this distance, Euclidean space becomes a metric space. The associated norm is called the Euclidean norm. Older literature refers to the metric as the Pythagorean metric. A generalized term for the Euclidean norm is the L² norm or L² distance.

Illustration for n=3, repeated application of the Pythagorean theorem yields the formula

Definition

The Euclidean distance between points p and q is the length of the line segment connecting them ( ${\overline {\mathbf {p} \mathbf {q} }}$ ).

In Cartesian coordinates, if p = (p₁, p₂,..., p_n) and q = (q₁, q₂,..., q_n) are two points in Euclidean n-space, then the Euclidean distance (d) from p to q, or from q to p is given by the Pythagorean formula:[1]

{\begin{aligned}d(\mathbf {p} ,\mathbf {q} )=d(\mathbf {q} ,\mathbf {p} )&={\sqrt {(q_{1}-p_{1})^{2}+(q_{2}-p_{2})^{2}+\cdots +(q_{n}-p_{n})^{2}}}\\[8pt]&={\sqrt {\sum _{i=1}^{n}(q_{i}-p_{i})^{2}}}.\end{aligned}}

(1)

The position of a point in a Euclidean n-space is a Euclidean vector. So, p and q may be represented as Euclidean vectors, starting from the origin of the space (initial point) with their tips (terminal points) ending at the two points. The Euclidean norm, or Euclidean length, or magnitude of a vector measures the length of the vector:[1]

\left\|\mathbf {p} \right\|={\sqrt {p_{1}^{2}+p_{2}^{2}+\cdots +p_{n}^{2}}}={\sqrt {\mathbf {p} \cdot \mathbf {p} }},

where the last expression involves the dot product.

Describing a vector as a directed line segment from the origin of the Euclidean space (vector tail), to a point in that space (vector tip), its length is actually the distance from its tail to its tip. The Euclidean norm of a vector is seen to be just the Euclidean distance between its tail and its tip.

The relationship between points p and q may involve a direction (for example, from p to q), so when it does, this relationship can itself be represented by a vector, given by

\mathbf {q} -\mathbf {p} =(q_{1}-p_{1},q_{2}-p_{2},\cdots ,q_{n}-p_{n}).

In a two- or three-dimensional space (n = 2, 3), this can be visually represented as an arrow from p to q. In any space it can be regarded as the position of q relative to p. It may also be called a displacement vector if p and q represent two positions of some moving point.

The Euclidean distance between p and q is just the Euclidean length of this displacement vector:

\left\|\mathbf {q} -\mathbf {p} \right\|={\sqrt {(\mathbf {q} -\mathbf {p} )\cdot (\mathbf {q} -\mathbf {p} )}}.

(2)

which is equivalent to equation 1, and also to:

\left\|\mathbf {q} -\mathbf {p} \right\|={\sqrt {\left\|\mathbf {p} \right\|^{2}+\left\|\mathbf {q} \right\|^{2}-2\mathbf {p} \cdot \mathbf {q} }}.

One dimension

In the context of Euclidean geometry, a metric is established in one dimension by fixing two points on a line, and choosing one to be the origin. The length of the line segment between these points defines the unit of distance and the direction from the origin to the second point is defined as the positive direction. This line segment may be translated along the line to build longer segments whose lengths correspond to multiples of the unit distance. In this manner real numbers can be associated to points on the line (as the distance from the origin to the point) and these are the Cartesian coordinates of the points on what may now be called the real line. As an alternate way to establish the metric, instead of choosing two points on the line, choose one point to be the origin, a unit of length and a direction along the line to call positive. The second point is then uniquely determined as the point on the line that is at a distance of one positive unit from the origin.

The distance between any two points on the real line is the absolute value of the numerical difference of their coordinates. It is common to identify the name of a point with its Cartesian coordinate. Thus if p and q are two points on the real line, then the distance between them is given by:

{\sqrt {(q-p)^{2}}}=|q-p|.

In one dimension, there is a single homogeneous, translation-invariant metric (in other words, a distance that is induced by a norm), up to a scale factor of length, which is the Euclidean distance, induced by the absolute-value norm (which is the unique norm in one dimension, up to scaling). In higher dimensions there are other possible norms, such as the $L^{p}$ norms (which are all equal in one dimension), and in one dimension there are other metrics (scaling Euclidean distance by any monotonic function), but they are not induced by norms (as metrics, they are not homogeneous and translation-invariant).

Two dimensions

Euclidean distance in R²

In the Euclidean plane, if p = (p₁, p₂) and q = (q₁, q₂) then the distance is given by

d(\mathbf {p} ,\mathbf {q} )={\sqrt {(q_{1}-p_{1})^{2}+(q_{2}-p_{2})^{2}}}.

This is equivalent to the Pythagorean theorem.

Alternatively, it follows from (2) that if the polar coordinates of the point p are (r₁, θ₁) and those of q are (r₂, θ₂), then the distance between the points is

{\sqrt {r_{1}^{2}+r_{2}^{2}-2r_{1}r_{2}\cos(\theta _{1}-\theta _{2})}}.

Three dimensions

In three-dimensional Euclidean space, the distance is

d(\mathbf {p} ,\mathbf {q} )={\sqrt {(p_{1}-q_{1})^{2}+(p_{2}-q_{2})^{2}+(p_{3}-q_{3})^{2}}}.

n dimensions

In general, for an n-dimensional space, the distance is

d(\mathbf {p} ,\mathbf {q} )={\sqrt {(p_{1}-q_{1})^{2}+(p_{2}-q_{2})^{2}+\cdots +(p_{i}-q_{i})^{2}+\cdots +(p_{n}-q_{n})^{2}}}={\sqrt {\sum _{i=1}^{n}{(p_{i}-q_{i})^{2}}}}.

Squared Euclidean distance

The square of the standard Euclidean distance, which is known as the squared Euclidean distance (SED), is also of interest; as an equation:

d^{2}(\mathbf {p} ,\mathbf {q} )=(p_{1}-q_{1})^{2}+(p_{2}-q_{2})^{2}+\cdots +(p_{i}-q_{i})^{2}+\cdots +(p_{n}-q_{n})^{2}.

Squared Euclidean distance is of central importance in estimating parameters of statistical models, where it is used in the method of least squares, a standard approach to regression analysis. The corresponding loss function is the squared error loss (SEL), and places progressively greater weight on larger errors. The corresponding risk function (expected loss) is mean squared error (MSE).

Squared Euclidean distance is not a metric, as it does not satisfy the triangle inequality. However, it is a more general notion of distance, namely a divergence (specifically a Bregman divergence), and can be used as a statistical distance. The Pythagorean theorem is simpler in terms of squared distance (since there is no square root); if $\mathbf {pq} \perp \mathbf {qr}$ , then:

d^{2}(\mathbf {p} ,\mathbf {r} )=d^{2}(\mathbf {p} ,\mathbf {q} )+d^{2}(\mathbf {q} ,\mathbf {r} ).

In information geometry, the Pythagorean identity can be generalized from SED to other Bregman divergences, including relative entropy (Kullback–Leibler divergence), allowing generalized forms of least squares to be used to solve non-linear problems.

The SED is a smooth, strictly convex function of the two points, unlike the distance, which is not smooth when two points are equal and is not strictly convex (because it is linear). The SED is thus preferred in optimization theory, since it allows convex analysis to be used. Since squaring is a monotonic function of non-negative values, minimizing the SED is equivalent to minimizing the Euclidean distance, so the optimization problem is equivalent in terms of either, but easier to solve using the SED.

If one of the points is fixed, the SED can be interpreted as a potential function, in which case a normalization factor of one half is used, and the sign may be switched, depending on convention. In detail, given two points $\mathbf {p} ,\mathbf {q}$ , the vector $\mathbf {p} -\mathbf {q}$ points from $\mathbf {q}$ to $\mathbf {p}$ and has magnitude proportional to their Euclidean distance. If one fixes $\mathbf {p}$ , one can thus define a smooth vector field "pointing at $\mathbf {p}$ " by $X_{\mathbf {p} }(\mathbf {q} ):=\mathbf {p} -\mathbf {q} .$ This is the gradient of the scalar-valued function "half SED from $\mathbf {p}$ ", where the half cancels the two in the power rule. Writing half the squared distance from $\mathbf {p}$ as $\textstyle {D_{\mathbf {p} }(q):={\frac {1}{2}}\sum _{i}(p_{i}-q_{i})^{2}}$ , one has $\mathbf {p} -\mathbf {q} =-\mathrm {grad} _{\mathbf {q} }D_{\mathbf {p} }.$ Alternatively, one can consider the vector field pointing from $\mathbf {p}$ , and omit the minus sign.

In information geometry, the notion of a vector field of "pointing from one point to another" can be generalized to statistical manifolds – one can use an affine connection to connect tangent vectors at different points and the exponential map to flow from one point to another, and on a statistical manifold this is invertible, defining a unique "difference vector" from any given point to another. In this context, the SED (whose gradient generates the standard difference vector) generalized to a divergence that generates the information geometry of the manifold; a uniform construction of such a divergence (given the geometric structure) is called a canonical divergence.[2]

In the field of rational trigonometry, the SED is referred to as quadrance.

References

Anton, Howard (1994), Elementary Linear Algebra (7th ed.), John Wiley & Sons, pp. 170–171, ISBN 978-0-471-58742-2
Ay & Amari 2015, 2. A New Approach to the General Inverse Problem.

Ay, Nihat; Amari, Shun-ichi (2015). "A Novel Approach to Canonical Divergences within Information Geometry" (PDF). Entropy. 17 (12): 8111–8129. Bibcode:2015Entrp..17.8111A. doi:10.3390/e17127866.