Mahalanobis distance

Mahalanobis distance formula: D² = (x - μ)'Σ^(-1)(x - μ)

Image: Public domain, via Wikimedia Commons

Mahalanobis distance

Mahalanobis distance formula: D² = (x - μ)'Σ^(-1)(x - μ)

The Mahalanobis distance formula is a multivariate measure that quantifies the distance between a point and a probability distribution. It generalizes the concept of standard score distance, taking into account the covariance among variables. This distance is zero for points within the distribution and increases with the distance from the mean.

Example

Given a point P with coordinates (x1, x2) and a distribution D with mean μ = (μ1, μ2) and covariance matrix Σ, the Mahalanobis distance D^2 is calculated as D^2 = [(x1 - μ1), (x2 - μ2)]'Σ^(-1)[(x1 - μ1), (x2 - μ2)].

Understanding the Mahalanobis distance formula is crucial for applications in multivariate statistics, such as clustering and classification, where it helps measure similarity and differentiate between groups.

Related concepts

One email a day: 5 concepts + the 5 stories that matter →

Swipe through 100 ML concepts daily

Open TickerNews