The curse of dimensionality is not a specific geometric structure, but a collection of phenomena induced by the intrinsic properties of high-dimensional spaces.
The curse of dimensionality is not a specific geometric structure, but a collection of phenomena induced by the intrinsic properties of high-dimensional spaces. In particular, effects such as concentration of measure and distance concentration lead to the degeneration of metric discriminability, making many distance-based methods ineffective.
This article introduces the Curse of Dimensionality effect in spaces of different curvature from mathematical and statistical perspectives.
Euclidean Space
To introduce the curse of dimensionality, we need to start with the sample distribution in space.
Let us start with -dimension, a line segment in the numerical range , and assume that all samples follow a random and uniform distribution on this line segment. Also, the unit measure of the line segment is unit of length.
Now, we consider the inner region that contains 90% of the measure, i.e., the interval , and treat the remaining 10% as a thin outer shell with finite thickness. It is evident that 90% of the samples are randomly distributed within , while the remaining 10% are located in the shell regions, namely and .
Obviously, we can infer that the proportion of samples distributed on the thin shell in -dimension is .
Now, let us extend this similar situation to circles in a -dimensional plane. As we all know, the area of a circle can be calculated with the formula . Also, we consider the inner region that contains 90% of the measure, i.e., , and the remaining finite shell . Next, we need to calculate the proportion of samples distributed in these two regions.
Similarly, we can continue our exploration in -dimensional space. The volume for a -dimensional sphere is .
Here, we seem to be able to summarize a general pattern: as the dimension increases, the proportion of samples distributed on the outer shell of the space increases continuously , while the proportion of samples in the cavity decreases continuously .
To rigorously verify this conclusion, we need general mathematical reasoning. For a -dimensional space, the volume of a hypersphere with radius is:
Where:
- is the gamma function;
- When is a positive integer, , if is even;
- If is odd, the gamma function gives an expression containing .
We have:
For high dimensions , we have the limit:
That is to say, random and uniform samples tend to distribute on the outer shell as the dimension grows. This is named the Thin-Shell Effect.
Euclidean Distance
Consider a high-dimensional space with random vectors:
The magnitude of the vector is:
Since , we have:
Here, is the Chi-Squared Distribution, defined as the sum of squares of mutually independent random variables following a standard normal distribution . In this case, since the squared norm is the sum of such variables, the degrees of freedom is .
According to the Law of Large Numbers:
That is, the magnitude of vectors is approximated by .
We have the Euclidean distance:
We have known:
The inner product of vectors is:
Since:
- are independent
Therefore:
The order of magnitude is .
Substituting back into the distance equation, we have:
Thus:
Looking at the relative fluctuation:
- Mean:
- Standard Deviation:
Therefore:
Conclusion: The relative fluctuation of distance tends to 0 the distance between all points is almost the same.
Angular Distance
If we project vectors onto the unit hypersphere after normalization:
Then:
In high dimensions:
Therefore:
The angular distance between all vectors is almost equidistant.
In high-dimensional spaces, the norm of random vectors concentrates around a constant due to the law of large numbers. Moreover, the inner product between independent vectors grows only on the order of , which is negligible compared to the magnitude of squared norms. As a result, pairwise distances concentrate around a constant, leading to the phenomenon that almost all points are approximately equidistant.
Positively Curved Riemannian Space
In Euclidean space, the curse of dimensionality arises from the polynomial growth of volume and the independence structure of coordinates. However, in a positively curved Riemannian manifold, the geometry itself fundamentally reshapes both volume distribution and distance behavior.
A canonical example is the -dimensional unit sphere:
which is a space with constant positive sectional curvature.
Measure Concentration on the Sphere
Unlike Euclidean space, where volume spreads radially, on the sphere all points lie on a fixed-radius manifold. Thus, “radial shells” are replaced by geodesic bands.
Let us fix a point and define a geodesic ball:
The measure of this region depends on :
Now observe the key phenomenon:
- When is small, is relatively flat
- When , becomes sharply peaked at:
This leads to: Most of the mass concentrates near the equator orthogonal to any fixed direction.
More formally, for any fixed :
This is a manifestation of the concentration of measure phenomenon on positively curved spaces.
The Geometric Interpretation could be described as “Equatorial Collapse”. In Euclidean space, samples concentrate in a thin outer shell. On the sphere, samples instead concentrate in a thin equatorial band.
This can be viewed as a curvature-induced redistribution of measure.
Angular Distance in Positive Curvature
Let:
Then their inner product satisfies:
Thus:
So we have the conclusion:
Two random points on a high-dimensional sphere are almost always orthogonal.
This implies:
Distance Concentration under Geodesic Metric
Unlike Euclidean distance, the natural metric on is the geodesic distance:
Since:
- fluctuations are
we have:
Thus, Geodesic distances also concentrate → almost all pairs of points are at distance .
Curvature Amplifies Concentration
An important insight is: Positive curvature does not eliminate the curse of dimensionality — it reshapes and often strengthens it.
Why?
- The sphere is compact → no radial dispersion
- Curvature forces geodesics to “bend back”
- Volume grows slower than Euclidean space
As a result: Samples are even more tightly concentrated. Angular discrimination becomes harder. nd most directions become indistinguishable
In positively curved spaces: Measure concentrates on equatorial regions, and angles concentrate around , geodesic distances become nearly constant. The curse of dimensionality persists, but manifests as angular collapse instead of radial shell concentration.