Dimension Curse on Zero and Positive Curvature Space

The curse of dimensionality is not a specific geometric structure, but a collection of phenomena induced by the intrinsic properties of high-dimensional spaces. In particular, effects such as concentration of measure and distance concentration lead to the degeneration of metric discriminability, making many distance-based methods ineffective.

This article examines how the curse of dimensionality manifests in spaces of zero and positive curvature, from both mathematical and statistical perspectives.

Euclidean Space

To introduce the curse of dimensionality, we need to start with the sample distribution in space.

Let us start in $1$ dimension with the line segment $[-1,1]$ , and assume that all samples are drawn independently and uniformly at random from this segment. The measure of the segment is its length, which is $2$ .

Now, we consider the inner region that contains 90% of the measure, i.e., the interval $[-0.9, 0.9]$ , and treat the remaining 10% as a thin outer shell with finite thickness. It is evident that 90% of the samples are randomly distributed within $[-0.9, 0.9]$ , while the remaining 10% are located in the shell regions, namely $[-1, -0.9)$ and $(0.9, 1]$ .

Obviously, we can infer that the proportion of samples distributed on the thin shell in $1$ -dimension is $P_\text{shell}^{d=1} = 10\%$ .

Now, let us extend the same setup to the unit disk in the $2$ -dimensional plane. The area of a disk of radius $r$ is $S = \pi r^2$ . As before, we consider the inner region of radius $0.9$ , i.e., $x^2 + y^2 \leq 0.9^2$ , and the remaining shell $0.9^2 < x^2 + y^2 \leq 1$ . Unlike the $1$ -dimensional case, the inner region no longer contains 90% of the measure, so we need to calculate the proportion of samples that fall in each of the two regions.

\begin{aligned} S_\text{inner} = \pi \cdot (0.9)^2 = 0.81 \pi \\ S_\text{shell} = S_\text{unit} - S_\text{inner} = \pi \cdot 1^2 - \pi \cdot (0.9)^{2} = 0.19\pi \\ P^{d=2}_\text{shell} = \frac{S_\text{shell}}{S_\text{unit}} = \frac{0.19 \pi}{\pi} = 19\% \end{aligned}

Similarly, we can continue our exploration in $3$ -dimensional space. The volume of a ball of radius $r$ in $3$ dimensions is $V=\frac{4}{3} \pi r^3$ .

\begin{aligned} V_\text{inner} = \frac{4}{3} \pi \cdot (0.9)^3 = \frac{4}{3} \pi \times 0.729 \\ V_\text{shell} = V_\text{unit} - V_\text{inner} = \frac{4}{3} \pi \cdot 1^3 - \frac{4}{3} \pi \cdot (0.9)^{3} = \frac{4}{3} \pi \times 0.271 \\ P^{d=3}_\text{shell} = \frac{V_\text{shell}}{V_\text{unit}} = 27.1\% \end{aligned}

Here a general pattern emerges: as the dimension increases, the proportion of samples in the outer shell grows steadily ( $10\% \rightarrow 19\% \rightarrow 27.1\%$ ), while the proportion of samples in the interior shrinks ( $90\% \rightarrow 81\% \rightarrow 72.9\%$ ).

To rigorously verify this conclusion, we need general mathematical reasoning. For a $d$ -dimensional space, the volume of a hypersphere with radius $r$ is:

V^d(r) = \frac{\pi^{d/2}}{\Gamma\left(\frac{d}{2}+1\right)} r^d

Where:

$\Gamma(\cdot)$ is the gamma function;
If $d$ is even, $\Gamma\left(\frac{d}{2}+1\right) = \left(\frac{d}{2}\right)!$ ;
If $d$ is odd, $\Gamma\left(\frac{d}{2}+1\right)$ is a rational multiple of $\sqrt{\pi}$ .
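As a quick sanity check of the volume formula, the following minimal sketch evaluates it with Python's standard `math.gamma`. The function name `ball_volume` is chosen here for illustration only. For $d = 1, 2, 3$ it should reproduce the length $2r$ , the area $\pi r^2$ , and the volume $\frac{4}{3}\pi r^3$ used earlier.

```python
import math


def ball_volume(d: int, r: float = 1.0) -> float:
    """Volume of a d-dimensional Euclidean ball of radius r.

    Implements V^d(r) = pi^(d/2) / Gamma(d/2 + 1) * r^d.
    """
    return math.pi ** (d / 2) / math.gamma(d / 2 + 1) * r ** d


if __name__ == "__main__":
    # Print the unit-ball volume for a few dimensions.
    for d in (1, 2, 3, 5, 10, 20):
        print(f"d={d:>2}  V(1) = {ball_volume(d):.6f}")
```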

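The dimension-dependent prefactor cancels in the ratio, so the shell proportion depends only on the radii:

P^{d}_\text{shell} = \frac{V^d_\text{shell}}{V^d_\text{unit}} = \frac{V^d(1) - V^d(0.9)}{V^d(1)} = 1^d - (0.9)^d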

For high dimensions $d \rightarrow +\infty$ , we have the limit:

\lim_{d \rightarrow +\infty} \left( 1^d - (0.9)^d \right) = 1 - 0 = 1

In other words, as the dimension $d$ grows, almost all uniformly distributed samples lie in the thin outer shell. This is known as the Thin-Shell Effect.
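The shell proportion can be tabulated directly from $1 - (0.9)^d$ . The sketch below is only an illustration. The function name `shell_fraction` and the default inner radius of $0.9$ are assumptions taken from the running example, not a general convention.

```python
def shell_fraction(d: int, inner_radius: float = 0.9) -> float:
    """Fraction of a unit d-ball's volume lying outside `inner_radius`."""
    return 1.0 - inner_radius ** d


if __name__ == "__main__":
    # The fraction in the shell approaches 1 as d grows.
    for d in (1, 2, 3, 10, 50, 100):
        print(f"d={d:>3}  shell fraction = {shell_fraction(d):.4f}")
```

The values for $d = 1, 2, 3$ match the 10%, 19%, and 27.1% computed by hand above.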

Euclidean Distance

Consider a high-dimensional space $\mathbb{R}^d$ with random vectors:

x, y \sim \text{i.i.d.} \ \mathcal{N}(0, I_d)

The magnitude of the vector $x$ is:

||x||^2 = \sum_{i=1}^d x_i^2

Since the coordinates $x_i$ are independent and each $x_i^2 \sim \chi^2(1)$ , we have:

||x||^2 \sim \chi^2(d)

Here, $\chi^2(k)$ is the Chi-Squared Distribution, defined as the sum of squares of $k$ mutually independent random variables following a standard normal distribution $\mathcal{N}(0,1)$ . In this case, since the squared norm is the sum of $d$ such variables, the degrees of freedom is $k=d$ .

According to the Law of Large Numbers, since $E[x_i^2] = 1$ :

\frac{1}{d}||x||^2 \to 1

That is, the norm of such a vector is approximately $||x|| \approx \sqrt{d}$ .
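The concentration of the norm around $\sqrt{d}$ can be checked empirically. The following is a minimal Monte Carlo sketch using only the standard library. The helper name `mean_norm_ratio`, the sample count, and the seed are illustrative assumptions.

```python
import math
import random


def mean_norm_ratio(d: int, n_samples: int = 200, seed: int = 0) -> float:
    """Average of ||x|| / sqrt(d) over draws x ~ N(0, I_d)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        # Squared norm of one standard Gaussian vector in R^d.
        squared_norm = sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(d))
        total += math.sqrt(squared_norm) / math.sqrt(d)
    return total / n_samples


if __name__ == "__main__":
    # The ratio should move toward 1 as d increases.
    for d in (1, 10, 100, 1000):
        print(f"d={d:>4}  mean ||x|| / sqrt(d) = {mean_norm_ratio(d):.3f}")
```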

We have the Euclidean distance:

||x - y||^2 = ||x||^2 + ||y||^2 - 2 \langle x, y \rangle

We already know that:

||x||^2 \approx d, \quad ||y||^2 \approx d

The inner product of vectors $x, y$ is:

\langle x, y \rangle = \sum_{i=1}^d x_i y_i

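Since:

$x_i$ and $y_i$ are independent
$E[x_i y_i] = 0$
$\text{Var}(x_i y_i) = 1$

the inner product is a sum of $d$ i.i.d. terms with mean $0$ and variance $1$ . By the Central Limit Theorem, for large $d$ :

\langle x, y \rangle \overset{\text{approx.}}{\sim} \mathcal{N}(0, d)

Its typical magnitude is therefore $O(\sqrt{d})$ .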
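To see that the inner product has mean close to $0$ and variance close to $d$ , one can simulate it. This is a rough sketch, not a proof. The function name `inner_product_samples` and its default parameters are assumptions made for illustration.

```python
import random
import statistics


def inner_product_samples(d: int, n_samples: int = 2000, seed: int = 0) -> list:
    """Samples of <x, y> for independent x, y ~ N(0, I_d)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        # Each term is x_i * y_i with x_i, y_i drawn independently.
        samples.append(
            sum(rng.gauss(0.0, 1.0) * rng.gauss(0.0, 1.0) for _ in range(d))
        )
    return samples


if __name__ == "__main__":
    for d in (10, 100):
        s = inner_product_samples(d)
        # Variance divided by d should be close to 1.
        print(d, round(statistics.mean(s), 3), round(statistics.pvariance(s) / d, 3))
```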

Substituting back into the distance equation, we have:

\begin{aligned} ||x - y||^2 = d + d - 2 \cdot O(\sqrt{d}) = 2d + O(\sqrt{d}) \\ ||x - y||^2 \approx 2d \end{aligned}

Thus:

||x - y|| \approx \sqrt{2d}

Looking at the relative fluctuation of the squared distance $||x - y||^2$ :

Mean: $2d$
Standard Deviation: $O(\sqrt{d})$

Therefore:

\frac{\text{std}}{\text{mean}} \sim \frac{\sqrt{d}}{d} = \frac{1}{\sqrt{d}} \to 0

Conclusion: the relative fluctuation of the distance tends to $0$ , so the distances between all pairs of points are almost the same.
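The shrinking relative fluctuation can be observed numerically. The sketch below samples pairs of Gaussian vectors and reports the mean distance together with the ratio of standard deviation to mean. The name `distance_stats`, the number of pairs, and the seed are illustrative assumptions.

```python
import math
import random
import statistics


def distance_stats(d: int, n_pairs: int = 200, seed: int = 0) -> tuple:
    """Return (mean distance, std / mean) for pairs x, y ~ N(0, I_d)."""
    rng = random.Random(seed)
    dists = []
    for _ in range(n_pairs):
        # Squared Euclidean distance accumulated coordinate by coordinate.
        sq = sum((rng.gauss(0.0, 1.0) - rng.gauss(0.0, 1.0)) ** 2 for _ in range(d))
        dists.append(math.sqrt(sq))
    mean = statistics.mean(dists)
    return mean, statistics.pstdev(dists) / mean


if __name__ == "__main__":
    for d in (2, 10, 100, 1000):
        mean, rel = distance_stats(d)
        print(f"d={d:>4}  mean={mean:8.3f}  sqrt(2d)={math.sqrt(2 * d):8.3f}  std/mean={rel:.4f}")
```

In a run of this kind, the mean distance should track $\sqrt{2d}$ while the ratio of standard deviation to mean decays roughly like $1/\sqrt{d}$ .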

Angular Distance

If we project vectors onto the unit hypersphere after $\ell_2$ normalization:

||x|| = ||y|| = 1

Then, with $\theta$ denoting the angle between $x$ and $y$ :

||x - y||^2 = 2 - 2\cos\theta

In high dimensions:

\cos\theta \approx 0

Therefore:

||x - y|| \approx \sqrt{2}

After normalization, almost all pairs of vectors are nearly orthogonal and therefore nearly equidistant.
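The same effect can be checked for normalized vectors. The following sketch normalizes Gaussian vectors to unit length and measures the chord distance $\sqrt{2 - 2\cos\theta}$ between random pairs. The helper names `unit_vector` and `chord_distance_stats` are hypothetical and exist only for this example.

```python
import math
import random
import statistics


def unit_vector(d: int, rng: random.Random) -> list:
    """Draw a Gaussian vector in R^d and scale it to unit l2 norm."""
    v = [rng.gauss(0.0, 1.0) for _ in range(d)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def chord_distance_stats(d: int, n_pairs: int = 200, seed: int = 0) -> tuple:
    """Return (mean, std) of ||x - y|| for random unit vectors x, y."""
    rng = random.Random(seed)
    dists = []
    for _ in range(n_pairs):
        x, y = unit_vector(d, rng), unit_vector(d, rng)
        cos_theta = sum(a * b for a, b in zip(x, y))
        # ||x - y||^2 = 2 - 2 cos(theta); clamp to guard against rounding.
        dists.append(math.sqrt(max(0.0, 2.0 - 2.0 * cos_theta)))
    return statistics.mean(dists), statistics.pstdev(dists)


if __name__ == "__main__":
    for d in (3, 30, 300):
        mean, std = chord_distance_stats(d)
        print(f"d={d:>3}  mean={mean:.4f}  std={std:.4f}  sqrt(2)={math.sqrt(2):.4f}")
```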

In high-dimensional spaces, the norm of random vectors concentrates around a constant due to the law of large numbers. Moreover, the inner product between independent vectors grows only on the order of $O(\sqrt{d})$ , which is negligible compared to the $O(d)$ magnitude of squared norms. As a result, pairwise distances concentrate around a constant, leading to the phenomenon that almost all points are approximately equidistant.

Positively Curved Riemannian Space

In Euclidean space, the curse of dimensionality arises from the polynomial growth of volume and the independence structure of coordinates. However, in a positively curved Riemannian manifold, the geometry itself fundamentally reshapes both volume distribution and distance behavior.

A canonical example is the unit sphere in $\mathbb{R}^d$ :

\mathbb{S}^{d-1} = \{ x \in \mathbb{R}^d : ||x|| = 1 \}

which is a $(d-1)$ -dimensional manifold with constant positive sectional curvature.

Measure Concentration on the Sphere

Unlike Euclidean space, where volume spreads radially, on the sphere all points lie on a fixed-radius manifold. Thus, “radial shells” are replaced by geodesic bands.

Let us fix a point $x_0 \in \mathbb{S}^{d-1}$ and define a geodesic ball:

B(\theta) = \{ x \in \mathbb{S}^{d-1} : \angle(x, x_0) \leq \theta \}

The measure of this region depends on $\sin^{d-2}(\theta)$ :

\mathrm{Vol}(B(\theta)) \propto \int_0^\theta \sin^{d-2}(\phi) \, d\phi

Now observe the key phenomenon:

When $d$ is small, $\sin^{d-2}(\theta)$ is relatively flat
When $d \to \infty$ , $\sin^{d-2}(\theta)$ becomes sharply peaked at $\theta = \frac{\pi}{2}$

As a result, most of the mass concentrates near the equator orthogonal to any fixed direction.

More formally, for any fixed $\epsilon > 0$ :

\mathbb{P}\left( \left| \angle(x, x_0) - \frac{\pi}{2} \right| \leq \epsilon \right) \to 1 \quad \text{as } d \to \infty

This is a manifestation of the concentration of measure phenomenon on positively curved spaces.
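The probability of the equatorial band can be estimated by integrating the density $\sin^{d-2}(\phi)$ numerically. The sketch below uses a simple midpoint rule and assumes $d \geq 2$ . The function name `band_mass` and the step count are illustrative choices, and the result is only accurate up to the discretization error of the rule.

```python
import math


def band_mass(d: int, eps: float, n_steps: int = 20000) -> float:
    """Approximate P(|angle(x, x0) - pi/2| <= eps) for x uniform on S^(d-1).

    Integrates the unnormalized polar-angle density sin^(d-2)(phi) over
    [0, pi] with a midpoint rule and takes the ratio band / total.
    """
    h = math.pi / n_steps
    total = 0.0
    band = 0.0
    for i in range(n_steps):
        phi = (i + 0.5) * h
        weight = math.sin(phi) ** (d - 2)
        total += weight
        if abs(phi - math.pi / 2) <= eps:
            band += weight
    return band / total


if __name__ == "__main__":
    # For a fixed band half-width, the captured mass grows with d.
    for d in (3, 10, 100, 1000):
        print(f"d={d:>4}  P(band, eps=0.1) = {band_mass(d, 0.1):.4f}")
```

For $d = 3$ the density is $\sin\phi$ , so the exact band probability is $\sin\epsilon$ , which gives a convenient check on the numerical result.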

Geometrically, this can be described as an “Equatorial Collapse”. In a Euclidean ball, samples concentrate in a thin outer shell. On the sphere, samples instead concentrate in a thin equatorial band.

This can be viewed as a curvature-induced redistribution of measure.

Angular Distance in Positive Curvature

Let:

x, y \sim \text{Uniform}(\mathbb{S}^{d-1})

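Then their inner product has mean $0$ and variance $\frac{1}{d}$ , and for large $d$ it is approximately Gaussian:

\langle x, y \rangle \overset{\text{approx.}}{\sim} \mathcal{N}\left(0, \frac{1}{d}\right)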

Thus:

\cos\theta = \langle x, y \rangle \approx 0 \quad \Rightarrow \quad \theta \approx \frac{\pi}{2}

So we have the conclusion:

Two random points on a high-dimensional sphere are almost always orthogonal.

This implies:

d_{\text{geo}}(x, y) = \arccos(\langle x, y \rangle) \approx \frac{\pi}{2}

Distance Concentration under Geodesic Metric

Unlike Euclidean distance, the natural metric on $\mathbb{S}^{d-1}$ is the geodesic distance:

d_{\mathbb{S}}(x, y) = \arccos(\langle x, y \rangle)

Since:

$\langle x, y \rangle \to 0$ in probability
its fluctuations are $O(1/\sqrt{d})$
$\arccos$ is smooth near $0$

we have:

\text{Var}(d_{\mathbb{S}}(x, y)) \to 0

Thus, geodesic distances also concentrate: almost all pairs of points are at a distance close to $\frac{\pi}{2}$ .
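A small simulation illustrates the concentration of geodesic distances. It samples uniform points on the sphere by normalizing Gaussian vectors, a standard construction, and computes $\arccos(\langle x, y \rangle)$ for random pairs. The helper names `unit_vector` and `geodesic_stats` and the default parameters are assumptions made for this sketch.

```python
import math
import random
import statistics


def unit_vector(d: int, rng: random.Random) -> list:
    """Uniform point on S^(d-1), obtained by normalizing a Gaussian vector."""
    v = [rng.gauss(0.0, 1.0) for _ in range(d)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def geodesic_stats(d: int, n_pairs: int = 200, seed: int = 0) -> tuple:
    """Return (mean, std) of arccos(<x, y>) over random pairs on S^(d-1)."""
    rng = random.Random(seed)
    angles = []
    for _ in range(n_pairs):
        x, y = unit_vector(d, rng), unit_vector(d, rng)
        cos_theta = sum(a * b for a, b in zip(x, y))
        # Clamp to [-1, 1] so rounding errors cannot break acos.
        angles.append(math.acos(max(-1.0, min(1.0, cos_theta))))
    return statistics.mean(angles), statistics.pstdev(angles)


if __name__ == "__main__":
    for d in (3, 30, 300):
        mean, std = geodesic_stats(d)
        print(f"d={d:>3}  mean={mean:.4f}  std={std:.4f}  std*sqrt(d)={std * math.sqrt(d):.3f}")
```

The mean should stay near $\frac{\pi}{2}$ while the standard deviation shrinks at a rate consistent with $O(1/\sqrt{d})$ .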

Curvature Amplifies Concentration

An important insight is that positive curvature does not eliminate the curse of dimensionality; it reshapes and often strengthens it.

Why?

The sphere is compact → no radial dispersion
Curvature forces geodesics to “bend back”
The volume of geodesic balls grows more slowly than in Euclidean space

As a result, samples are even more tightly concentrated, angular discrimination becomes harder, and most directions become indistinguishable.

In positively curved spaces, measure concentrates on equatorial regions, angles concentrate around $\frac{\pi}{2}$ , and geodesic distances become nearly constant. The curse of dimensionality persists, but manifests as angular collapse instead of radial shell concentration.