2026, April, 10
Dimension Curse on Zero and Positive Curvature Space

The curse of dimensionality is not a specific geometric structure, but a collection of phenomena induced by the intrinsic properties of high-dimensional spaces.

The curse of dimensionality is not a specific geometric structure, but a collection of phenomena induced by the intrinsic properties of high-dimensional spaces. In particular, effects such as concentration of measure and distance concentration lead to the degeneration of metric discriminability, making many distance-based methods ineffective.

This article introduces the Curse of Dimensionality effect in spaces of different curvature from mathematical and statistical perspectives.

Euclidean Space

To introduce the curse of dimensionality, we need to start with the sample distribution in space.

Let us start with 11-dimension, a line segment in the numerical range [1,1][-1,1], and assume that all samples follow a random and uniform distribution on this line segment. Also, the unit measure of the line segment is 11 unit of length.

Now, we consider the inner region that contains 90% of the measure, i.e., the interval [0.9,0.9][-0.9, 0.9], and treat the remaining 10% as a thin outer shell with finite thickness. It is evident that 90% of the samples are randomly distributed within [0.9,0.9][-0.9, 0.9], while the remaining 10% are located in the shell regions, namely [1,0.9)[-1, -0.9) and (0.9,1](0.9, 1].

Obviously, we can infer that the proportion of samples distributed on the thin shell in 11-dimension is Pshelld=1=10%P_\text{shell}^{d=1} = 10\%.

Now, let us extend this similar situation to circles in a 22-dimensional plane. As we all know, the area of a circle can be calculated with the formula S=πr2S = \pi r^2. Also, we consider the inner region that contains 90% of the measure, i.e., x2+y20.9x^2 + y^2 \leq 0.9, and the remaining finite shell 0.9<x2+y210.9 < x^2 + y^2 \leq 1. Next, we need to calculate the proportion of samples distributed in these two regions.

Sinner=π(0.9)2=0.81πSshell=SunitSinner=π12π(0.9)2=0.19πPshelld=2=SshellSunit=0.19ππ=19%\begin{aligned} S_\text{inner} = \pi \cdot (0.9)^2 = 0.81 \pi \\ S_\text{shell} = S_\text{unit} - S_\text{inner} = \pi \cdot 1^2 - \pi \cdot (0.9)^{2} = 0.19\pi \\ P^{d=2}_\text{shell} = \frac{S_\text{shell}}{S_\text{unit}} = \frac{0.19 \pi}{\pi} = 19\% \end{aligned}

Similarly, we can continue our exploration in 33-dimensional space. The volume for a 33-dimensional sphere is V=43πr3V=\frac{4}{3} \pi r^3.

Vinner=43π(0.9)3=43π×0.729Vshell=VunitVinner=43π1343π(0.9)3=43π×0.271Pshelld=3=VshellVunit=27.1%\begin{aligned} V_\text{inner} = \frac{4}{3} \pi \cdot (0.9)^3 = \frac{4}{3} \pi \times 0.729 \\ V_\text{shell} = V_\text{unit} - V_\text{inner} = \frac{4}{3} \pi \cdot 1^3 - \frac{4}{3} \pi \cdot (0.9)^{3} = \frac{4}{3} \pi \times 0.271 \\ P^{d=3}_\text{shell} = \frac{V_\text{shell}}{V_\text{unit}} = 27.1\% \end{aligned}

Here, we seem to be able to summarize a general pattern: as the dimension increases, the proportion of samples distributed on the outer shell of the space increases continuously 10%27.1%10\% \rightarrow 27.1\%, while the proportion of samples in the cavity decreases continuously 90%72.9%90\% \rightarrow 72.9\%.

To rigorously verify this conclusion, we need general mathematical reasoning. For a dd-dimensional space, the volume of a hypersphere with radius rr is:

Vd(r)=πd/2Γ(d2+1)rd V^d(r) = \frac{\pi^{d/2}}{\Gamma\left(\frac{d}{2}+1\right)} r^d

Where:

  • Γ()\Gamma(\cdot) is the gamma function;
  • When dd is a positive integer, Γ(d2+1)=(d2)!\Gamma\left(\frac{d}{2}+1\right) = \left(\frac{d}{2}\right)!, if dd is even;
  • If dd is odd, the gamma function gives an expression containing π\sqrt{\pi}.

We have:

Pshelld=VshelldVunitd=1d(0.9)dP^{d}_\text{shell} = \frac{V^d_\text{shell}}{V^d_\text{unit}} = 1^d - (0.9)^d

For high dimensions d+d \rightarrow +\infty, we have the limit:

limd+1d(0.9)d=10=1\lim_{d \rightarrow +\infty} 1^d - (0.9)^d = 1 - 0 = 1

That is to say, random and uniform samples tend to distribute on the outer shell as the dimension dd grows. This is named the Thin-Shell Effect.

Euclidean Distance

Consider a high-dimensional space Rd\mathbb{R}^d with random vectors:

x,yi.i.d. N(0,Id)x, y \sim \text{i.i.d.} \ \mathcal{N}(0, I_d)

The magnitude of the vector xx is:

x2=i=1dxi2||x||^2 = \sum_{i=1}^d x_i^2

Since xi2χ2(1)x_i^2 \sim \chi^2(1), we have:

x2χ2(d)||x||^2 \sim \chi^2(d)

Here, χ2(k)\chi^2(k) is the Chi-Squared Distribution, defined as the sum of squares of kk mutually independent random variables following a standard normal distribution N(0,1)\mathcal{N}(0,1). In this case, since the squared norm is the sum of dd such variables, the degrees of freedom is k=dk=d.

According to the Law of Large Numbers:

1dx21\frac{1}{d}||x||^2 \to 1

That is, the magnitude of vectors is approximated by xd||x|| \approx \sqrt{d}.

We have the Euclidean distance:

xy2=x2+y22x,y||x - y||^2 = ||x||^2 + ||y||^2 - 2 \langle x, y \rangle

We have known:

x2d,y2d||x||^2 \approx d, \quad ||y||^2 \approx d

The inner product of vectors x,yx, y is:

x,y=i=1dxiyi\langle x, y \rangle = \sum_{i=1}^d x_i y_i

Since:

  • xi,yix_i, y_i are independent
  • E[xiyi]=0E[x_i y_i] = 0
  • Var(xiyi)=1\text{Var}(x_i y_i) = 1

Therefore:

x,yN(0,d)\langle x, y \rangle \sim \mathcal{N}(0, d)

The order of magnitude is O(d)O(\sqrt{d}).

Substituting back into the distance equation, we have:

xy2=d+d2O(d)=2d+O(d)xy22d||x - y||^2 = d + d - 2 \cdot O(\sqrt{d}) = 2d + O(\sqrt{d}) \\ ||x - y||^2 \approx 2d

Thus:

xy2d||x - y|| \approx \sqrt{2d}

Looking at the relative fluctuation:

  • Mean: 2d2d
  • Standard Deviation: O(d)O(\sqrt{d})

Therefore:

stdmeandd=1d0\frac{\text{std}}{\text{mean}} \sim \frac{\sqrt{d}}{d} = \frac{1}{\sqrt{d}} \to 0

Conclusion: The relative fluctuation of distance tends to 0 \rightarrow the distance between all points is almost the same.

Angular Distance

If we project vectors onto the unit hypersphere after 2\ell_2 normalization:

x=y=1||x|| = ||y|| = 1

Then:

xy2=22cosθ||x - y||^2 = 2 - 2\cos\theta

In high dimensions:

cosθ0\cos\theta \approx 0

Therefore:

xy2||x - y|| \approx \sqrt{2}

The angular distance between all vectors is almost equidistant.

In high-dimensional spaces, the norm of random vectors concentrates around a constant due to the law of large numbers. Moreover, the inner product between independent vectors grows only on the order of O(d)O(\sqrt{d}), which is negligible compared to the O(d)O(d) magnitude of squared norms. As a result, pairwise distances concentrate around a constant, leading to the phenomenon that almost all points are approximately equidistant.

Positively Curved Riemannian Space

In Euclidean space, the curse of dimensionality arises from the polynomial growth of volume and the independence structure of coordinates. However, in a positively curved Riemannian manifold, the geometry itself fundamentally reshapes both volume distribution and distance behavior.

A canonical example is the dd-dimensional unit sphere:

Sd1=xRd:x=1\mathbb{S}^{d-1} = { x \in \mathbb{R}^d : ||x|| = 1 }

which is a space with constant positive sectional curvature.

Measure Concentration on the Sphere

Unlike Euclidean space, where volume spreads radially, on the sphere all points lie on a fixed-radius manifold. Thus, “radial shells” are replaced by geodesic bands.

Let us fix a point x0Sd1x_0 \in \mathbb{S}^{d-1} and define a geodesic ball:

B(θ)=xSd1:(x,x0)θB(\theta) = { x \in \mathbb{S}^{d-1} : \angle(x, x_0) \leq \theta }

The measure of this region depends on sind2(θ)\sin^{d-2}(\theta):

Vol(B(θ))0θsind2(ϕ),dϕ\mathrm{Vol}(B(\theta)) \propto \int_0^\theta \sin^{d-2}(\phi), d\phi

Now observe the key phenomenon:

  • When dd is small, sind2(θ)\sin^{d-2}(\theta) is relatively flat
  • When dd \to \infty, sind2(θ)\sin^{d-2}(\theta) becomes sharply peaked at: θ=π2\theta = \frac{\pi}{2}

This leads to: Most of the mass concentrates near the equator orthogonal to any fixed direction.

More formally, for any fixed ϵ>0\epsilon > 0:

P((x,x0)π2ϵ)1as d\mathbb{P}\left( \left| \angle(x, x_0) - \frac{\pi}{2} \right| \leq \epsilon \right) \to 1 \quad \text{as } d \to \infty

This is a manifestation of the concentration of measure phenomenon on positively curved spaces.

The Geometric Interpretation could be described as “Equatorial Collapse”. In Euclidean space, samples concentrate in a thin outer shell. On the sphere, samples instead concentrate in a thin equatorial band.

This can be viewed as a curvature-induced redistribution of measure.

Angular Distance in Positive Curvature

Let:

x,yUniform(Sd1)x, y \sim \text{Uniform}(\mathbb{S}^{d-1})

Then their inner product satisfies:

x,yN(0,1d)\langle x, y \rangle \sim \mathcal{N}\left(0, \frac{1}{d}\right)

Thus:

cosθ=x,y0θπ2\cos\theta = \langle x, y \rangle \approx 0 \quad \Rightarrow \quad \theta \approx \frac{\pi}{2}

So we have the conclusion:

Two random points on a high-dimensional sphere are almost always orthogonal.

This implies:

dgeo(x,y)=arccos(x,y)π2d_{\text{geo}}(x, y) = \arccos(\langle x, y \rangle) \approx \frac{\pi}{2}

Distance Concentration under Geodesic Metric

Unlike Euclidean distance, the natural metric on Sd1\mathbb{S}^{d-1} is the geodesic distance:

dS(x,y)=arccos(x,y)d_{\mathbb{S}}(x, y) = \arccos(\langle x, y \rangle)

Since:

  • x,y0\langle x, y \rangle \to 0
  • fluctuations are O(1/d)O(1/\sqrt{d})

we have:

Var(dS(x,y))0\text{Var}(d_{\mathbb{S}}(x, y)) \to 0

Thus, Geodesic distances also concentrate → almost all pairs of points are at distance π2\frac{\pi}{2}.

Curvature Amplifies Concentration

An important insight is: Positive curvature does not eliminate the curse of dimensionality — it reshapes and often strengthens it.

Why?

  • The sphere is compact → no radial dispersion
  • Curvature forces geodesics to “bend back”
  • Volume grows slower than Euclidean space

As a result: Samples are even more tightly concentrated. Angular discrimination becomes harder. nd most directions become indistinguishable

In positively curved spaces: Measure concentrates on equatorial regions, and angles concentrate around π2\frac{\pi}{2}, geodesic distances become nearly constant. The curse of dimensionality persists, but manifests as angular collapse instead of radial shell concentration.