Maria Nicolae's Website

RSS Feed (What is RSS?)


Back to Blog.

The Hyperbolic Geometry of Einstein's Relativity

How changing your velocity means traversing a hyperbolic space,

Maria Nicolae,

In a previous post, I teased that there is an intimate connection between Einsteinian relativity and hyperbolic (Lobachevskian) geometry, but that it was "a story for another time". Well, that time has come; in this post, I'll elaborate on that remark and illustrate this connection.

In Einsteinian relativity, velocities do not trivially add. Rather, it gets harder to accelerate as you get closer to the speed of light. If you associate with the space of velocities a geometry in which the "distance" between two velocities is how hard you have to accelerate to change from one to the other, this geometry turns out to be hyperbolic. In this post, I'll show that this is the case. Starting from Einstein's postulates, I derive how velocities transform, and use this to derive a geometry of velocity-space expressed in the language of differential geometry as a Riemannian manifold. If you need an overview of differential geometry, I recently wrote about it here. Finally, I show that this geometry is hyperbolic.

How Velocities Transform

To find out how velocities transform, I will start with how spacetime itself transforms, and then derive from that the transformation rule for velocities by differentiating paths through spacetime.

Lorentz Transformations of Spacetime

Einsteinian relativity is axiomatised by Einstein's postulates:

  1. the laws of physics are the same in all inertial frames of reference (inertia), and
  2. the speed of light is the same for all observers.

From these, we can figure out which transformations of spacetime are valid, i.e. keep the laws of physics the same. These are called the Lorentz transformations.

Let the spacetime coordinates of a point before transformation be x=[t,r]T, where t is the time coordinate and r are the spatial coordinates, and let x=[t,r]T be the coordinates after a Lorentz transformation. The first postulate tells us that inertial (constant-velocity) paths through spacetime remain inertial when they transform. Such inertial paths are represented by straight lines in x spacetime coordinates, so Lorentz transformations map straight lines to straight lines. Furthermore, these transformations should keep parallel lines parallel, since whether or not two objects collide should be something that all observers agree upon. Thus, Lorentz transformations are linear transformations, and can be represented by a matrix multiplication

x=Λx
(1)

where Λ is a Lorentz transformation matrix. In block matrix form, this is

[tr]=[ΛttΛtrTΛrtΛrr][tr].
(2)

The second postulate tells us that Lorentz transformations preserve the light cone ||r||=ct. Adding this condition to the previous condition of linearity constrains the Lorentz transformations up to an overall scaling. Obviously, scaling is not a valid Lorentz transformation, since the laws of physics are not scale-invariant. Thus, the last constraint we need is that, for all Lorentz matrices Λ, detΛ=1.

Examples of Lorentz transformations include spatial rotations

Λrotation=[10T0R]
(3)

where R is a spatial rotation matrix, as well as velocity changes (boosts)

Λboost(v)=[γγc2vTγvI+γ1v2vvT]γ=11v2c2
(4)

where v=||v||. All Lorentz transformations can be expressed as compositions of these.

Lorentz Transformations of Velocity

To figure out how Lorentz transformations affect velocity, I'll consider a path through spacetime x(τ)=[t(τ),r(τ)]T and relate the velocity to the derivative of that path. The Lorentz transformation of the spacetime path is

x(τ)=Λx(τ).
(5)

Because differentiation and matrix multiplication are both linear, the derivatives of the original and transformed paths are also related by the Lorentz transformation:

dxdτ=Λdxdτ.
(6)

Expanding this into a block matrix, as per Equation 2, we obtain

[dtdτdrdτ]=[ΛttΛtrTΛrtΛrr][dtdτdrdτ].
(7)

With the original and transformed velocities being

v=drdtv=drdt,
(8)

and dividing both sides of Equation 7 by dt/dτ,

[dtdtdtdtv]=[ΛttΛtrTΛrtΛrr][1v].
(9)

Thus,

v=1Λtt+Λtr·v(Λrt+Λrrv);
(10)

the "recipe" for transforming a velocity vector v is:

  1. prepend 1 to the vector v to get a spacetime vector,
  2. multiply this vector by the Lorentz transformation matrix Λ, and
  3. turn the result into a velocity vector by dividing the spatial components (last ones) by the time component (the first).

This is a projective transformation of velocity space, named as such because the last step projects the spacetime vector onto the spatial subspace.

The Geometry of Velocity-Space

I now associate with the set of velocities a geometry, in which the "distance" between two velocities is how much you have to accelerate to change from one velocity to the other:

s(va,vb)=vavbadτ
(11)

where a is the proper acceleration and τ is the proper time, which are the acceleration and time measured in the moving reference frame. We want to express this in terms of a Riemannian metric

s(va,vb)=vavbds=vavbdv·g(v)dv.
(12)

To do this, there are two properties of the velocity distance metric that we can use. First is the nonrelativistic limit, for which

s(0,dv)=||dv||
(13)

and therefore

g(0)=I.
(14)

The second property is that this distance is preserved by Lorentz transformations, which are the isometries of this geometry. Namely, given a Lorentz transformation of the start and end velocities (va,vb),

s(va,vb)=s(va,vb).
(15)

To compute ds in a way that will reveal the form of the metric tensor g(v), I use this second property for the Lorentz transformation Λ=Λboost(v), in terms of which v=0, which then lets me apply the first property:

ds=s(v,v+dv)=s(0,dv)=||dv||.
(16)

The transformed differential velocity dv in this expression can be evaluated as a differential of Equation 10:

dv=1Λtt+Λtr·(v+dv)[Λrt+Λrr(v+dv)]=1Λtt+Λtr·v+Λtr·dv(Λrt+Λrrv0+Λrrdv)=1Λtt+Λtr·vΛrrdv.
(17)

Substituting the form of the boost matrix from Equation 4, this becomes

dv=1γγv2c2(I+γ1v2vvT)dv=1γ(1v2c2)(I+γ1v2vvT)dv=γ(I+γ1v2vvT)dv
(18)

Substituting this back into Equation 16,

ds=||γ(I+γ1v2vvT)dv||.
(19)

Then, by squaring both sides and replacing the vector norm with an explicit dot product

ds2=||γ(I+γ1v2vvT)dv||2=γ2(I+γ1v2vvT)dv·(I+γ1v2vvT)dv=γ2dv·(I+γ1v2vvT)T(I+γ1v2vvT)dv=dv·g(v)dv,
(20)

we finally find the metric tensor

g(v)=γ2(I+γ1v2vvT)T(I+γ1v2vvT)=γ2(I+γ1v2vvT)2=γ2(I2+2γ2v2vvT+γ22γ+1v4vvTvvT)=γ2(I2+2γ2v2vvT+γ22γ+1v2vvT)=γ2(I2+γ21v2vvT)g(v)=γ2(I2+γ2c2vvT).
(21)

Velocity-Space is Hyperbolic

By construction, the metric in Equation 21 is spherically symmetric, so from this point on, I work with two-dimensional velocities v=[vx,vy]T for simplicity. The differential line element for this metric is

ds2=γ2[(1+γ2c2vx2)dvx2+(1+γ2c2vy2)dvy2+2γ2c2vxvydvxdvy]=γ2[dvx2+dvy2+γ2c2(vxdvx+vydvy)2].
(22)

To analyse this, I first convert it to polar coordinates (v,θ) for which

vx=vcosθvy=vsinθ.
(23)

The differentials of these are

dvx=cos(θ)dvvsin(θ)dθdvy=sin(θ)dv+vcos(θ)dθ,
(24)

and substituting these into Equation 22 gives us

ds2=γ2[(cos(θ)dvvsin(θ)dθ)2+(sin(θ)dv+vcos(θ)dθ)2+γ2c2((vcosθ)(cos(θ)dvvsin(θ)dθ)+(vsinθ)(sin(θ)dv+vcos(θ)dθ))2]=γ2[cos2(θ)dv2+v2sin2(θ)dθ2vsin(θ)cos(θ)dvdθ+sin2(θ)dv2+v2cos2(θ)dθ2+vsin(θ)cos(θ)dvdθ+γ2c2(vcos2(θ)dvv2sin(θ)cos(θ)dθ+vsin2(θ)dv+v2sin(θ)cos(θ)dθ)2]=γ2[dv2+v2dθ2+γ2c2(vdv)2]=γ2[(1+γ2v2c2)dv2+v2dθ2]ds2=γ4dv2+γ2v2dθ2.
(25)

Finally, to make the geometry clear, I switch to "azimuthal equidistant" coordinates (ρ,θ), where I define ρ as the distance from the origin

ρ=0vds=0vγ(v)2dv=0v11v2c2dvρ=ctanh1vc.
(26)

This is the function of velocity for which the one-dimensional relativistic addition of velocities is linear

ρ(vsum)=ρ(v1)+ρ(v2);
(27)

the nondimensionalised version of this quantity, ζ=ρ/c, is known as the "rapidity" in the parlance of Einsteinian relativity. From this, we obtain

v=ctanhρcdv=1cosh2ρcdργ=coshρc.
(28)

Substituting these into Equation 25, we obtain

ds2=cosh4(ρc)1cosh4ρcdρ2+c2tanh2(ρc)cosh2(ρc)dθ2ds2=dρ2+c2sinh2(ρc)dθ2.
(29)

We can see from this line element that this is the geometry of the hyperbolic plane, which you may recognise from the previous post about differential geometry. Circles around the origin of radius ρ have circumferences that increase exponentially with ρ, rather than linearly (as in the Euclidean plane) or sinusoidally (as on the surface of a sphere). We also see from this that c is the characteristic scale of the curvature of this hyperbolic plane; for ρc, the geometry is approximately Euclidean.

What This Tells Us

The fact that there is this connection between Einsteinian relativity and hyperbolic geometry tells us, first and foremost, that mathematical formalisms for one apply to the other. The Cartesian velocity coordinates v in the ball of radius c, together with the metric in Equation 21, is a representation of hyperbolic geometry. Specifically, it is what is called the Beltrami-Klein model of hyperbolic geometry, which has the unique property that it represents geodesics of the hyperbolic geometry as straight lines in the model's Cartesian coordinates. Its isometries, then, must be those projective transformations (mapping straight lines to straight lines) which preserve the c ball; as we have seen, these are the Lorentz transformations.

As an example of what hyperbolic geometry can teach us about Einsteinian relativity, there is the fact that hyperbolic geometry, like all curved spaces, exhibits holonomy. This is the phenomenon where, if you move around in a curved space without turning, you can nonetheless end up facing a different direction to what you started in. Consider, for example, moving on the curved surface of our spherical Earth: if you start out facing north at 0°N 0°E, walk forward to the North Pole, then walk right to the equator (0°N 90°E), and finally walk backwards to 0°N 0°E, you're back where you started, and are now facing east instead of north, despite having never turned your body. A similar phenomenon happens in hyperbolic geometry, and therefore in Einsteinian relativity: the combined effect of two boosts is not necessary a pure boost, so it's possible to rotate simply by accelerating in different directions, though the rotation is miniscule if your velocity boosts are much smaller than c. Physicists call this the Wigner rotation.