The Hyperbolic Geometry of Einstein's Relativity
How changing your velocity means traversing a hyperbolic space,
Maria Nicolae,
In a previous post, I teased that there is an intimate connection between Einsteinian relativity and hyperbolic (Lobachevskian) geometry, but that it was "a story for another time". Well, that time has come; in this post, I'll elaborate on that remark and illustrate this connection.
In Einsteinian relativity, velocities do not trivially add. Rather, it gets harder to accelerate as you get closer to the speed of light. If you associate with the space of velocities a geometry in which the "distance" between two velocities is how hard you have to accelerate to change from one to the other, this geometry turns out to be hyperbolic. In this post, I'll show that this is the case. Starting from Einstein's postulates, I derive how velocities transform, and use this to derive a geometry of velocity-space expressed in the language of differential geometry as a Riemannian manifold. If you need an overview of differential geometry, I recently wrote about it here. Finally, I show that this geometry is hyperbolic.
How Velocities Transform
To find out how velocities transform, I will start with how spacetime itself transforms, and then derive from that the transformation rule for velocities by differentiating paths through spacetime.
Lorentz Transformations of Spacetime
Einsteinian relativity is axiomatised by Einstein's postulates:
- the laws of physics are the same in all inertial frames of reference (inertia), and
- the speed of light is the same for all observers.
From these, we can figure out which transformations of spacetime are valid, i.e. keep the laws of physics the same. These are called the Lorentz transformations.
Let the spacetime coordinates of a point before transformation be , where is the time coordinate and are the spatial coordinates, and let be the coordinates after a Lorentz transformation. The first postulate tells us that inertial (constant-velocity) paths through spacetime remain inertial when they transform. Such inertial paths are represented by straight lines in spacetime coordinates, so Lorentz transformations map straight lines to straight lines. Furthermore, these transformations should keep parallel lines parallel, since whether or not two objects collide should be something that all observers agree upon. Thus, Lorentz transformations are linear transformations, and can be represented by a matrix multiplication
where is a Lorentz transformation matrix. In block matrix form, this is
The second postulate tells us that Lorentz transformations preserve the light cone . Adding this condition to the previous condition of linearity constrains the Lorentz transformations up to an overall scaling. Obviously, scaling is not a valid Lorentz transformation, since the laws of physics are not scale-invariant. Thus, the last constraint we need is that, for all Lorentz matrices , .
Examples of Lorentz transformations include spatial rotations
where is a spatial rotation matrix, as well as velocity changes (boosts)
where . All Lorentz transformations can be expressed as compositions of these.
Lorentz Transformations of Velocity
To figure out how Lorentz transformations affect velocity, I'll consider a path through spacetime and relate the velocity to the derivative of that path. The Lorentz transformation of the spacetime path is
Because differentiation and matrix multiplication are both linear, the derivatives of the original and transformed paths are also related by the Lorentz transformation:
Expanding this into a block matrix, as per Equation 2, we obtain
With the original and transformed velocities being
and dividing both sides of Equation 7 by ,
Thus,
the "recipe" for transforming a velocity vector is:
- prepend to the vector to get a spacetime vector,
- multiply this vector by the Lorentz transformation matrix , and
- turn the result into a velocity vector by dividing the spatial components (last ones) by the time component (the first).
This is a projective transformation of velocity space, named as such because the last step projects the spacetime vector onto the spatial subspace.
The Geometry of Velocity-Space
I now associate with the set of velocities a geometry, in which the "distance" between two velocities is how much you have to accelerate to change from one velocity to the other:
where is the proper acceleration and is the proper time, which are the acceleration and time measured in the moving reference frame. We want to express this in terms of a Riemannian metric
To do this, there are two properties of the velocity distance metric that we can use. First is the nonrelativistic limit, for which
and therefore
The second property is that this distance is preserved by Lorentz transformations, which are the isometries of this geometry. Namely, given a Lorentz transformation of the start and end velocities ,
To compute in a way that will reveal the form of the metric tensor , I use this second property for the Lorentz transformation , in terms of which , which then lets me apply the first property:
The transformed differential velocity in this expression can be evaluated as a differential of Equation 10:
Substituting the form of the boost matrix from Equation 4, this becomes
Substituting this back into Equation 16,
Then, by squaring both sides and replacing the vector norm with an explicit dot product
we finally find the metric tensor
Velocity-Space is Hyperbolic
By construction, the metric in Equation 21 is spherically symmetric, so from this point on, I work with two-dimensional velocities for simplicity. The differential line element for this metric is
To analyse this, I first convert it to polar coordinates for which
The differentials of these are
and substituting these into Equation 22 gives us
Finally, to make the geometry clear, I switch to "azimuthal equidistant" coordinates , where I define as the distance from the origin
This is the function of velocity for which the one-dimensional relativistic addition of velocities is linear
the nondimensionalised version of this quantity, , is known as the "rapidity" in the parlance of Einsteinian relativity. From this, we obtain
Substituting these into Equation 25, we obtain
We can see from this line element that this is the geometry of the hyperbolic plane, which you may recognise from the previous post about differential geometry. Circles around the origin of radius have circumferences that increase exponentially with , rather than linearly (as in the Euclidean plane) or sinusoidally (as on the surface of a sphere). We also see from this that is the characteristic scale of the curvature of this hyperbolic plane; for , the geometry is approximately Euclidean.
What This Tells Us
The fact that there is this connection between Einsteinian relativity and hyperbolic geometry tells us, first and foremost, that mathematical formalisms for one apply to the other. The Cartesian velocity coordinates in the ball of radius , together with the metric in Equation 21, is a representation of hyperbolic geometry. Specifically, it is what is called the Beltrami-Klein model of hyperbolic geometry, which has the unique property that it represents geodesics of the hyperbolic geometry as straight lines in the model's Cartesian coordinates. Its isometries, then, must be those projective transformations (mapping straight lines to straight lines) which preserve the ball; as we have seen, these are the Lorentz transformations.
As an example of what hyperbolic geometry can teach us about Einsteinian relativity, there is the fact that hyperbolic geometry, like all curved spaces, exhibits holonomy. This is the phenomenon where, if you move around in a curved space without turning, you can nonetheless end up facing a different direction to what you started in. Consider, for example, moving on the curved surface of our spherical Earth: if you start out facing north at 0°N 0°E, walk forward to the North Pole, then walk right to the equator (0°N 90°E), and finally walk backwards to 0°N 0°E, you're back where you started, and are now facing east instead of north, despite having never turned your body. A similar phenomenon happens in hyperbolic geometry, and therefore in Einsteinian relativity: the combined effect of two boosts is not necessary a pure boost, so it's possible to rotate simply by accelerating in different directions, though the rotation is miniscule if your velocity boosts are much smaller than . Physicists call this the Wigner rotation.