4 Geodesics

This chapter is a continuation of the idea of parallel transport in Section 3.4. I debated whether to move that section to this chapter, but ultimately decided linking covariant derivatives and parallel transport was necessary as motivation. But perhaps it would be worthwhile to read that section again now.

In this chapter we develop the theory of geodesics, which generalise the notion of a ‘straight line’. To put the question provocatively: what does it mean to have a straight line in a curved space? Consider this question for

𝕊^{2}

if somebody asked you to fly an aeroplane in a straight line between two cities. A reasonable definition would be a flight path that did not require steering the plane’s control stick. This is the same as the idea of ‘walking without turning’ that we used previously in our thought experiment. Such paths are called geodesics. On the other hand, this is Riemannian geometry. In euclidean space, the shortest path between two points is a straight line. Perhaps this should be taken as the defining feature of straight lines. It turns out that this length-minimising property is also true of geodesics, so that the two possible definitions coincide.

First we will formalise the definition of geodesic. Although we mostly restrict ourselves to consideration of the Levi-Civita connection, we do examine the mutual dependence of geodesics and connections. Using the length of a curve we define a distance function on a Riemannian manifold as the infimum of the lengths of all smooth curves between two points. We will show that geodesics are critical points of the length functional, using a proof that is reminiscent of the proof that minimal surfaces are critical for area from Section 1.6. Showing that they are locally length minimising, surprisingly, is significantly more difficult. This leads us to construct special coordinates, so-called normal coordinates, in which the geodesic structure of the Riemannian manifold is a little clearer.

4.1 Straight Lines

Adding to the discussion above, we understand that ‘walking without turning’ along a curve

α

means that the tangent of the curve is parallel. The technicality is that

α^{'}

is not a vector field on the manifold, but this is not a problem because it is a vector field along the curve

α

and we know that that is sufficient to define its covariant derivative.

In a chart we may write

α = (α^{i})

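and use the Christoffel coefficients to describe the covariant derivative:

\nabla_{α^{'}} α^{'} = (\frac{d^{2} α^{k}}{dt^{2}} + Γ_{i j}^{k} (α) \frac{d α^{i}}{dt} \frac{d α^{j}}{dt}) ∂_{k} = 0 .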

This is generally known as the geodesic equation. It is of course the parallel transport equation (3.37) with

Y = α^{'}

. But since the vector field in this case is linked to the curve, we now have a second-order nonlinear system of ODEs. However we can use a trick to write this as a system of first order ODEs:

\frac{d α^{k}}{dt} = v^{k}, \frac{d v^{k}}{dt} = - Γ_{i j}^{k} (α) v^{i} v^{j} .

From this we conclude the local existence and uniqueness of geodesics in a neighbourhood of every point and in every direction. Specifically, given a point

p \in M

and a direction

v \in T_{p} M

existence implies that there is a curve

γ_{p, v} : (a, b) \to M

with

γ_{p, v} (0) = p

and

γ_{p, v}^{'} (0) = v

. We will drop the

p

when it is unambiguous from context. We assume that

γ_{p, v} : (a, b) \to M

is maximal in the sense that any other solution is a restriction of

γ_{p, v}

to some smaller domain

(a^{'}, b^{'}) \subset (a, b)

. This assumption may always be achieved, because uniqueness makes it possible to glue together any two solutions into a ‘longer’ one. Related to this is the observation that if

\tilde{p} = γ_{p, w} (t_{0})

and

\tilde{w} = γ_{p, w}^{'} (t_{0})

, then

γ_{\tilde{p}, \tilde{w}} (t) = γ_{p, w} (t + t_{0}) .

Example 4.2 (Helicoid). Consider the helicoid with the inherited metric from $ℝ^{3}$ and the tangent connection. We consider again in this example two (sets of) curves: radial lines and helices. First there are radial curves $α (t) = Φ (t, v) = (t cos v, t sin v, bv)$ for some $v \in ℝ$ . The tangent connection is the projection of the directional derivative in $ℝ^{3}$ to the tangent plane of the helicoid. The directional derivative in the direction $α^{'}$ is the derivative with respect to $t$ . Hence $\nabla_{α^{'}}^{euc} α^{'} = α^{″} = 0$ , because $α$ is linear in $t$ . Its projection to the tangent plane is therefore also zero, so radial lines are geodesics of the helicoid.

On the other hand we have the helices $β (t) = Φ (u, t) = (u cos t, u sin t, bt)$ for some $u \in ℝ$ . As above $\nabla_{β^{'}}^{euc} β^{'} = β^{″}$ but this time it is not zero. Now we reuse our previous calculations. As remarked upon in Example 1.23, $β^{″}$ already lies in the tangent plane. Hence

\nabla_{β^{'}}^{⊤} β^{'} = \underset{T_{β} Σ}{proj} β^{″} = β^{″} = (- u cos t, - u sin t, 0) \neq 0 .

This shows that the helix is not a geodesic of the helicoid, unless $u = 0$ (in which case it is the central axis of the helicoid).
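The two computations in this example are easy to check numerically. The following is a minimal sketch in plain Python, assuming the illustrative value $b = 1$; the names `B`, `tangential_part` and `helix_tangential_acceleration` are ours and not notation from the text. It projects the euclidean acceleration onto the tangent plane of the helicoid and measures what is left.

```python
import math

B = 1.0  # pitch parameter b of the helicoid; an arbitrary illustrative value


def cross(a, b):
    """Cross product in R^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Euclidean inner product."""
    return sum(x * y for x, y in zip(a, b))


def norm(w):
    """Euclidean norm."""
    return math.sqrt(dot(w, w))


def tangential_part(w, u, v):
    """Project w onto the tangent plane of the helicoid at Phi(u, v)."""
    phi_u = (math.cos(v), math.sin(v), 0.0)
    phi_v = (-u * math.sin(v), u * math.cos(v), B)
    n = cross(phi_u, phi_v)  # surface normal, not normalised
    scale = dot(w, n) / dot(n, n)
    return tuple(wi - scale * ni for wi, ni in zip(w, n))


def helix_tangential_acceleration(u, t):
    """Tangential part of beta'' for the helix beta(t) = Phi(u, t)."""
    beta_dd = (-u * math.cos(t), -u * math.sin(t), 0.0)
    return tangential_part(beta_dd, u, t)


# Radial lines have alpha'' = 0, so the projection vanishes trivially.
radial = norm(tangential_part((0.0, 0.0, 0.0), 1.5, 0.3))
# For the helix the projected acceleration has norm |u|.
helix = norm(helix_tangential_acceleration(2.0, 0.7))
axis = norm(helix_tangential_acceleration(0.0, 0.7))
```

The helix value agrees with $∥ β^{″} ∥ = | u |$ because $β^{″}$ is already tangent, so the projection changes nothing; only the central axis $u = 0$ gives zero.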

How do the geodesics of a manifold depend on the choice of covariant derivative? Following Lemma 3.23 we consider the connections

\nabla

and

\tilde{\nabla} = \nabla + A

. In a chart we may write

A (∂_{i}, ∂_{j}) = A_{i j}^{k} ∂_{k}

and since

A

is

C^{\infty}

-bilinear these functions completely determine

A

. Moreover

{\tilde{Γ}}_{i j}^{k} = Γ_{i j}^{k} + A_{i j}^{k}

. Let

α

be a geodesic of

\nabla

. Then the geodesic equation for

\tilde{\nabla}

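reads

\frac{d^{2} α^{k}}{dt^{2}} + {\tilde{Γ}}_{i j}^{k} \frac{d α^{i}}{dt} \frac{d α^{j}}{dt} = \frac{d^{2} α^{k}}{dt^{2}} + Γ_{i j}^{k} \frac{d α^{i}}{dt} \frac{d α^{j}}{dt} + A_{i j}^{k} \frac{d α^{i}}{dt} \frac{d α^{j}}{dt} = A_{i j}^{k} \frac{d α^{i}}{dt} \frac{d α^{j}}{dt} .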

Thus

α

is also a geodesic of

\tilde{\nabla}

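if and only if this quantity is zero. What is obscured by index notation is the symmetry here. Using a relabelling of summation indices

A_{i j}^{k} \frac{d α^{i}}{dt} \frac{d α^{j}}{dt} = \frac{1}{2} (A_{i j}^{k} + A_{j i}^{k}) \frac{d α^{i}}{dt} \frac{d α^{j}}{dt} .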

This can of course be zero for some curve

α

. If

A

is antisymmetric,

A (X, Y) = - A (Y, X)

, then this quantity is zero for all curves, and in particular the two connections have the same geodesics. Conversely, we know that every vector

w \in T_{p} M

at every point is the tangent vector to some geodesic of

\nabla

. If the two connections have the same geodesics, then this forces

A

to be antisymmetric. Therefore we have proved

In particular, if we absorb the torsion of a connection

\tilde{\nabla} = \nabla - \frac{1}{2} T

, then this does not change the geodesics. This justifies our comment at the end of Chapter 3 that torsion is often unimportant.

The geodesic equation says that the curve is parallel transporting its own tangent vector. If the connection is metric-compatible then this means that the length of the tangent vector is not changing. In Riemannian geometry, we call

∥ α^{'} ∥_{g}

the speed of the curve

α

For any point there is a special geodesic called the constant geodesic. It is the geodesic associated to the zero vector. One can check easily that

γ_{0} (t) = p

for all

t \in ℝ

solves the geodesic equation with the initial condition

γ_{0} (0) = p

and

γ_{0}^{'} (0) = 0

. The interpretation is that walking in a straight line with initial speed zero means standing still. As a function

ℝ \to M

it is constant, hence the name. This degenerate example is nonetheless important to include.

Notice that a geodesic is a parameterised curve. To give an example, both

α (t) = (t, 0)

and

β (t) = (2 t, 0)

are geodesics in

ℝ^{2}

. Their images in

ℝ^{2}

are the same, but they are not equal as parameterised curves because they have different speeds. But our intuition tells us that a straight line is about direction, not speed.

The following lemma tells us how geodesics in the same direction with different initial speeds are related. It says that they have the same image and differ only by a constant rescaling factor. For this reason, geodesics that only differ by speed are often conflated with one another.

Proof. We deal with the case $c = 0$ separately.¹ On the left hand side we have $γ_{w} (0 t) = γ_{w} (0) = p$ for all $t \in ℝ$ . On the right we have the constant geodesic $γ_{0} (t)$ , which we know is equal to $p$ for all time. Hence the two sides are defined for all $t \in ℝ$ and equal.

Now assume $c \neq 0$ . We use the first-order version of the geodesic equation, because it makes the idea a little clearer. As per definition, $γ_{w}$ is the geodesic with an initial direction of $w$ . Let $v = γ_{w}^{'}$ . It obeys

\frac{d γ_{w}^{k}}{dt} = v^{k} (t), \frac{d v^{k}}{dt} = - Γ_{i j}^{k} (γ_{w} (t)) v^{i} (t) v^{j} (t), γ_{w} (0) = p, v (0) = w .

Now consider the reparameterised curve $α (t) = γ_{w} (ct)$ . The starting point has not changed: $α (0) = γ_{w} (0) = p$ . But the velocity has changed:

\frac{d α^{k}}{dt} (t) = \frac{d}{dt} [γ_{w}^{k} (ct)] = c \frac{d γ_{w}^{k}}{dt} (ct) = c v^{k} (ct) = : u^{k} (t) .

In particular $u (0) = cv (0) = cw$ . Finally, we see for the second ODE

\begin{array}{l} \frac{d u^{k}}{dt} = c \frac{d v^{k} (ct)}{dt} = c^{2} \frac{d v^{k}}{dt} (ct) = - c^{2} Γ_{i j}^{k} (γ_{w} (ct)) v^{i} (ct) v^{j} (ct) = - Γ_{i j}^{k} (α (t)) u^{i} (t) u^{j} (t) . \end{array}

Thus we see that $(α, u)$ is a solution of the same ODE and IVP that $γ_{cw} (t)$ solves. By uniqueness and maximality of $γ_{cw}$ , we conclude that $α$ is the restriction of $γ_{cw}$ . But we can also run this argument in the reverse direction and conclude that $\tilde{α} (t) = γ_{cw} (c^{- 1} t)$ is a restriction of $γ_{w}$ . The conclusion must be that one exists if and only if the other does, and they are equal. □

A particular case is for

c = - 1

. The lemma says

γ_{- w} (t) = γ_{w} (- t)

: the geodesic in the reverse direction is the same as walking backwards. If we choose a time

t_{0}

and set

\tilde{p} = γ_{p, w} (t_{0})

and

\tilde{w} = γ_{p, w}^{'} (t_{0})

, then we have

γ_{\tilde{p}, - \tilde{w}} (t_{0}) = γ_{\tilde{p}, \tilde{w}} (- t_{0}) = γ_{p, w} (0) = p .

This says that if you walk along a geodesic for a certain amount of time

t_{0}

, turn around and walk back for the same amount of time, then you end back where you started. These properties might seem obvious, but it is important to question whether our intuition carries over to the general setting.

Finally, what can be said about geodesics in an immersed manifold? For this question to be sensible, we should use the tangent connection on

M

from Definition 3.59. To recap, we have a Riemannian manifold

N

with the Levi-Civita connection and

M

is Riemannian immersed in

N

. The tangent connection on

M

is the projection of

\nabla^{N}

to the tangent space of

M

. A curve

α

in

M

is a geodesic iff

\nabla_{α^{'}}^{⊤} α^{'} = \underset{T_{α} M}{proj} \nabla_{α^{'}}^{N} α^{'} = 0 .

Because

\nabla_{α^{'}} α^{'}

is a second-order derivative of

α

we interpret it as a type of acceleration. The above equation says that a curve is a geodesic of

M

when its acceleration in

N

is always perpendicular to

M

We can relate this back to normal curvature in Section 1.5. In that situation we have

N = ℝ^{3}

and the connection is just ordinary directional derivatives

\nabla^{euc}

. Therefore

\nabla_{α^{'}}^{N} α^{'} = α^{″}

. The acceleration

α^{″}

of a curve in

ℝ^{3}

has components in the

T

and

N

directions, Equation (1.28). If we project this onto the tangent plane then we get

Hence we see that

α

is a geodesic of the surface

M

if and only if

α

is parameterised with constant speed and

N

, the normal of the curve, is perpendicular to

M

. Since any regular curve can be reparameterised by arc-length, the first condition is only a technical point. If

N

is perpendicular to

M

this means that the angle between

N

and the surface normal

ν

is either zero or π. In other words, the normal curvature is equal, up to sign, to the curvature of

α

. Traditionally, one defines the geodesic curvature of

α

This is a measure of how far an arc-length parameterised curve is from being a geodesic. Alternatively, the observation

κ_{n}^{2} + κ_{g}^{2} = κ^{2}

(since the normal and geodesic curvatures are projections of the curvature vector) leads us to say a curve is a geodesic if its curvature is entirely normal.

Example 4.9 (Tangent Connection). Consider $𝕊^{2}$ . Consider a geodesic $α$ and suppose without loss of generality that $α$ is parameterised by arc-length. From the above discussion, a geodesic $α$ must have $α^{″}$ in the normal direction to the sphere, $N \cdot ν = \pm 1$ . For the sphere $ν = α$ . Therefore $ν^{'} = α^{'} = T$ . On the other hand $N^{'} = - κT + τB$ , using the Frenet equations, and since $N = \pm ν$ we can conclude that $τ \equiv 0$ . This proves that the geodesics of the sphere lie in the plane containing the normal and their tangent vector. These are exactly the great circles.
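As a sanity check of this example and of the relation $κ_{n}^{2} + κ_{g}^{2} = κ^{2}$, here is a small sketch in plain Python. It is only an illustration under our own assumptions: the circles of constant colatitude `theta0` on the unit sphere are parameterised by arc-length, the outward normal $ν = α$ is used, `kappa_g` is computed without sign, and all function names are hypothetical.

```python
import math


def dot(a, b):
    """Euclidean inner product."""
    return sum(x * y for x, y in zip(a, b))


def latitude_circle(theta0, s):
    """Point and acceleration of the arc-length parameterised circle of colatitude theta0."""
    r = math.sin(theta0)  # euclidean radius of the circle
    c, sn = math.cos(s / r), math.sin(s / r)
    point = (r * c, r * sn, math.cos(theta0))
    accel = (-c / r, -sn / r, 0.0)
    return point, accel


def curvatures(theta0, s=0.4):
    """Return (kappa_n, |kappa_g|, kappa) at parameter s, using nu = point."""
    point, accel = latitude_circle(theta0, s)
    kappa_n = dot(accel, point)  # normal component of the acceleration
    tangential = [a - kappa_n * p for a, p in zip(accel, point)]
    kappa_g = math.sqrt(dot(tangential, tangential))
    kappa = math.sqrt(dot(accel, accel))
    return kappa_n, kappa_g, kappa


equator = curvatures(math.pi / 2)      # a great circle
small_circle = curvatures(math.pi / 3)  # not a great circle
```

For the equator the tangential part of the acceleration vanishes, as it must for a geodesic. For the circle of colatitude $π ∕ 3$ it does not: the unsigned geodesic curvature comes out as $cot (π ∕ 3)$.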

4.2 The Hyperbolic Plane

So far we have seen the example of

𝕊^{2}

and

ℝ^{2}

. We have the sense that these spaces are special. There is another important two-dimensional Riemannian manifold: the hyperbolic plane

ℍ^{2}

. Unlike the sphere, a theorem of Hilbert proves that the hyperbolic plane cannot be isometrically immersed into

ℝ^{3}

.² Therefore we really need to use the tools of manifolds and charts to understand this space; there can be no resorting to geometric tricks in euclidean space.

This space was first discovered in connection to the ‘parallel postulate’ of Euclid. Euclid gave five postulates³, which we would call axioms,

Clearly one of these is not like the others. The fifth postulate is called the parallel postulate, because it is equivalent to Playfair’s axiom:

There are several interesting geometries that come from replacing this axiom. If there are no parallel lines then we get projective geometry (one intersection) or spherical geometry (two intersections). If there is more than one parallel line then we get hyperbolic geometry.

For our purposes we will introduce the hyperbolic plane by fiat. Like the euclidean plane it can be covered by a single coordinate chart. There are several ‘models’ of the hyperbolic plane, but we will use the ‘half-plane’ model where the geodesics have the easiest formulas.

The first common misconception to address is the ‘boundary’ at

y = 0

. As a Riemannian manifold, there is no boundary. If you lived in the hyperbolic plane, as you tried to approach

y = 0

at constant speed you would find that the coordinates were changing at an ever decreasing rate. If we explain this using vectors in coordinates, for

y

small, a unit-length vector has very small coefficients, and therefore moving at unit-speed makes only a small change in coordinates. Conversely, for large

y

, a unit-length vector has large coefficients. This is analogous to looking at the world in the Mercator map, where a plane on the equator changes its longitude far less than a plane near the poles even at the same speed.

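Let us compute the Christoffel coefficients for the Levi-Civita connection of the metric

g_{i j} = y^{- 2} δ_{i j} .

Only two derivatives of the metric are non-zero

\partial_{2} g_{1 1} = \partial_{2} g_{2 2} = - 2 y^{- 3}, Γ_{1 2}^{1} = Γ_{2 1}^{1} = - y^{- 1}, Γ_{1 1}^{2} = y^{- 1}, Γ_{2 2}^{2} = - y^{- 1},

and all other Christoffel coefficients vanish.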

This means that we can try to solve the geodesic equation. It is certainly possible to solve it directly, but we will solve it in a special case and then use isometries to obtain the full solution, as this is geometrically more interesting. The special case is the following: let

p = (0, 1)

be the starting point of the geodesic and

v = \partial_{2}

be its initial direction. If the geodesic is

α (t) = (x (t), y (t))

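then the geodesic equation is

x^{″} - \frac{2}{y} x^{'} y^{'} = 0, y^{″} + \frac{1}{y} ({(x^{'})}^{2} - {(y^{'})}^{2}) = 0, x (0) = 0, y (0) = 1, x^{'} (0) = 0, y^{'} (0) = 1 .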

Notice that if

(x, y)

is a solution to these ODEs, so too is

(- x, y)

. Moreover,

- x (0) = - 0 = 0

and

- x^{'} (0) = - 0 = 0

, so they have the same initial condition. Therefore they must be equal, in other words

x (t) = - x (t)

, which implies

x (t) \equiv 0

. The second ODE now simplifies to

y^{″} - \frac{{(y^{'})}^{2}}{y} = 0 \Leftrightarrow {(\ln y)}^{″} = 0 \Rightarrow y (t) = e^{t},

again using the initial conditions. Thus the unique geodesic through

(0, 1)

with initial vector

\partial_{2}

is the vertical line

γ (t) = (0, e^{t})
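The closed-form solution can be compared against a direct numerical integration of the first-order geodesic system. The sketch below is an illustration rather than part of the argument: it assumes the half-plane metric $g_{i j} = y^{- 2} δ_{i j}$, for which the system is $x^{'} = v_{x}$, $y^{'} = v_{y}$, $v_{x}^{'} = 2 v_{x} v_{y} ∕ y$, $v_{y}^{'} = (v_{y}^{2} - v_{x}^{2}) ∕ y$, and uses a classical Runge-Kutta step. The function names and the step count are our own choices.

```python
import math


def geodesic_rhs(state):
    """Right-hand side of the first-order geodesic system in the half-plane model."""
    x, y, vx, vy = state
    return (vx, vy, 2.0 * vx * vy / y, (vy * vy - vx * vx) / y)


def rk4_step(state, h):
    """One classical fourth-order Runge-Kutta step of size h."""
    def shifted(s, k, c):
        return tuple(si + c * ki for si, ki in zip(s, k))

    k1 = geodesic_rhs(state)
    k2 = geodesic_rhs(shifted(state, k1, h / 2))
    k3 = geodesic_rhs(shifted(state, k2, h / 2))
    k4 = geodesic_rhs(shifted(state, k3, h))
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def integrate_geodesic(p, v, t_end, steps=1000):
    """Integrate the geodesic starting at p with velocity v up to time t_end."""
    state = (p[0], p[1], v[0], v[1])
    h = t_end / steps
    for _ in range(steps):
        state = rk4_step(state, h)
    return state


# Initial direction d_2 at (0, 1): compare with the vertical line (0, e^t).
vertical = integrate_geodesic((0.0, 1.0), (0.0, 1.0), 1.0)
# Initial direction d_1 at (0, 1): the solution should stay on x^2 + y^2 = 1.
horizontal = integrate_geodesic((0.0, 1.0), (1.0, 0.0), 1.0)
```

The first run reproduces $(0, e^{t})$ at $t = 1$ up to discretisation error. The second run starts in the direction $\partial_{1}$ and stays on the unit circle, which anticipates the semicircular geodesics found below.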

In this argument we already caught a glimpse of the method we will use to find all other geodesics. We used the reflection

R : (x, y) \mapsto (- x, y)

. This preserved the geodesic equations because this is in fact an isometry of

ℍ^{2}

. After we strip away the terminology of Definitions 3.4 and 3.9, what we see is that an isometry is a diffeomorphism, in particular a bijective map, such that the pushforward of vectors doesn’t change the inner product. The pushforward of

v \in T_{(x, y)} ℍ^{2}

by reflection is

Another obvious isometry is horizontal translation

(x, y) \mapsto (x + a, y)

. From this we can conclude that the geodesic through

(a, 1)

with initial direction

\partial_{2}

is

γ (t) = (a, e^{t})

. These are simply all the vertical lines.

The translation

(x, y) \mapsto (x, y + a)

is not an isometry. For one thing, it is not even a well-defined map on

ℍ^{2}

, which only has the points of the upper half-plane. But even between points where it is well defined, we see that the metric is changed because

g_{1 1} (x, y + a) \neq g_{1 1} (x, y)

It turns out that simple scaling of the points

D_{r} : (x, y) \mapsto (rx, ry)

is an isometry. This is called dilation. We compute

Unfortunately, this does not tell us any new geodesics, because it takes vertical lines to vertical lines.

We need a different idea. Notice that because the matrix of the metric is a scalar multiple of the euclidean metric (conformal), hyperbolic angles and euclidean angles are equal:

Therefore we should look for transformations of the upper half-plane that preserve euclidean angles (conformal). Perhaps you have encountered circle inversion before. It restricts to give a bijective map on the upper half-plane

S (x, y) = {(x^{2} + y^{2})}^{- 1} (x, y)

, because

(0, 0) \notin ℍ^{2}

. We see that, for

p = (x, y)

and

q = S (x, y)

The calculation for

g (TS (\partial_{2} |_{p}), TS (\partial_{2} |_{p}))

is exactly the same. That leaves

Therefore

S (x, y) = {(x^{2} + y^{2})}^{- 1} (x, y)

is an isometry, not just conformal.
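The isometry property can also be tested numerically at arbitrary points. The following sketch assumes the half-plane metric $g_{p} (v, w) = y^{- 2} (v \cdot w)$ and uses the Jacobian of $S$ written out by hand; the function names are hypothetical and the sample point and vectors are arbitrary.

```python
import math


def inversion(p):
    """Circle inversion S(x, y) = (x, y) / (x^2 + y^2)."""
    x, y = p
    r2 = x * x + y * y
    return (x / r2, y / r2)


def inversion_pushforward(p, v):
    """Apply the Jacobian of S at p to the tangent vector v."""
    x, y = p
    r2 = x * x + y * y
    j11 = (y * y - x * x) / r2 ** 2
    j12 = -2.0 * x * y / r2 ** 2  # the Jacobian is symmetric, so j21 = j12
    j22 = (x * x - y * y) / r2 ** 2
    return (j11 * v[0] + j12 * v[1], j12 * v[0] + j22 * v[1])


def hyperbolic_inner(p, v, w):
    """Inner product g_p(v, w) = (v . w) / y^2 in the half-plane model."""
    return (v[0] * w[0] + v[1] * w[1]) / p[1] ** 2


p = (0.8, 1.7)
v, w = (1.0, -2.0), (0.5, 3.0)
q = inversion(p)
before = hyperbolic_inner(p, v, w)
after = hyperbolic_inner(q, inversion_pushforward(p, v), inversion_pushforward(p, w))
```

Up to rounding, `before` and `after` agree. The reason is visible in the Jacobian: it is an orthogonal matrix scaled by $(x^{2} + y^{2})^{- 1}$, and the $y$-coordinate of $S (p)$ is scaled by exactly the same factor.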

Remarkably circle inversion and the horizontal translations are all the isometries we need to get all the geodesics. By applying

S

to the geodesic

γ (t) = (a, e^{t})

we get the geodesic

S (γ (t)) = {(a^{2} + e^{2 t})}^{- 1} (a, e^{t}) .

Looking on a graphing tool we see that they are semicircles centered on the

x

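-axis. This is easy to verify algebraically. Writing

(X, Y) = S (γ (t))

for

a \neq 0

we have

X^{2} + Y^{2} = {(a^{2} + e^{2 t})}^{- 1} = \frac{X}{a} \Leftrightarrow {(X - \frac{1}{2 a})}^{2} + Y^{2} = \frac{1}{4 a^{2}} .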

In particular we see that we have semicircles of every radius. By horizontal translation then, the set of geodesics includes every vertical line as well as every semicircle centered on the

x

-axis (from now on we will just say semicircle, leaving the centering implicit).

But in fact this accounts for every unit-speed geodesic. Choose any point

p

and any direction

v

with

∥ v ∥_{g} = 1

. If

v = \pm \partial_{2}

, then the desired geodesic is the vertical line. For any other

v

, by straightedge and compass (for coolness factor) one can construct a semicircle through that point with the direction as tangent. This must be the unique geodesic. For non-unit-speed geodesics, one can simply rescale

t

, using Lemma 4.6.

These isometries are not only transitive on the set of geodesics, but also on the set of points. This is easily achieved by dilating

(0, 1)

to

(0, y)

and then a horizontal translation to

(x, y)

. Every point can therefore be mapped to any other by first bringing it to

(0, 1)

and then sending it on its way. Spaces with a transitive set of isometries are called homogeneous. But the hyperbolic plane is also isotropic: it looks the same in every direction. This means that there is a full set of isometric rotations. Begin with

\partial_{2} |_{(0, 1)}

. Dilate and translate it to any point on a different vertical line as above. We know that this vertical line is transformed by circle inversion to a semicircle. A semicircle has every possible tangent direction except

\pm \partial_{2}

. You can now translate and dilate it back to

(0, 1)

. This gives every direction except

- \partial_{2}

. But this can be achieved through two

π ∕ 2

rotations.

Now that we have determined the geodesics, we can comment on the similarities and differences to other geometries. The first observation is that between any two distinct points there is a unique geodesic. Secondly, any two distinct geodesics intersect at most once. Both of these properties are similar to the euclidean plane but are in contrast to spherical geometry, where antipodal points have infinitely many geodesics between them and every pair of distinct geodesics intersects twice.

We can also consider the parallel postulate. Choose a geodesic and a point

p

external to it. We know that there is an isometry that will transform the geodesic into the

y

-axis, so we assume this without loss of generality. What are the geodesics through the point that do not cross the

y

-axis? Not only is there the vertical line through

p

, there are infinitely many semicircles. In particular, there is one semicircle through

p

which is tangent to the

y

-axis. This is called a limiting parallel. The vertical line through

p

is also a limiting parallel, based on our earlier explanation that as points increase in

y

value for constant

x

value, the distance between them shrinks.

A triangle in the hyperbolic plane is of course a shape with three geodesic sides, none of which are parallel. Just as for euclidean geometry, we can try to classify triangles up to isometry, which is called congruence in elementary geometry. By isometry, we may bring any edge of the triangle to the

y

-axis, so we assume that one vertex is at

(0, 1)

and the other

(0, b)

. Let

α

be the angle at

(0, 1)

and

β

the angle at

(0, b)

. We see that this information already uniquely determines a triangle, thus we have the ‘angle-side-angle’ rule of triangle congruence.

But more is true. There are restrictions on the angles for the two other sides to be able to intersect, namely

α + β < π

, just as for euclidean space. But unlike the classical situation, even if

α + β < π

this is no guarantee that the two sides meet. For any

α, β

, there is some

B > 1

such that for

b = B

the sides are limiting parallels. As

b

ranges from

B

down to

1

, the third angle ranges from

0

(limiting parallel) to

π - α - β

(all three corners very close to

(0, 1)

, so we can approximate the circles by their tangents). Therefore the sum of angles in a hyperbolic triangle is always less than

π

. Also note that the angle increases monotonically. This means there is a 1-to-1 correspondence between

b

and the third angle. In other words, two hyperbolic triangles are congruent if and only if their corresponding angles are equal: the ‘angle-angle-angle’ rule!

We close this section with a useful calculation method. It is often profitable to use complex numbers

x + ιy

to describe the points of

ℍ^{2}

. The chief advantage is the ease of writing isometries. Clearly horizontal translation and dilations are

z \mapsto z + a

and

z \mapsto rz

for

a, r \in ℝ

respectively. But inversion in the unit circle is

z \mapsto {\bar{z}}^{- 1}

and reflection in the

y

-axis is

z \mapsto - \bar{z}

. As you can see by the presence of complex conjugation, these latter two isometries are orientation reversing; they are both reflections after all. Therefore it is common to combine them into an isometry

z \mapsto - z^{- 1}

. This along with translation and dilation generate all orientation preserving isometries. A general orientation preserving isometry is therefore

z \mapsto \frac{a z + b}{c z + d}

for

a, b, c, d \in ℝ

with

ad - bc = 1

. Transformations of this form are called Möbius transformations. The set of isometries is three-dimensional so it was necessary to normalise the four constants, and using the ‘determinant’ makes the formula for the inverse transform simple.
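Complex arithmetic makes these claims cheap to test. The sketch below uses Python's built-in complex numbers and assumes the half-plane norm $∥ v ∥_{g} = | v | ∕ Im z$ together with the derivative formula $T^{'} (z) = (ad - bc) ∕ {(cz + d)}^{2}$ for $T (z) = (az + b) ∕ (cz + d)$. The function names and the sample coefficients are our own.

```python
def mobius(z, a, b, c, d):
    """The map z -> (a z + b) / (c z + d)."""
    return (a * z + b) / (c * z + d)


def mobius_pushforward(z, v, a, b, c, d):
    """Derivative of the map at z applied to the tangent vector v."""
    return (a * d - b * c) * v / (c * z + d) ** 2


def hyperbolic_norm(z, v):
    """Norm of the tangent vector v at z in the half-plane model."""
    return abs(v) / z.imag


z = 0.3 + 0.7j
v = 1.0 - 2.0j
a, b, c, d = 2.0, 1.0, 3.0, 2.0  # real coefficients with a d - b c = 1

image = mobius(z, a, b, c, d)
norm_before = hyperbolic_norm(z, v)
norm_after = hyperbolic_norm(image, mobius_pushforward(z, v, a, b, c, d))

# The generators from the text in this form: z -> -1/z is (0, -1, 1, 0),
# and the translation z -> z + 5 is (1, 5, 0, 1); both have determinant 1.
negative_inverse = mobius(z, 0.0, -1.0, 1.0, 0.0)
translated = mobius(z, 1.0, 5.0, 0.0, 1.0)
```

The image stays in the upper half-plane and the two norms agree, which is the statement that the transformation is an isometry at the level of tangent vectors.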

4.3 Length and Distance

In the above discussion of hyperbolic triangles, we were careful not to speak of the distance between the vertices of a triangle because we simply have not yet defined distance on a Riemannian manifold. Let us remedy that situation. In euclidean space, we had a distance function already and in Theorem 1.6 were able to show that the length of a path as in Definition 1.4 was the integral of its speed. Here we have no prior distance function, but the Riemannian metric does allow us to determine the speed. Therefore we ‘reverse’ these theorems:

Example 4.16 (Hyperbolic Plane). With this definition of length we can calculate the lengths of geodesics in the hyperbolic plane. Because we obtained the geodesics by isometries, which preserve the metric and therefore the length, it is sufficient to calculate the length of the vertical geodesic. It has the constant speed parameterisation

γ (t) = (0, e^{t}) .

Therefore

\begin{array}{rl} γ^{'} (t) & = (0, e^{t}), \\ ∥ γ^{'} (t) ∥_{g}^{2} & = ∥ (0, e^{t}) ∥_{g}^{2} = \frac{1}{{(e^{t})}^{2}} [0^{2} + {(e^{t})}^{2}] = 1, \\ L (γ |_{[a, b]}) & = \int_{a}^{b} ∥ γ^{'} (t) ∥_{g} dt = \int_{a}^{b} 1 dt = b - a . \end{array}

To put this more geometrically, the length of the geodesic connecting $(0, y_{1})$ and $(0, y_{2})$ is $| \ln y_{2} - \ln y_{1} | = | \ln (y_{2} ∕ y_{1}) |$ .
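Length does not depend on the parameterisation, and this can be seen numerically by integrating the speed of a non-constant-speed parameterisation of the same vertical segment. The sketch below assumes the half-plane speed $∥ γ^{'} ∥_{g} = \sqrt{{(x^{'})}^{2} + {(y^{'})}^{2}} ∕ y$ and uses a simple midpoint rule; the function name and the sample curves are hypothetical.

```python
import math


def hyperbolic_length(curve, velocity, a, b, n=10000):
    """Midpoint-rule approximation of the hyperbolic length of curve on [a, b]."""
    h = (b - a) / n
    total = 0.0
    for k in range(n):
        t = a + (k + 0.5) * h
        _, y = curve(t)
        vx, vy = velocity(t)
        total += math.hypot(vx, vy) / y * h  # speed in the half-plane metric
    return total


# Linear (not unit-speed) parameterisation of the segment from (0, 1) to (0, 5).
linear_length = hyperbolic_length(lambda t: (0.0, 1.0 + 4.0 * t),
                                  lambda t: (0.0, 4.0), 0.0, 1.0)
# Unit-speed parameterisation (0, e^t) on [-1, 2].
unit_speed_length = hyperbolic_length(lambda t: (0.0, math.exp(t)),
                                      lambda t: (0.0, math.exp(t)), -1.0, 2.0)
```

The first value approximates $\ln 5$ and the second approximates $2 - (- 1) = 3$, in line with the formula above.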

There are (at least) two equivalent definitions of the distance function on a Riemannian manifold in the literature, the difference being over which set of paths the infimum is taken. We have chosen ‘smooth curves’. Perhaps more common is to choose ‘piecewise smooth curves’. A curve is piecewise smooth if it is continuous and the set of non-smooth points is finite. These non-smooth points are called corners. Smooth functions are clearly nicer to work with than piecewise smooth functions; if you work with the latter you are forever splitting the curve into its smooth intervals and you must provide correction terms for the corners. The advantage of the piecewise approach is clear if you try to prove the triangle inequality for the distance function. In the piecewise approach it is immediate because the concatenation of a piecewise smooth curve from

p

to

p^{'}

and from

p^{'}

to

p^{″}

is again piecewise smooth. However the concatenation of smooth paths will in general not be a smooth path, and some smoothing will be required.

We truly need to take the infimum, even for subsets of euclidean space. Consider

U = ℝ^{2} ∖ {0}

and two points

p, - p

. The euclidean distance between them is

2 ∥ p ∥

. The unique path in the full plane that achieves this is the straight line, but this is not a path in the manifold, because it would pass through the removed origin. However, by taking paths that pass arbitrarily close to the origin, we see that the distance between these points is still

2 ∥ p ∥

This leads to an important point: even for manifolds that are Riemannian immersed in euclidean space the distance function of Definition 4.15 is not the restriction of the euclidean distance function. Consider

U = ℝ^{2} ∖ \bar{B (0, 1)}

. Now the (limiting) shortest path between

p

and

- p

skims the boundary of the unit disk. An easy estimate shows that any path in the manifold between these points is longer than

2 \sqrt{1 + ∥ p ∥^{2}}

, which is strictly greater than

2 ∥ p ∥

. Therefore the distance function coming from the Riemannian metric and the restriction of the euclidean distance function to

U

are different.
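To make the comparison concrete we can compute the length of the limiting path explicitly. The sketch below assumes, after a rotation, that $p = (d, 0)$ with $d = ∥ p ∥ > 1$. The limiting path then consists of two tangent segments of length $\sqrt{d^{2} - 1}$ joined by an arc of the unit circle of angle $π - 2 arccos (1 ∕ d)$. This decomposition and the function name are our own elementary computation, not taken from the text.

```python
import math


def detour_length(d):
    """Length of the limiting shortest path from (-d, 0) to (d, 0) around the closed unit disk."""
    if d <= 1.0:
        raise ValueError("the endpoints must lie outside the closed unit disk")
    tangent = math.sqrt(d * d - 1.0)            # one tangent segment
    arc = math.pi - 2.0 * math.acos(1.0 / d)    # arc between the two tangent points
    return 2.0 * tangent + arc


# Triples (euclidean distance, lower bound from the text, intrinsic distance).
comparison = [(2.0 * d, 2.0 * math.sqrt(1.0 + d * d), detour_length(d))
              for d in (1.01, 2.0, 10.0)]
```

In each triple the three numbers increase from left to right. The gap between the last two shrinks as $d$ grows, since far from the disk the detour costs very little.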

These considerations also apply to a general metric space. Given a distance function, we can define the length of a continuous curve as in Definition 1.4. Then we can define a second distance function, called the intrinsic distance function, as in Definition 4.15, as the infimum of the length of paths. The intrinsic distance between points is easily proved to be greater than or equal to their distance. Metric spaces for which the distance function is equal to the intrinsic distance are called length spaces. For euclidean space, one can calculate that the length of a line segment is equal to the distance between the end points, which proves that euclidean space is a length space.

Now that we have enough examples to make us cautious, let us begin to prove that the Riemannian distance function is reasonably behaved. First we show that locally it is equivalent to the euclidean metric of a chart. Then we show that

d^{g}

has the three properties of a distance function. The proof that

d^{g}

is symmetric is trivial and positivity will be addressed in Corollary 4.20. The triangle inequality, Corollary 4.27, requires Corollary 4.26, the ‘corner rounding’ lemma.

Proof. Choose a point $p$ and a chart $U \subset ℝ^{n}$ that contains it. Within this chart, there is a convex compact neighbourhood $K$ of $p$ (for example, a closed ball). In this chart we have the metric $g_{i j}$ as a matrix of functions, but we can also consider the euclidean metric on this chart $δ_{i j}$ . Consider all the vectors on $K$ that are unit-length with respect to $δ_{i j}$ . This is a compact set $K \times 𝕊^{n - 1}$ . Because $g$ is positive definite and continuous, the norm $∥ \cdot ∥_{g}$ attains a positive maximum $C$ and a positive minimum $c$ on this set of vectors. By decreasing $c$ or increasing $C$ we may assume that $C = c^{- 1}$ . This is not necessary, but makes the results pleasingly symmetric. For any vector $v \in TK$ , write $v = ∥ v ∥_{euc} \hat{v}$ with $∥ \hat{v} ∥_{euc} = 1$ , then

∥ v ∥_{euc} c \leq ∥ v ∥_{g} = ∥ v ∥_{euc} ∥ \hat{v} ∥_{g} \leq ∥ v ∥_{euc} c^{- 1} .

If we apply this to paths in $K$ , we obtain

(4.19) c L^{euc} (α) = c \int_{a}^{b} ∥ α^{'} ∥_{euc} dt \leq L^{g} (α) = \int_{a}^{b} ∥ α^{'} ∥_{g} dt \leq c^{- 1} \int_{a}^{b} ∥ α^{'} ∥_{euc} dt = c^{- 1} L^{euc} (α) .

It remains to lift this inequality of lengths to an inequality of distance. As mentioned above, the euclidean distance and its intrinsic distance are equal. Hence for any smooth path from $p$ to $q$

c d^{euc} (p, q) \leq c L^{euc} (α) \leq L^{g} (α) .

This proves that $c d^{euc} (p, q)$ is a lower bound for $d^{g} (p, q)$ . Because $K$ is convex, any two points can be joined with a straight-line $ℓ$ , which achieves the euclidean distance between those points. It follows

d^{g} (p, q) \leq L^{g} (ℓ) \leq c^{- 1} L^{euc} (ℓ) = c^{- 1} d^{euc} (p, q) .

Hence every sufficiently small ball measured with $d^{g}$ contains a euclidean ball and is contained in one. Since balls generate the topology, this shows the two topologies are equal. □

Hopefully this is illustrative of the general strategy in this topic: We try to reduce the situation to a single chart, and then in the chart we can compare our metric

g

with the easily understood euclidean metric on the chart.

We now prepare to prove the triangle inequality for

d^{g}

. First we give an example of blending two smooth curves.

Example 4.21 (Curve Blending). Consider the piecewise smooth curve $α : (- 1, 1) \to ℝ^{2}$ given by

t \mapsto \begin{cases} (t, 0) & \text{for } t \leq 0 \\ (0, t) & \text{for } t > 0 \end{cases}

It has a corner at $t = 0$ . We can think of this as the joining of the curve $α (t) = (t, 0)$ and the curve $β (t) = (0, t)$ . Let $ϕ_{𝜀} (x)$ be a smooth step function that is $0$ for $x < - 𝜀$ and $1$ for $x > 𝜀$ . Then

γ (t) : = (1 - ϕ_{𝜀} (t)) α (t) + ϕ_{𝜀} (t) β (t)

is a smooth curve that agrees with $α$ for $t < - 𝜀$ and with $β$ for $t > 𝜀$ .
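The blending formula is straightforward to implement. The sketch below makes one particular choice of smooth step function, built from $s \mapsto e^{- 1 ∕ s}$; this choice and the names `smooth_step` and `blend` are our assumptions, since the example only requires some smooth step with the stated properties.

```python
import math


def smooth_step(x, eps):
    """A smooth function that is 0 for x <= -eps and 1 for x >= eps."""
    def f(s):
        return math.exp(-1.0 / s) if s > 0 else 0.0

    u = x / eps  # rescale so that the transition happens on (-1, 1)
    rising, falling = f(1.0 + u), f(1.0 - u)
    return rising / (rising + falling)


def blend(alpha, beta, t, eps):
    """gamma(t) = (1 - phi(t)) alpha(t) + phi(t) beta(t)."""
    phi = smooth_step(t, eps)
    return tuple((1.0 - phi) * a + phi * b for a, b in zip(alpha(t), beta(t)))


def alpha(t):
    return (t, 0.0)


def beta(t):
    return (0.0, t)


eps = 0.1
before_corner = blend(alpha, beta, -0.5, eps)  # equals alpha(-0.5)
after_corner = blend(alpha, beta, 0.5, eps)    # equals beta(0.5)
inside = blend(alpha, beta, 0.05, eps)         # a genuine mixture of the two
```

Outside $(- 𝜀, 𝜀)$ the blended curve coincides exactly with the original pieces, and inside that interval it cuts across the corner.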

This formula for

γ

also works for smooth curves

α

and

β

in euclidean space of any dimension, so long as

α

is defined for

t < 𝜀

and

β

is defined for

t > - 𝜀

. In fact, it is not even necessary for the two curves to meet at

t = 0

Let us calculate how much the blending increases the length. For convenience, assume that the curves

α, β

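are parameterised by arc-length. The speed can be bounded from above:

∥ γ^{'} (t) ∥ \leq (1 - ϕ_{𝜀} (t)) ∥ α^{'} (t) ∥ + ϕ_{𝜀} (t) ∥ β^{'} (t) ∥ + | ϕ_{𝜀}^{'} (t) | ∥ β (t) - α (t) ∥ = 1 + | ϕ_{𝜀}^{'} (t) | ∥ β (t) - α (t) ∥ .

Since

α, β

are parameterised by arc-length, the concatenated curve has length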

2 𝜀

, the length of the blended curve is only slightly larger than the distance between the curves at

t = 0

. We can give a more concrete bound in terms of

𝜀

and that distance:

(4.22) L^{euc} (γ |_{(- 𝜀, 𝜀)}) \leq ∥ β (0) - α (0) ∥ + 4 𝜀 .

It is important to note again that the formula for blending requires that

α

is defined for

t < 𝜀

and

β

is defined for

t > - 𝜀

. So then how can we know that two smooth curves meeting at a point can both be extended past the corner? One answer is that Definition 1.2 defines a smooth path as the restriction of a smooth curve to a closed interval. However we give a stronger result that applies to piecewise smooth curves. This demonstrates that our definition is equivalent to the other common one in the literature. The idea is not to try to extend the curves at the corner; instead we allow ourselves to ‘graft on’ at a nearby point.

Example 4.23 (Curve Extension). Take a smooth curve $α : (a, 0) \to ℝ^{n}$ . For any small $𝜀 > 0$ , let $β_{𝜀}$ be the tangent line to $α$ at $α (- 2 𝜀)$ . Then blending the curve with its tangent around the point of tangency produces

(1 - ϕ_{𝜀} (t + 2 𝜀)) α (t) + ϕ_{𝜀} (t + 2 𝜀) β_{𝜀} (t) .

This is well-defined for all $t \in (a, \infty)$ . It agrees with $α$ for $t < - 3 𝜀$ and with $β_{𝜀}$ for $t > - 𝜀$ .

Now we can use three blends to remove a corner from a piecewise smooth curve. In fact, performing this operation on smaller pieces of the curve ever closer to the corner results in curves that have almost the same length.

Proof. The previous two examples have shown us how to construct $α_{𝜀}$ . First, on both sides of the corner, use Example 4.23 to extend $α |_{(a, 0)}$ and $α |_{(0, b)}$ past $0$ . Then use Example 4.21 to blend the two together at $0$ . In more detail, for $t$ in $(- 3 𝜀, - 𝜀)$ we blended $α |_{(a, 0)}$ with its tangent $β_{𝜀}^{-}$ at $t = - 2 𝜀$ . Similarly for $t$ in $(𝜀, 3 𝜀)$ we blended $α |_{(0, b)}$ with its tangent $β_{𝜀}^{+}$ at $t = 2 𝜀$ . And for $t \in (- 𝜀, 𝜀)$ we blended the two tangents together.

It only remains to show that this process hasn’t made the approximation too long. The only part of the curve that has changed is for $t \in (- 3 𝜀, 3 𝜀)$ :

\begin{array}{l} L^{euc} (α_{𝜀} |_{(- 3 𝜀, 3 𝜀)}) = (\int_{- 3 𝜀}^{- 𝜀} + \int_{- 𝜀}^{𝜀} + \int_{𝜀}^{3 𝜀}) ∥ α_{𝜀}^{'} ∥ dt . \end{array}

Over each integral we blend two curves together. From the above length estimate (4.22), we have that this is bounded from above by

\begin{array}{l} (0 + 4 𝜀) + (∥ β_{𝜀}^{+} (0) - β_{𝜀}^{-} (0) ∥ + 4 𝜀) + (0 + 4 𝜀) . \end{array}

The $12 𝜀$ certainly tends to zero. But what about the distance between the two tangent lines at $t = 0$ ? We can handle this with some triangle inequalities:

\begin{array}{rl} ∥ β_{𝜀}^{+} (0) - β_{𝜀}^{-} (0) ∥ & \leq ∥ β_{𝜀}^{+} (0) - β_{𝜀}^{+} (2 𝜀) ∥ + ∥ β_{𝜀}^{+} (2 𝜀) - β_{𝜀}^{-} (- 2 𝜀) ∥ + ∥ β_{𝜀}^{-} (- 2 𝜀) - β_{𝜀}^{-} (0) ∥ \\ & = 2 𝜀 + ∥ α (2 𝜀) - α (- 2 𝜀) ∥ + 2 𝜀, \end{array}

and since $α$ is continuous, this goes to zero also. □

Another approach I considered for the corner rounding process was to use the curve shortening flow. Geometric flows are an important area of research. Most famously, there is the Ricci flow, which was used to solve the Poincaré conjecture. In the curve shortening flow, one considers a family $α_{t} (s)$ of arc-length parameterised curves such that

\frac{\partial α_{t}}{∂t} = κ_{t} N_{t}

for the curvature $κ_{t}$ and normal $N_{t}$ . If we think of $t$ as a time parameter, it says that the curve is moving fastest where it is most curved, towards the center of the osculating circle. From our knowledge of curves and curvature, we know that this is equal to

\frac{\partial α_{t}}{∂t} = \frac{\partial^{2} α_{t}}{\partial s^{2}} .

We see that this is a heat equation. We know that the heat equation has very good regularity properties. Given a continuous initial condition, which in this case would be a curve $α_{0}$ , the solution is smooth in $t$ and analytic in $s$ . Thus if we begin with a continuous curve, then we obtain a smoothing through this flow. I recommend Andrews et al to students who are intrigued by this.
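The smoothing effect can be seen in a crude discrete model. The sketch below is an illustration only: it applies an explicit finite-difference heat step to the vertices of a closed polygon with a fixed discretisation parameter, which simplifies the true flow (where arc-length is re-measured at every time). The function names and the step size `lam` are assumptions.

```python
import math

def polygon_length(points):
    """Total length of the closed polygon through the given vertices."""
    n = len(points)
    return sum(
        math.hypot(points[(i + 1) % n][0] - points[i][0],
                   points[(i + 1) % n][1] - points[i][1])
        for i in range(n)
    )

def heat_step(points, lam=0.25):
    """One explicit heat-equation step: each vertex moves towards the
    midpoint of its two neighbours. Stable for 0 < lam <= 0.5."""
    n = len(points)
    new_points = []
    for i in range(n):
        xp, yp = points[i - 1]
        x, y = points[i]
        xn, yn = points[(i + 1) % n]
        new_points.append((x + lam * (xp - 2 * x + xn),
                           y + lam * (yp - 2 * y + yn)))
    return new_points

def sampled_square(samples_per_side=8):
    """Unit square with corners, sampled at equally spaced points."""
    corners = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
    points = []
    for k in range(4):
        (x0, y0), (x1, y1) = corners[k], corners[(k + 1) % 4]
        for j in range(samples_per_side):
            s = j / samples_per_side
            points.append((x0 + s * (x1 - x0), y0 + s * (y1 - y0)))
    return points

curve = sampled_square()
lengths = [polygon_length(curve)]
for _ in range(50):
    curve = heat_step(curve)
    lengths.append(polygon_length(curve))
```

Each new edge is a convex combination of three old edges, so the total length cannot increase from one step to the next; the corners of the square are rounded off first.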

Our ‘elementary’ corner rounding construction, though it seems as if it is particular to $ℝ^{n}$ , can also be performed in a Riemannian manifold.

Proof. As we already indicated, the proof comes down to the class of curves used to define the distance function. We defined it with smooth curves. Consider any sequences of smooth paths $α_{n}$ from $p$ to $p^{'}$ and $β_{n}$ from $p^{'}$ to $p^{″}$ . Assume that as $n \to \infty$ their lengths approach the respective distances between the points.

Now concatenate the paths to get a piecewise smooth path with possibly a corner at $p^{'}$ . Apply the corner rounding procedure of Corollary 4.26 to obtain smooth paths from $p$ to $p^{″}$ : let $γ_{n, 𝜀}$ be the $𝜀$ -blending of $α_{n}$ and $β_{n}$ . Because the distance $d^{g} (p, p^{″})$ is the infimum of the lengths of smooth paths from $p$ to $p^{″}$ , it is bounded above by any limit of lengths of such paths. Therefore

d^{g} (p, p^{″}) \leq lim_{n \to \infty} lim_{𝜀 \to 0} L (γ_{n, 𝜀}) = lim_{n \to \infty} (L (α_{n}) + L (β_{n})) = d^{g} (p, p^{'}) + d^{g} (p^{'}, p^{″}) . □

We close this section with a classic result: that geodesics are critical points of the length functional. The proof of this statement is reminiscent of the variational approach in Section 1.6 to show that minimal surfaces have vanishing mean curvature. In that situation, we considered graphs, so we could model a variation of the surface by adding another function. In the present case, we don’t want to make a similar simplification and instead we will work directly with smooth families of paths.

Proof. We compute

\begin{array}{rl} \frac{d}{ds} L (α (s, \cdot)) & = \int_{a}^{b} \frac{d}{ds} \sqrt{g (\partial_{t} α, \partial_{t} α)} dt \\ & = \int_{a}^{b} \frac{1}{2} g {(\partial_{t} α, \partial_{t} α)}^{- 1 ∕ 2} 2 g (\partial_{t} α, \nabla_{\partial_{s} α} \partial_{t} α) dt \\ & = \int_{a}^{b} ∥ \partial_{t} α ∥_{g}^{- 1} g (\partial_{t} α, \nabla_{\partial_{s} α} \partial_{t} α) dt . \end{array}

When we set $s = 0$ in the above, $∥ \partial_{t} α ∥_{g}^{- 1} = 1$ because we assumed that $α (0, t)$ was arc-length parameterised. Next we can use Lemma 3.49 to turn the covariant derivative in the $\partial_{s} α$ direction into one in the $\partial_{t} α$ direction. Additionally,

\frac{d}{dt} g (\partial_{s} α, \partial_{t} α) = g (\nabla_{\partial_{t} α} \partial_{s} α, \partial_{t} α) + g (\partial_{s} α, \nabla_{\partial_{t} α} \partial_{t} α) .

Putting these together gives

\begin{array}{rl} {\frac{d}{ds} L (α (s, \cdot)) |}_{s = 0} & = \int_{a}^{b} g (\partial_{t} α, \nabla_{\partial_{t} α} \partial_{s} α) dt \\ & = \int_{a}^{b} [ \frac{d}{dt} g (\partial_{s} α, \partial_{t} α) - g (\partial_{s} α, \nabla_{\partial_{t} α} \partial_{t} α) ] dt \\ & = g (\partial_{s} α, \partial_{t} α) |_{a}^{b} - \int_{a}^{b} g (\partial_{s} α, \nabla_{\partial_{t} α} \partial_{t} α) dt . \end{array}

The first term vanishes because all the paths have the same endpoints, hence $\partial_{s} α (s, a) = \partial_{s} α (s, b) = 0$ . The remaining integral vanishes whenever $α (0, \cdot)$ is a geodesic, since then $\nabla_{\partial_{t} α} \partial_{t} α = 0$ . □

Example 4.29. Consider the case of $𝕊^{2}$ and take points $p, q$ that are not antipodes of one another. There is a unique great circle through these two points, and these points break it into two (arc-length parameterised) geodesic paths, one long and one short. The shorter geodesic is the minimum (we will prove this in the next section) but the longer geodesic is a saddle point in the space of smooth paths between these points. We can see that it is a saddle point by imagining variations. If most of the path is fixed, and we just add a variation in one small area then the geodesic will have the lowest length of this family of paths. On the other hand, consider all the planes that contain $p, q$ . You can do this by rotating a plane on the line through $p, q$ . If you take the family of paths that are the intersection of these planes and the sphere, then the long geodesic is the longest and the short geodesic is the shortest path between $p, q$ in this family.
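The family of plane sections in this example can be checked numerically. The sketch below assumes the unit sphere and uses elementary circle geometry: a plane through the line $p q$ at distance $h$ from the centre of the sphere cuts out a circle of radius $\sqrt{1 - h^{2}}$ , and the chord $p q$ splits that circle into a minor and a major arc. All names and the parameter `chord` (the euclidean distance between $p$ and $q$ ) are illustrative.

```python
import math

def arc_lengths(chord, radius):
    """Lengths of the two arcs cut from a circle of the given radius by a chord."""
    half_angle = math.asin(min(1.0, chord / (2 * radius)))
    minor = 2 * radius * half_angle
    major = 2 * radius * (math.pi - half_angle)
    return minor, major

def plane_family(chord, samples=200):
    """Arcs from p to q obtained by rotating a plane about the line through p, q."""
    m = math.sqrt(1 - chord ** 2 / 4)  # distance from the centre to the line pq
    family = []
    for k in range(samples + 1):
        psi = math.pi * k / samples     # rotation angle of the plane
        h = m * abs(math.cos(psi))      # distance from the centre to the plane
        radius = math.sqrt(1 - h ** 2)  # radius of the circle of intersection
        family.append(arc_lengths(chord, radius))
    return family

chord = 1.0
family = plane_family(chord)
shortest = min(minor for minor, _ in family)
longest = max(major for _, major in family)
```

The plane through the centre (rotation angle $π ∕ 2$ ) gives the great circle; within this family its minor arc is the shortest path and its major arc is the longest.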

4.4 Exponential Maps

Although we have given a reasonable definition of the distance on a Riemannian manifold, it is often very difficult in practice to understand this function. We have not given an example of the distance between two points yet because the prospect of searching for the infimum of length over every possible path is daunting. Even in a well-understood space such as the euclidean plane, where would you even begin? While at first glance it seems as if Theorem 4.28 simplifies the search to geodesics, this is only a complete answer if you already know that there exists a length-minimising path between the two points. Again, the example of a punctured plane shows that a length-minimising path may not exist. The example of a punctured sphere and two points either side of the puncture, such that the shorter geodesic is blocked, shows that even though the two points are connected by a geodesic (the longer geodesic), its length is not the distance.

However, what we will show in this chapter is that every point has a special ball around it: every point in that ball has a unique geodesic to the center and the length of this geodesic is the distance to the center. The key gadget for this construction is the exponential map, which can be summarised as ‘follow the geodesic out from the center’. The name is due to a relationship with the exponential map in Lie group theory; actual exponentials will not be relevant to us here.

Recall that, due to the Rescaling Lemma 4.6, geodesics that only differ by speed have the same image. In the exponential map we have removed this duplication by only considering $t = 1$ . This is the only advantage of defining the exponential map over using geodesics directly. In fact the two are equivalent, since from the Rescaling Lemma we have that

{exp}_{p} (tv) = γ_{p, tv} (1) = γ_{p, v} (t) .

The exponential map is a partially defined function; its value only exists if the corresponding geodesic exists at time $t = 1$ . Geodesics are defined by an ODE with short-time existence, but we know nothing about their long-time existence. It is natural therefore to ask about $dom {exp}_{p} \subset T_{p} M$ . Because of the constant curve $α (t) = p$ we know that $0 \in dom {exp}_{p}$ . Likewise, short-time existence tells us that $dom {exp}_{p}$ contains a neighbourhood of $0$ . The final thing we can say is that $dom {exp}_{p}$ is star-shaped around $0 \in T_{p} M$ . For clarity, write $t = c$ in the above equation: ${exp}_{p} (cv) = γ_{p, v} (c)$ . If $γ_{p, v} (1)$ exists then $γ_{p, v} (c)$ exists for $c < 1$ because $γ_{p, v}$ is maximal. This shows that if $v \in dom {exp}_{p}$ then $cv \in dom {exp}_{p}$ for $0 \leq c \leq 1$ .

The next thing we have to understand is tricky because it breaks the usual hierarchy of concepts. We need to think about the tangent map $T_{v} {exp}_{p} : T_{v} (dom {exp}_{p}) \to T_{{exp}_{p} (v)} M$ . In particular, for $v = 0$ it is a map between $T_{p} M$ and itself. Recall the definition of a manifold in Chapter 2. A chart $U$ is an open subset of $ℝ^{n}$ and the tangent space is $T_{p} M = ℝ^{n}$ (with an equivalence relation between these $ℝ^{n}$ for different charts). Likewise we can think of $dom {exp}_{p} \subset T_{p} M$ as an open subset of euclidean space and $T_{v} (dom {exp}_{p})$ as $ℝ^{n}$ . Given $w \in T_{p} M$ we think of it in $T_{v} (dom {exp}_{p})$ as the curve $v + tw$ . For any $w \in T_{p} M$ we have

T_{0} {exp}_{p} (w) = {\frac{d}{dt} {exp}_{p} (tw) |}_{t = 0} = {\frac{d}{dt} γ_{p, w} (t) |}_{t = 0} = w .

Because $T_{0} {exp}_{p}$ is an isomorphism, the inverse function theorem says that there is a neighbourhood $V ∋ 0$ such that ${exp}_{p} |_{V}$ is a diffeomorphism onto its image. We can further restrict $V$ so that the image is entirely within the chart $U$ ⁴. Therefore we have a transition function ${exp}_{p} |_{V} : V \to U$ . Because $img {exp}_{p}$ is entirely within $U$ , it will obey the cocycle conditions with every other transition function of $M$ . Because it is defined on all of $V$ , adding this chart to the atlas does not create any new points of $M$ . In the literature this construction is called normal coordinates at $p$ . We will call this chart a normal chart at $p$ .

A normal chart has two metrics on it. Because it is a chart of $M$ , of course it has the metric $g$ . But $V$ is also a subset of $T_{p} M$ and $T_{p} M$ is an inner-product space using the inner product $g |_{p}$ . Therefore $V$ also has the metric that just uses $\tilde{g} = g |_{p}$ at every point. Clearly the two metrics are equal at $p$ . The coefficients ${\tilde{g}}_{i j}$ of this second metric are constants, and the second metric can be thought of as a euclidean metric. The natural question is therefore how $g$ and $\tilde{g}$ compare.

A normal chart $V$ is centered on $p$ , in the sense that $[0_{V}] = p \in M$ . The most useful property of a normal chart is that rays $t \mapsto tv$ , which are geodesics of $\tilde{g}$ , are also geodesics of $g$ . This is because ${\tilde{g}}_{i j}$ is constant, so its geodesics are straight lines in $V$ . On the other hand, if we view these rays in the chart $U$ , we have $t \mapsto {exp}_{p} (tv) = γ_{p, v} (t)$ , which are by definition geodesics of $g$ . For this reason we call these rays radial geodesics, without needing to specify which metric. Similarly we define geodesic balls and geodesic spheres centered at $p$ to be the sets

{\tilde{B}}_{r} = \{ v \in V : ∥ v ∥_{\tilde{g}} < r \}, \qquad \partial {\tilde{B}}_{r} = \{ v \in V : ∥ v ∥_{\tilde{g}} = r \} .

By restricting $V$ we may assume that it is a geodesic ball. Of course rays and spheres are orthogonal with respect to $\tilde{g}$ , but remarkably:

Proof. Choose any smooth function $ω : (- 𝜀, 𝜀) \to T_{p} M$ with $∥ ω ∥_{\tilde{g}} = 1$ and consider the family of curves in $V$

α (s, t) = tω (s) .

This family has the property that every main curve $α (s, \cdot)$ is a radial geodesic and that every transverse curve $α (\cdot, t)$ lies in a geodesic sphere. This means that $\partial_{s} α (0, t)$ is a tangent vector to the geodesic sphere $\partial {\tilde{B}}_{t}$ . Conversely, any tangent vector to a geodesic sphere arises in this way.

The lemma comes down to showing that $\partial_{t} α (0, t)$ and $\partial_{s} α (0, t)$ are orthogonal for all $t$ with respect to $g$ . Because $\partial_{s} α (0, 0) = 0 ω^{'} (0) = 0$ it is enough to prove that $g (\partial_{t} α (0, t), \partial_{s} α (0, t))$ is constant. We compute, for $\nabla$ the Levi-Civita connection of $g$ ,

\begin{array}{l} \frac{\partial}{∂t} g (\partial_{t} α (0, t), \partial_{s} α (0, t)) & = g (\nabla_{\partial_{t} α} \partial_{t} α (0, t), \partial_{s} α (0, t)) + g (\partial_{t} α (0, t), \nabla_{\partial_{t} α} \partial_{s} α (0, t)) \\ = 0 + g (\partial_{t} α (0, t), \nabla_{\partial_{s} α} \partial_{t} α (0, t)) \\ = \frac{\partial}{∂s} \frac{1}{2} g (\partial_{t} α (0, t), \partial_{t} α (0, t)) = 0 . \end{array}

The facts we have used here are that a geodesic parallel transports its tangent vector (Definition 4.1), a geodesic is constant speed (Lemma 4.5), and symmetry of covariant derivatives for families of curves (Lemma 3.49). □

Example 4.32 (Hyperbolic Plane). How does this apply to the hyperbolic plane? The hyperbolic plane is covered by a single chart $U = ℍ^{2}$ . Consider the set of geodesics through any point $p$ . We know that they are defined for all time, positive and negative. This tells us that ${exp}_{p}$ is defined on the whole tangent space. We also know that for any point $q$ there is a geodesic from $p$ to $q$ . This tells us that ${exp}_{p}$ is surjective onto $U$ . Finally, these geodesics only intersect at $p$ . This means that ${exp}_{p}$ is a diffeomorphism from $T_{p} M$ to $U$ , and the normal chart at $p$ covers all of $ℍ^{2}$ .

Let us be explicit for the point $p = (0, 1)$ . The geodesics through this point have the form $γ (t) = (0, e^{t})$ or

γ (t) = (a^{2} + 1) (\frac{a}{a^{2} + e^{2 t}} - \frac{a}{a^{2} + 1}, \frac{e^{t}}{a^{2} + e^{2 t}}) = (\frac{a}{a^{2} + e^{2 t}} (1 - e^{2 t}), \frac{a^{2} + 1}{a^{2} + e^{2 t}} e^{t}) .

We should try to combine these into a single formula. In the limit $a \to \pm \infty$ we have $γ (t) \to (0, e^{t})$ . Therefore the problem is that $a$ is not parameterised correctly in some sense. Let $a = cot 𝜃 ∕ 2$ . Then

\begin{array}{l} {sin}^{2} \frac{𝜃}{2} (a^{2} + e^{2 t}) & = {sin}^{2} \frac{𝜃}{2} ({csc}^{2} \frac{𝜃}{2} - 1 + e^{2 t}) = 1 + \frac{1}{2} (1 - cos 𝜃) (- 1 + e^{2 t}) \\ = \frac{1}{2} [(1 + cos 𝜃) + (1 - cos 𝜃) e^{2 t}] \end{array}

and

\begin{array}{l} γ (t) & = \frac{2 {sin}^{2} \frac{𝜃}{2}}{(1 + cos 𝜃) + (1 - cos 𝜃) e^{2 t}} (\frac{cos \frac{𝜃}{2}}{sin \frac{𝜃}{2}} (1 - e^{2 t}), {csc}^{2} \frac{𝜃}{2} e^{t}) \\ = \frac{1}{(1 + cos 𝜃) + (1 - cos 𝜃) e^{2 t}} (sin 𝜃 (1 - e^{2 t}), 2 e^{t}) . \end{array}

Now we see that we have a nice parameterisation $γ_{𝜃} (t)$ of the geodesics through $(0, 1)$ , with $𝜃$ giving the angle of the tangent with $\partial_{2}$ . For example $𝜃 = 0$ gives $γ_{0} (t) = (0, e^{t})$ and $𝜃 = π$ gives $γ_{π} (t) = (0, e^{- t})$ . More generally we see that

γ_{𝜃} (- t) = \frac{1}{(1 + cos 𝜃) e^{2 t} + (1 - cos 𝜃)} (sin 𝜃 (e^{2 t} - 1), 2 e^{t}) = γ_{𝜃 + π} (t) .

In fact, $(t, 𝜃)$ are polar coordinates for the normal chart at $(0, 1)$ .

This gives us a formula for geodesic spheres. In normal coordinates of course, geodesic spheres around $p$ are just $t = r$ . In $(x, y)$ coordinates, they are

α (𝜃) = \frac{1}{(1 + cos 𝜃) + (1 - cos 𝜃) R^{2}} (sin 𝜃 (1 - R^{2}), 2 R),

for $R = e^{r}$ . These are euclidean circles with centers at $(0, cosh r) = (0, 0.5 (R + R^{- 1}))$ and radii $sinh r = 0.5 (R - R^{- 1})$ .
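These claims can be spot-checked numerically. The sketch below takes the closed form of $γ_{𝜃} (t)$ from this example and the metric $g_{i j} = y^{- 2} δ_{i j}$ of the upper half-plane, and uses central finite differences for the derivatives. The helper names, the step size and the sample values of $𝜃$ and $r$ are assumptions made for illustration.

```python
import math

def gamma(theta, t):
    """Geodesic through (0, 1), in the closed form of this example."""
    e2t = math.exp(2 * t)
    denom = (1 + math.cos(theta)) + (1 - math.cos(theta)) * e2t
    return (math.sin(theta) * (1 - e2t) / denom, 2 * math.exp(t) / denom)

def velocity(curve, s, h=1e-6):
    """Central finite-difference derivative of a plane curve at parameter s."""
    (x0, y0), (x1, y1) = curve(s - h), curve(s + h)
    return ((x1 - x0) / (2 * h), (y1 - y0) / (2 * h))

def g(point, v, w):
    """Hyperbolic inner product y^{-2} (v . w) at a point of the half-plane."""
    return (v[0] * w[0] + v[1] * w[1]) / point[1] ** 2

theta, r = 0.7, 1.3
q = gamma(theta, r)
radial = velocity(lambda t: gamma(theta, t), r)     # tangent to the radial geodesic
angular = velocity(lambda th: gamma(th, r), theta)  # tangent to the geodesic sphere

speed_squared = g(q, radial, radial)       # should be close to 1 (unit speed)
gauss_pairing = g(q, radial, angular)      # should be close to 0 (orthogonality)
circle_defect = q[0] ** 2 + (q[1] - math.cosh(r)) ** 2 - math.sinh(r) ** 2
```

The three quantities test, respectively, that $γ_{𝜃}$ has unit speed, that the radial geodesic meets the geodesic sphere orthogonally with respect to $g$ , and that the point lies on the euclidean circle with center $(0, cosh r)$ and radius $sinh r$ .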

Example 4.33. We can apply the same analysis to the sphere. Consider the set of geodesics through any point $p$ . We know that they are defined for all time, positive and negative. This tells us that ${exp}_{p}$ is defined on the whole tangent space. We also know that for any point $q$ there is a geodesic from $p$ to $q$ . This tells us that ${exp}_{p}$ is surjective onto $𝕊^{2}$ . However, this time all geodesics from $p$ intersect at the antipode of $p$ . Therefore ${exp}_{p}$ is not injective. To construct a normal chart we therefore have to restrict to a subset of $T_{p} M$ .

For the south pole, to choose a definite point, we know the geodesics are the lines of longitude. Therefore the normal chart at this point has to have lines of longitude as rays. The normal chart is in fact just a rescaling of $U_{N}$ (stereographic projection) so that the rays are unit speed geodesics.
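The failure of injectivity is easy to see in code. The sketch below assumes the unit sphere in $ℝ^{3}$ and uses the standard great-circle formula ${exp}_{p} (v) = cos ∥ v ∥ \, p + sin ∥ v ∥ \, v ∕ ∥ v ∥$ for a tangent vector $v \perp p$ ; this formula is not derived in the text, and the function name is illustrative.

```python
import math

def exp_sphere(p, v):
    """exp_p(v) on the unit sphere in R^3, for v tangent at p (v orthogonal to p)."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0.0:
        return tuple(p)  # the constant geodesic
    return tuple(math.cos(norm) * pc + math.sin(norm) * vc / norm
                 for pc, vc in zip(p, v))

south_pole = (0.0, 0.0, -1.0)
# Two different tangent vectors of length pi, pointing along different
# lines of longitude.
image_1 = exp_sphere(south_pole, (math.pi, 0.0, 0.0))
image_2 = exp_sphere(south_pole, (0.0, math.pi, 0.0))
```

Both tangent vectors are sent to (approximately) the north pole, so ${exp}_{p}$ cannot be injective on any ball of radius greater than $π$ .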

Proof. We again use a normal chart $V = {\tilde{B}}_{r^{'}}$ at $p$ . Any unit speed radial geodesic has the form $γ (t) = tω$ for some $ω \in T_{p} M$ with $∥ ω ∥_{\tilde{g}} = 1$ . The point $q = rω$ lies on the geodesic sphere $\partial {\tilde{B}}_{r} \subset {\tilde{B}}_{r^{'}}$ . Since $ω$ is the tangent vector of $γ$ at $p$ and geodesics are constant speed

∥ γ^{'} ∥_{g} = ∥ γ^{'} (0) ∥_{g} = ∥ γ^{'} (0) ∥_{\tilde{g}} = ∥ ω ∥_{\tilde{g}} = 1 .

Therefore

L (γ) = \int_{0}^{r} ∥ γ^{'} ∥_{g} dt = \int_{0}^{r} 1 dt = r .

Now suppose first that $α$ stays entirely within the geodesic ball ${\tilde{B}}_{r^{'}}$ . This means we can write $α (t) = ρ (t) ω (t)$ , where $ρ : [a, b] \to ℝ_{\geq 0}$ and $ω : [a, b] \to T_{p} M$ with $∥ ω (t) ∥_{\tilde{g}} = 1$ . So notice that $ρ ω^{'}$ is tangent to the geodesic sphere $\partial {\tilde{B}}_{ρ}$ and $ρ^{'} ω$ is tangent to a radial geodesic. Then with the help of Gauss’ Lemma 4.31

\begin{array}{l} ∥ α^{'} (t) ∥_{g}^{2} & = g |_{α (t)} (ρ^{'} ω + ρ ω^{'}, ρ^{'} ω + ρ ω^{'}) \\ = {(ρ^{'})}^{2} g |_{α (t)} (ω, ω) + 2 g |_{α (t)} (ρ^{'} ω, ρ ω^{'}) + ρ^{2} g |_{α (t)} (ω^{'}, ω^{'}) \\ = {(ρ^{'})}^{2} + 0 + ρ^{2} g |_{α (t)} (ω^{'}, ω^{'}) \\ \geq {(ρ^{'})}^{2} . \end{array}

Therefore

L (α) = \int_{a}^{b} ∥ α^{'} (t) ∥_{g} dt \geq \int_{a}^{b} ρ^{'} (t) dt = ρ (b) - ρ (a) = r - 0 = L (γ) .

Finally, if $α$ does not stay entirely within the geodesic ball ${\tilde{B}}_{r^{'}}$ , then there must be some first time $t = \tilde{b}$ that it crosses $\partial {\tilde{B}}_{r}$ . Then

L (α) \geq L (α |_{[a, \tilde{b}]}) \geq L (γ) .

Thus the radial geodesic is a length-minimiser from $p$ to $q$ .

For the converse, observe that it is only possible to have equality if $α$ lies entirely within the geodesic ball and $g |_{α (t)} (ω^{'}, ω^{'}) = 0$ for all $t$ . This implies that $ω^{'} = 0$ and that $α$ is radial. □

Example 4.36 (Hyperbolic Plane). From Example 4.32 we know that for any point the normal chart covers the entire hyperbolic plane. Therefore the distance between any two points is given by the length of the geodesic path between them. This was calculated for vertical geodesics in Example 4.16. We can develop this into a general distance formula.

Consider two points $p = (x_{1}, y_{1}), q = (x_{2}, y_{2}) \in ℍ^{2}$ . We will use isometries to bring them both to the $y$ -axis. Firstly we translate so that $p^{'} = (0, y_{1}), q^{'} = (w, y_{2})$ for $w = x_{2} - x_{1}$ . Next, we construct an isometry using the complex number form. Suppose that $f$ transforms $p^{'}$ to $(0, 1) = ι$ and $q^{'}$ to $(0, r) = ιr$ . Then the inverse transform $f^{- 1}$ obeys

\frac{aι + b}{cι + d} = p^{'} = ι y_{1}, \frac{aιr + b}{cιr + d} = q^{'} = w + ι y_{2} .

From the left equation, we see that $a = y_{1} d$ and $b = - y_{1} c$ . Continuing with the right equation, we have

\begin{array}{l} ι y_{1} rd - y_{1} c & = wd - y_{2} rc + ι (y_{2} d + wrc), \\ Re : 0 & = (y_{1} - y_{2} r) c + wd \\ Im : 0 & = wrc + (y_{2} - y_{1} r) d, \\ \Rightarrow 0 & = (y_{2} - y_{1} r) (y_{1} - y_{2} r) - w^{2} r \\ = y_{1} y_{2} - (y_{1}^{2} + y_{2}^{2} + w^{2}) r + y_{1} y_{2} r^{2} \\ b & : = \frac{y_{1}^{2} + y_{2}^{2} + w^{2}}{2 y_{1} y_{2}} \\ r & = b \pm \sqrt{b^{2} - 1} . \end{array}

Thus the distance between $p$ and $q$ is given by $ln | \frac{r}{1} | = ln (b + \sqrt{b^{2} - 1})$ . In the case that $x_{1} = x_{2}$ we see that $b^{2} - 1 = {(y_{2}^{2} - y_{1}^{2})}^{2} ∕ {(2 y_{1} y_{2})}^{2}$ so that the distance formula reduces to

ln \frac{y_{1}^{2} + y_{2}^{2} + y_{2}^{2} - y_{1}^{2}}{2 y_{1} y_{2}} = ln \frac{y_{2}}{y_{1}}

as expected from Exercise 4.16.

One can continue to torture the distance formula until it yields to geometric interpretation:

\begin{array}{l} 4 y_{1} y_{2} (b + \sqrt{b^{2} - 1}) \\ = 2 y_{1}^{2} + 2 y_{2}^{2} + 2 w^{2} + 2 \sqrt{{(y_{1}^{2} + y_{2}^{2} + w^{2})}^{2} - 4 y_{1}^{2} y_{2}^{2}} \\ = 2 y_{1}^{2} + 2 y_{2}^{2} + 2 w^{2} + 2 \sqrt{(y_{1}^{2} - 2 y_{1} y_{2} + y_{2}^{2} + w^{2}) (y_{1}^{2} + 2 y_{1} y_{2} + y_{2}^{2} + w^{2})} \\ = {(y_{1} - y_{2})}^{2} + {(y_{1} + y_{2})}^{2} + 2 w^{2} + 2 \sqrt{({(y_{1} - y_{2})}^{2} + w^{2}) ({(y_{1} + y_{2})}^{2} + w^{2})} \\ = {(\sqrt{{(y_{1} - y_{2})}^{2} + w^{2}} + \sqrt{{(y_{1} + y_{2})}^{2} + w^{2}})}^{2} . \end{array}

We have the root of the sum of squares, so this must be some euclidean distance in the hyperbolic plane. Indeed, the left square root is the distance between $p$ and $q$ . For the other square root, if we take $p, q$ , and the semicircle they lie on, and reflect those in the $x$ -axis to make a full circle then the second square root is the distance between $p$ and the reflection of $q$ .
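The two forms of the distance formula are short enough to compare in code. This is a minimal sketch with illustrative function names; the second function uses the factorised expression above, rewritten as $2 ln ((\sqrt{A} + \sqrt{B}) ∕ (2 \sqrt{y_{1} y_{2}}))$ with $A, B$ the two sums of squares.

```python
import math

def hyperbolic_distance(p, q):
    """Distance in the upper half-plane via ln(b + sqrt(b^2 - 1))."""
    (x1, y1), (x2, y2) = p, q
    w = x2 - x1
    b = (y1 ** 2 + y2 ** 2 + w ** 2) / (2 * y1 * y2)
    b = max(b, 1.0)  # guard against rounding when p and q coincide
    return math.log(b + math.sqrt(b * b - 1))

def hyperbolic_distance_geometric(p, q):
    """Same distance from the two euclidean lengths in the factorised form."""
    (x1, y1), (x2, y2) = p, q
    w = x2 - x1
    near = math.hypot(w, y1 - y2)  # euclidean distance from p to q
    far = math.hypot(w, y1 + y2)   # euclidean distance from p to the reflection of q
    return 2 * math.log((near + far) / (2 * math.sqrt(y1 * y2)))
```

For two points on a vertical line the first function reduces to $ln (y_{2} ∕ y_{1})$ when $y_{2} \geq y_{1}$ , and for a point on the geodesic sphere of radius $r$ around $(0, 1)$ it returns $r$ up to rounding.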

There is one more property of normal charts that we will need in the next chapter, namely that the metric in a normal chart has a nice form at $p$ . For any chart, because $g |_{p}$ is non-degenerate we can always find an orthonormal basis of $T_{p} M$ . We can then make a linear change of coordinates such that this basis is a coordinate vector basis near $p$ . Doing this will diagonalise the metric, so $g_{i j} (p) = δ_{i j}$ . But typically $g_{i j}$ are non-constant, so this property only holds at $p$ and $\partial_{k} g_{i j} (p) \neq 0$ . A special feature of normal charts is that the partial derivatives of the metric coefficients are zero at $p$ , so that this procedure simplifies $g_{i j}$ at $p$ up to second order.

Example 4.38 (Hyperbolic Plane). Consider $p = (0, 1) \in ℍ^{2}$ . In the usual coordinates, $g_{i j} = y^{- 2} δ_{i j}$ so $g_{i j} (p) = δ_{i j}$ . But $\partial_{2} g_{i j} = - 2 y^{- 3} δ_{i j}$ so $\partial_{2} g_{i j} (p) \neq 0$ .

Recall Example 4.32. In that example we constructed polar coordinates for a normal chart at $p$ . Namely $(t, 𝜃)$ were related to the usual coordinates by

\begin{array}{l} (x^{1}, x^{2}) = (x, y) & = \frac{1}{(1 + cos 𝜃) + (1 - cos 𝜃) e^{2 t}} (sin 𝜃 (1 - e^{2 t}), 2 e^{t}) \\ = \frac{1}{cosh t - cos 𝜃 sinh t} (- sin 𝜃 sinh t, 1) . \end{array}

Let $(u^{1}, u^{2})$ be (cartesian) normal coordinates at $p$ with $(u^{1}, u^{2}) = (t cos 𝜃, t sin 𝜃)$ . Then we can calculate the metric in these normal coordinates using the change of chart formula from Section 3.1:

\begin{array}{l} {\tilde{g}}_{i j} = \frac{\partial x^{k}}{\partial u^{i}} \frac{\partial x^{l}}{\partial u^{j}} g_{k l} = \frac{\partial x^{k}}{\partial u^{i}} \frac{\partial x^{l}}{\partial u^{j}} y^{- 2} δ_{k l} = (\frac{\partial x^{1}}{\partial u^{i}} \frac{\partial x^{1}}{\partial u^{j}} + \frac{\partial x^{2}}{\partial u^{i}} \frac{\partial x^{2}}{\partial u^{j}}) y^{- 2} \end{array}

What we want to see is that ${\tilde{g}}_{i j} (p) = δ_{i j}$ and $\partial_{k} {\tilde{g}}_{i j} (p) = 0$ . The direct approach, while elementary, is ugly. To make our point it is sufficient to give Taylor series for the coefficients of the metric up to first order, which requires in turn the Taylor series of the change of coordinates up to second order. The key function to understand is the denominator

d : = cosh t - cos 𝜃 sinh t = cosh t - t cos 𝜃 \frac{sinh t}{t} .

Both $cosh t$ and $\frac{sinh t}{t}$ are even analytic functions of $t$ . Therefore inserting $t = \sqrt{{(u^{1})}^{2} + {(u^{2})}^{2}}$ gives

\begin{array}{l} d & = (1 + \frac{1}{2!} t^{2} + \dots) - u^{1} (1 + \frac{1}{3!} t^{2} + \dots) \\ = (1 + \frac{1}{2!} ({(u^{1})}^{2} + {(u^{2})}^{2}) + \dots) - u^{1} (1 + \frac{1}{3!} ({(u^{1})}^{2} + {(u^{2})}^{2}) + \dots) \\ = 1 - u^{1} + \frac{1}{2} {(u^{1})}^{2} + \frac{1}{2} {(u^{2})}^{2} + \dots, \end{array}

an analytic function of the normal coordinates. As $y = d^{- 1}$ , we can easily write down the Taylor series for $y^{- 2}$

y^{- 2} = 1 - 2 u^{1} + \dots .

For $y$ itself, clearly $y (p) = 1$ and

\begin{array}{l} \frac{∂y}{\partial u^{1}} & = - \frac{1}{d^{2}} \frac{∂d}{\partial u^{1}} & \frac{∂y}{\partial u^{1}} (p) & = 1 \\ \frac{∂y}{\partial u^{2}} & = - \frac{1}{d^{2}} \frac{∂d}{\partial u^{2}} & \frac{∂y}{\partial u^{2}} (p) & = 0 \\ \frac{\partial^{2} y}{\partial u^{1} \partial u^{1}} & = \frac{2}{d^{3}} {(\frac{∂d}{\partial u^{1}})}^{2} - \frac{1}{d^{2}} \frac{\partial^{2} d}{\partial u^{1} \partial u^{1}} & \frac{\partial^{2} y}{\partial u^{1} \partial u^{1}} (p) & = 1 \\ \frac{\partial^{2} y}{\partial u^{1} \partial u^{2}} & = \frac{2}{d^{3}} \frac{∂d}{\partial u^{1}} \frac{∂d}{\partial u^{2}} - \frac{1}{d^{2}} \frac{\partial^{2} d}{\partial u^{1} \partial u^{2}} & \frac{\partial^{2} y}{\partial u^{1} \partial u^{2}} (p) & = 0 \\ \frac{\partial^{2} y}{\partial u^{2} \partial u^{2}} & = \frac{2}{d^{3}} {(\frac{∂d}{\partial u^{2}})}^{2} - \frac{1}{d^{2}} \frac{\partial^{2} d}{\partial u^{2} \partial u^{2}} & \frac{\partial^{2} y}{\partial u^{2} \partial u^{2}} (p) & = - 1 . \end{array}

So the Taylor series of $y$ up to second order is

y = 1 + u^{1} + \frac{1}{2} {(u^{1})}^{2} - \frac{1}{2} {(u^{2})}^{2} .

Next $x = - d^{- 1} sin 𝜃 sinh t = - y u^{2} \frac{sinh t}{t}$ so up to second order

\begin{array}{l} x = - (1 + u^{1} + \dots) u^{2} (1 + \dots) = - u^{2} - u^{1} u^{2} . \end{array}

Finally we have up to first order

\begin{array}{l} {\tilde{g}}_{1 1} & = ({(\frac{∂x}{\partial u^{1}})}^{2} + {(\frac{∂y}{\partial u^{1}})}^{2}) y^{- 2} = ({(- u^{2})}^{2} + {(1 + u^{1})}^{2}) (1 - 2 u^{1}) \\ = (1 + 2 u^{1}) (1 - 2 u^{1}) = 1 \\ {\tilde{g}}_{1 2} & = (\frac{∂x}{\partial u^{1}} \frac{∂x}{\partial u^{2}} + \frac{∂y}{\partial u^{1}} \frac{∂y}{\partial u^{2}}) y^{- 2} = ((- u^{2}) (- 1 - u^{1}) + (1 + u^{1}) (- u^{2})) (1 - 2 u^{1}) \\ = (u^{2} - u^{2}) (1 - 2 u^{1}) = 0 \\ {\tilde{g}}_{2 2} & = ({(\frac{∂x}{\partial u^{2}})}^{2} + {(\frac{∂y}{\partial u^{2}})}^{2}) y^{- 2} = ({(- 1 - u^{1})}^{2} + {(- u^{2})}^{2}) (1 - 2 u^{1}) \\ = (1 + 2 u^{1}) (1 - 2 u^{1}) = 1 . \end{array}
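The same conclusion can be checked numerically without any Taylor series. The sketch below takes the change of coordinates $x = - u^{2} \frac{sinh t}{t} ∕ d$ , $y = 1 ∕ d$ from this example and approximates the partial derivatives by central differences. The function names, step sizes and tolerances are assumptions for illustration.

```python
import math

def sinhc(t):
    """sinh(t) / t, continued by its Taylor series near t = 0."""
    return math.sinh(t) / t if abs(t) > 1e-6 else 1.0 + t * t / 6.0

def to_half_plane(u1, u2):
    """Map normal coordinates (u1, u2) at p = (0, 1) to the usual (x, y)."""
    t = math.hypot(u1, u2)
    d = math.cosh(t) - u1 * sinhc(t)
    return (-u2 * sinhc(t) / d, 1.0 / d)

def normal_metric(u1, u2, h=1e-5):
    """Coefficients (g11, g12, g22) of y^{-2} (dx^2 + dy^2) in normal coordinates."""
    xa, ya = to_half_plane(u1 + h, u2)
    xb, yb = to_half_plane(u1 - h, u2)
    xc, yc = to_half_plane(u1, u2 + h)
    xd, yd = to_half_plane(u1, u2 - h)
    dx1, dy1 = (xa - xb) / (2 * h), (ya - yb) / (2 * h)  # derivatives in u1
    dx2, dy2 = (xc - xd) / (2 * h), (yc - yd) / (2 * h)  # derivatives in u2
    y = to_half_plane(u1, u2)[1]
    return ((dx1 * dx1 + dy1 * dy1) / y ** 2,
            (dx1 * dx2 + dy1 * dy2) / y ** 2,
            (dx2 * dx2 + dy2 * dy2) / y ** 2)

def first_derivatives_at_p(step=1e-3):
    """Central differences of the metric coefficients at p in both directions."""
    along_u1 = [(a - b) / (2 * step) for a, b in
                zip(normal_metric(step, 0.0), normal_metric(-step, 0.0))]
    along_u2 = [(a - b) / (2 * step) for a, b in
                zip(normal_metric(0.0, step), normal_metric(0.0, -step))]
    return along_u1 + along_u2
```

At the origin `normal_metric` should return values close to $(1, 0, 1)$ , and every entry of `first_derivatives_at_p()` should be close to zero, up to finite-difference error.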

We will finish this chapter by indicating various directions in which one could continue. One can investigate Riemannian manifolds further as metric spaces. We have encountered several times the example of the punctured plane and how it ‘blocks’ geodesics. The Hopf-Rinow theorem states that a connected Riemannian manifold is complete as a metric space if and only if every geodesic exists for all time. One can also develop the theory of the exponential map further. We have mentioned its connection to the exponential map in Lie group theory, so this could be expounded. We can also ask at every $p$ what is the largest geodesic ball on which ${exp}_{p}$ is injective; its radius is called the injectivity radius. Relatedly, we have the cut locus at $p$ which asks when geodesics from $p$ stop being the length-minimisers. We can also consider families of geodesics, using the Jacobi field, of which the radial geodesics are one example. Another example would be geodesics beginning on some hypersurface. A typical question is to ask at what rate these geodesics are moving apart.

Naturally one can look for special manifolds. We found the geodesics of the hyperbolic plane using isometries. As mentioned, spaces whose isometries are transitive are called homogeneous, and those whose isometries are transitive on the unit sphere of $T_{p} M$ are called isotropic at $p$ . Spaces such that for every point $p$ there is an isometry that acts as $- id$ on $T_{p} M$ are called symmetric spaces, and there is a complete classification.

These are all relatively ‘pure’ Riemannian geometry questions, in that they try to understand the intrinsic structure of a Riemannian manifold. But we can also take Riemannian manifolds as the setting to investigate all types of geometric problems. Geometric flows are one example. The final direction we will mention however is harmonic maps. Both minimal surfaces and geodesics are extremal for a functional, surface area and length respectively. If we model both of these problems as embedding stretched rubber objects and seeing the ‘minimal tension’ configuration, then this motivates the definition of a harmonic map. The name is due to the defining equation being a generalisation of the Laplacian to the context of Riemannian manifolds (compare the Laplacian in Equation (1.41)). My PhD work was on harmonic maps from a torus into $𝕊^{3}$ .

¹In the proof of Lee Lemma 5.8 any $c \in ℝ$ is allowed but certain places in the proof use $1 ∕ c$ .

²It is unknown whether it can be isometrically immersed into $ℝ^{4}$ , but it can be into $ℝ^{5}$

³Here I am using the translation of Richard Fitzpatrick, based on the authoritative Greek edition of J L Heiberg. A beautiful version designed for teaching can be found here, but it changes the numbering.

⁴It is not necessary to do this step, but it makes things simpler.
