In this chapter we give the ‘patchwork’ definition of manifolds. Manifolds are geometrically nice spaces and a natural generalisation of . The most common way to define a manifold is as a special type of topological space, namely a ‘second-countable Hausdorff locally-euclidean topological space with an atlas’1. Because students usually encounter differential geometry before abstract topology, the lecturer then gives a speed run of all the definitions in topology. I think this approach is better suited to the second time you encounter manifolds. Then you already know a little bit about what makes manifolds nice, and you can appreciate the interesting but weird topological spaces that need to be excluded from the definition. The standard approach in effect defines a manifold by saying what a manifold isn’t. In the approach below we avoid defining general topological spaces and instead use concrete gluing construction of open sets of . After this construction we will still need to impose certain conditions, so topology cannot be avoided completely, but hopefully they are suitable for a new-commer to manifolds.
Before we dive into theory, we define a concept that you probably know but have never had a word for. A partial function from to is a function from a subset to . In the context of partial functions, a function with is called a total function. Many common functions are really partial functions: and are partial functions from to , with being respectively and . There doesn’t seem to be good standard terminology to talk about and , though is often called the natural domain. Let’s call the source of the function and the domain, with the symbols and . You are no doubt familiar with the difference between the codomain (also called the target) and the image (also called the range). We will use for partial functions (harpoon arrow), in contrast to for total functions.
Many students in Analysis I are confused about the relationship between injective and surjective, and those students are correct to be confused. Just as surjective means that the image is equal to the codomain, total means that the domain is equal to the source; they are the true counterparts to one another. In fact a partial function that is injective has an inverse partial function. If is injective then
is a perfectly valid definition of a partial function with and .
Besides inverses, many of the usual definitions for functions carry over with sensible modifications. For example, partial functions from to are equal if they have the same domains and are equal on all inputs. Likewise the composition of two partial functions and is a partial function , but the domain of will be smaller than of the domain of if the image of lies partly outside the domain of . We should also think about the ‘empty’ partial function. According to the set theory definition of a function, there is exactly one function from the empty set to any other set. This is usually considered a quirk in the definition. But the composition of partial functions where results in the empty function, so we cannot ignore it.
Remark 2.1. In this course, functions and partial functions will always have open sets as their sources and their domains will be open subsets, unless specifically noted otherwise. This is needed so that derivatives can be defined.
Fix a dimension . To keep track of the pieces we will glue together, let us introduce an index set . In our examples this will usually be a finite set, but we do not make this assumption generally. For every let be an open subset of . is called a chart. These are the pieces we will glue together.
For two charts, we describe how to glue them together using a partial function. A gluing function is a partial function that is a homeomorphism from its domain to its image. This means it is a homeomorphism (bijective, continuous, continuous inverse) between open subsets and . The idea is that the point is glued to . Note the order of the subscripts: “from to ”. The points of are not glued to any points of . Other names for gluing functions include ‘transition functions’, ‘change of coordinate functions’, and ‘overlap functions’. We allow here the possibility that and is the empty function, this represents the situation that and are not glued together at all. To avoid having too many named sets, we will mostly use instead of .
This information tells how to glue the pieces together, but how should we represent the completed glued object? First we define the disjoint union
which is a set of pairs. We think of this as saying that even if a point is common to both and , in the disjoint union we consider it as two separate points. For example, if and then the normal union is but the disjoint union is two intervals. We often do not write the index if it is clear, and even when it is not clear we tend to write it as a subscript. Continuing the example, has two points that might both be called , namely and . Formally these points should be written and respectively.
We want to create an equivalence relation on the disjoint union of all the charts such that iff . If this is to be an equivalence relation, the set of gluing functions is required to have certain properties. To get reflexivity of , we need . Symmetry of the relation holds if and only if . These are simple enough, but expressing the condition for transitivity is more difficult. The usual way to express the transitivity condition is that , but if we have symmetry then this is equivalent to . That means for all we need and . In the language of partial functions
If a set of gluing functions have these three properties, and thus defines an equivalence relation, we say that fulfill the cocycle conditions.
Definition 2.2. An atlas is a tuple , where fulfills the cocycle conditions. We have included in our definition of atlas to make all the charts have the same dimension (some authors allow spaces with a mix of dimensions). We call the glued space.2 Each ‘point’ in is an equivalence class of points in different .
Example 2.3 (Euclidean Space). Take any open subset . We can construct the trivial atlas for as follows. Let the index set and the only chart. Then is a gluing function. The cocycle condition is fulfilled, so is an atlas. The corresponding equivalence relation is the weakest equivalence relation on , namely is equivalent to itself but no other points. Therefore we say .
Example 2.4 (Polar Coordinates). Next we consider an example with two charts: . Consider the plane and the half-strip . To give an atlas it is sufficient to describe , because the cocycle condition requires and , as well as . Set
This is a homeomorphism from to . Therefore is an atlas. This example shows why is sometimes called a change of coordinates function.
Every point of is glued to some point of , but not every point of is glued to some point of . This leads to the points of the glued space are of two types: either they are an equivalence class with two points for and , or they are an equivalence class with a single point for with . Every equivalence class in contains an element of , so we might say casually .
Example 2.5 (Glued Circle). Consider , and . As in the previous example, it is sufficient to give . Set and and give the gluing function by the formula
For example, this tells us that we should glue to and to . Here the glued space is a circle, which you can see by cutting two strips of paper of the same length, drawing a number line from to on each of them, and then gluing as instructed. Every point of is equivalent to a point of or to .
Example 2.6. Consider everything the same as in Example 2.5, but this time give the formula
The space is a called an interval with two origins. This is because every point of is either for , or , or .
The interval with two origins might seem like a harmless curiosity, but in fact it is a weird topological space that we want to avoid. Let us say that a sequence in converges to a point if the sequence converges to in the chart . In Example 2.6 consider the sequence for . We can view this sequence in or . In it has the limit , but in is has the limit . Therefore in this sequence has two different limits!
There are other ways that this space is badly behaved. It is not a metric space, because the distance between the two origins is zero. So although every point has a neighbourhood that is homeomorphic to an open subset of euclidean space (locally euclidean), it is not really like euclidean space. We therefore want to exclude such pathological spaces. It turns out that the above example is the only way a gluing can cause non-unique limits.
Lemma 2.7 (Non-unique Limits). A glued space has non-unique limits if and only if there is a sequence such that converges to a point and converges to a point , for some charts .
Proof. Suppose first we have a sequence such that converges to a point and converges to a point . By definition, and are both limits of the sequence . But is outside the domain of the gluing function, so by definition is it not glued to any point of . Therefore and are distinct points in .
Conversely suppose a glued space has a sequence with two distinct limits. In euclidean space limits are unique, so if there are two limits then they must come from two charts and . Let’s use the notation and with . Suppose were in the domain of the gluing function. The gluing function by definition is a continuous function, so we would have . But means exactly that and this contradicts our assumption that the limits are distinct in . Therefore is not in the domain of , but it is the limit of a sequence in the domain. We have shown that . □
Definition 2.8. We say that an atlas has the unique limit property if it does not not satisfy the condition of Lemma 2.7. That is, if for every and every sequence that converges to some , the sequence does not converge in .
Observe why this is not an issue with Example 2.5. Consider the sequence in . It has the limit . On the other hand, the sequence converges to in , but is not in . Therefore this ‘other’ limit point is not in the space .
The other way that gluing can produce a topologically bad space is if we glue too many charts together. We will not provide an example of this; interested students may search for the ‘ long line’ or the ‘long ray’, which are standard examples of this phenomenon.
Definition 2.9. A manifold is an atlas with the unique limit property and such that the index set is countable.
Exercise 2.10. For students who know topology: show that this is equivalent to the standard definition of a manifold.
It is very common to talk about the glued space as the manifold without explicitly stating the atlas. This is similar to talking about a vector space as the set of vectors, when in fact it is the operations of addition and scalar multiplication that make a vector space interesting.
There are different sorts of manifolds, based on additional conditions on the gluing functions. We will use the notation that a function is when it is -times continuously differentiable. By convention, means that the function is continuous, and means that the function is smooth. An atlas (or a manifold) is called when all of the gluing functions are . Probably the most common type of manifold that is studied, and the one we will study in this course, are smooth manifolds. Henceforth, when we say manifold we mean smooth manifold.
Because we have given a non-standard definition of manifolds, we should explain how this compares to the standard definition. We do this using the example of a sphere and stereographic projection, which seems to be the first non-trivial example in every book on manifolds.
Example 2.11 (Stereographic Projection). The circle is commonly defined as the set , and we name the north pole and the south pole . If one draws chooses a point other than the north pole, and considers the line through and , there is a unique intersection point on the -axis. This is a bijective function called stereographic projection. A nice geometry exercise using similar triangles gives the formula
Likewise stereographic projection from the south pole is
Notice that these constructions are ill-defined when applied to the pole itself, because a single point does not determine a line, so it is naturally a partial function on the circle.
The inverses of the these functions,
and
are regular parametrisations of (parts of) the circle in the sense of Section 1.1. So given a point on the sphere, we can apply to get a point in , called its coordinate with respect to , and putting the coordinate into the parameterisation gives back the point . If we know the coordinate with respect to then is the coordinate with respect to . For this reason is the change of coordinates function. In this example, one can calculate
Notice that it is not possible to cover the circle by a single regular parameterisation (parameterisations must be injective). Thus to deal with spaces generally we must consider multiple overlapping parameterisations and change of coordinate functions.
In this example we have presented the standard approach to manifolds through coordinate functions and parameterisations. Let us connect it now with our definition. The index set is . The image of the coordinate functions are the charts, and . And the gluing function from to is . This data forms an atlas.
But what does this atlas have to do with the circle? We indeed see that the equivalence classes of points are , or for and . That is, there are in one-to-one correspondence with the points of . The glued space is a space that has all the same points as and the same topology, but it is not a subset of a euclidean space. Whatever we can describe on must be intrinsic to the circle, independent of its relationship to .
Exercise 2.12 (Stereographic Projection). Repeat the above construction for the n-sphere . We choose3 our coordinates on such that the north pole is and the south pole is . Show that stereographic projection have the formulas
and inverse stereographic projection is
Finally, show that the transition function is
In the standard definition of a manifold you begin with and coordinate functions . The procedure in Example 2.11 to go from this data to our definition of a manifold is fully general: define the charts to be and the gluing functions as .
Conversely, if you begin with a manifold in our sense then is the glued space . There is of course the canonical projection that sends every element to its equivalence class, but this is too coarse. We call the restriction a parameterisation of . It sends a point of to its equivalence class in . We really should think of this as a parameterisation because an ordinary set in euclidean space is describing part of a complicated object . In the other direction is the coordinate function. It sends an equivalence class to its representative that lies in . We usually write for the parameterisation rather than , as it is unnecessary to have two symbols. Thus the two approaches to manifolds give equivalent information.
Let us summarise our terminology and the relations between the objects. Charts are open sets of . An atlas is set of charts and gluing functions with the cocycle property. A gluing function can also be called a change of coordinate function or a transition function. The information of an atlas allows us to glue the charts together to get a manifold . Functions from the charts to the manifold are called parametrisations and functions from the manifold to the charts are coordinates. The composition of a parameterisation and a coordinate is a transition function. One of the drawbacks of differential geometry being an old and widely practised field is that notation and terminology has been around for a long time and is not completely standardised. Different authors use the word chart to describe , or .
Next we want to define functions between manifolds. On one hand, there is nothing to do. If we have manifolds and , they are sets, and a function is defined in the normal way. But manifolds are more than sets of points, they have atlases. Let
be atlases for and respectively. Then we can look at ‘in charts’. This means we look at the partial functions
These are functions between euclidean spaces, so we can ask whether they are .
Definition 2.13. A function is at a point if there is a chart and chart such that is , and it obeys a further technical condition.4 We say that is on if it is at every point of .
The definition of uses a chart. But we know that a point may belong to more than one chart. This opens the possibility that is at according to one chart, but not according to another chart. However, because of the relation
the definition does not depend on the which charts are used.
Example 2.14 (Euclidean Space). We have already seen that open subsets of euclidean space are manifolds in a particularly simple way, namely the coordinates and parametrisations are the identity function. Therefore a function in charts is the same thing as a function. This shows that a function between euclidean spaces is according to the manifold definition if and only if it is according to the ordinary definition.
Example 2.15 (Stereographic Projection). Consider the circle and the function given by . This is the height function. We can look at this function in charts
and likewise
It may help to understand this if we calculate it for a few points. Consider the south pole , which has a height . In -coordinates the south pole is and as expected. Now consider the point . Clearly is has . In -coordinates it is and in -coordinates it is . So then we compute
In conclusion, in charts is nothing other than a manipulation of the formula for to use coordinates; it gives the same result.
Perhaps not unexpectedly, this is a smooth function in the sense of manifolds because both and are smooth functions in the usual sense.
Example 2.16 (Glued Circle). We can give a function that embeds into euclidean space . It is easier to write the formulas in charts. We define
We can see that this is well-defined in the following way. Take a point . Then
What we have shown is that if then . This means that it doesn’t matter which chart you use, the result is the same. In other words we have defined a function that doesn’t depend on charts, is defined on .
This example shows us how to embed Example 2.5 into to get what we would normally think of as the circle. But please keep in mind that manifolds are defined as the gluing of charts; they are defined as an abstract space that does not need live in a bigger space. There are many ways to embed the circle into euclidean space. Even when we define a manifold starting with a subset of euclidean space, we leave the embedding behind.
Our definition of manifolds makes it easy to define (tangent) vectors and vector fields. At any point we have the tangent vectors and a vector field on is a function . To make this into a definition on a manifold, however, we need a way to make this independent of the chart. Alternatively, we need a way to compare vectors that are defined using different charts. There are essentially two equivalent ways to do this: using curves and using directional derivatives. We first give an example using curves as motivation.
Example 2.17 (Polar Coordinates). Consider the plane with cartesian coordinates , but also polar coordinates , as in Example 2.4. The transition function from polar to cartesian is . We consider vectors at the point with , which is equivalent to .
Let , a curve in with . In the chart it is a horizontal line with tangent vector . But we can also view this curve in the chart. It has the formula
At it has the tangent vector .
But these two charts represent the same points of manifold; the curves and have the same points under the gluing equivalence relationship. Likewise we should think of the two tangent vectors, at in the chart and at in the chart, as equivalent.
Both methods have the same setup. Let be two charts and the transition (we leave off the subscripts for this explanation to simplify notation). Let be a point in , , and be a vector on and be a vector on .
Curve method: Consider the curve . This curve has and . Using the transition, we also have a curve in with . The idea is that in the first chart is transformed to in the second chart. Using the chain rule
where is the matrix of partial derivatives of , also called the Jacobian matrix, evaluated at the point .
Directional derivative method: Take any function . We can use the transition function to write as a function on , at least near , namely . We compute the derivative in the direction of at the point and apply the chain rule
We see that the derivative of at in the direction of is equal to the derivative of at in the direction . From both of these methods we get the same answer:
Definition 2.18. The vector at is equivalent to the vector at when . This formula is called the change of coordinates for vectors. We denote the set of equivalence classes of vectors at a point by , called the tangent space to at .
Because of the second method, it is common to write a tangent vector as . This notation has two advantages. First that the change of coordinates is built into the notation via the chain rule, as above. And second, if we leave out the point at which we should evaluate the derivative then is a vector field. In fact it gives a basis of the vector fields. Every vector field on can be written as a function
for functions . Because vector fields thought of this way can be evaluated at a point to give a tangent vector as well as act on a function, the notation is potentially ambiguous. We will use for evaluation and for action on a function.
Example 2.19 (Stereographic Projection). Let’s see how the coordinate vector fields transform between the two charts of stereographic projection on . In particular has the Jacobian matrix
Therefore
We should note that this equality, which is really equivalence of vectors as per Definition 2.18, holds on the overlap between the two charts, namely away from the north and south poles. In particular, the vector is not defined at the north pole, this expression only has meaning on the chart . If we were to define a vector field by the formula
then this is a well-defined vector field on all of , because it gives a vector at every point and on overlaps the two cases give equivalent vectors. Observe that is the zero vector.
On the other hand, if we consider the vector field on then it corresponds to the vector field
on . This has no continuous extension to all of .
The main lesson of this example is that what looks like the ‘same vector’ at two different points in one chart look like completely different vectors in another chart. From the above example, and look the same in the chart, but in the chart these two vectors are
respectively. This is because the Jacobian has a dependence on , on the point in the manifold. Hence the equivalence relation is different at different points. The consequence is that there is no easy way to identity tangent vectors at different points of a manifold. We will examine how this problem can be overcome in the next chapter, which concerns ‘connections’ between different points of a manifold.
Example 2.20 (Stereographic Projection). Let us continue Example 2.19 to see that equivalent vectors give the same directional derivatives. Consider how the vector field
acts on the height function from Example 2.15. In the chart we have
Likewise in the chart we have
So in the chart for the point we have that the function is changing in the direction by . And in the chart for the corresponding point we have that the function is changing in the direction by . The directional derivatives in the two charts agree. This is a demonstration that everything is all well-defined.
Example 2.21 (Glued Circle). Because the Jacobian of the transition function is just the constant , the following is a well-defined vector field:
We can act this vector field on the function defined in charts as
which is just the first component of the function from Example 2.16. We get
This is a well-defined function on , which we know because it is multiplied by the second component from Example 2.16.
These two examples illustrate that a vector field applied in a chart to a function is another function . The fact that we used the chain rule to define equivalence of vectors is precisely the condition to ensure that the resulting functions in charts piece together to give a well-defined function on the whole manifold.
If we have a function between two manifolds , the tangent map or pushforward of at is a function between the tangent spaces . We will define it in two ways. The easiest definition is using the curve method of vectors. If we have a vector then in a chart the curve is a representative of this vector. Observe that is a curve in . We define to be the tangent vector of the curve at .
We also give a practical formula for calculating the tangent map. The curve must lie in some chart of . In charts we have and . So the tangent vector of the curve is
using the chain rule and the fact that , . Thus we see in charts that the tangent map is the Jacobian of .
We could have also given a definition of the pushforward based on the directional derivative idea. If is the pushforward of a vector , then we can ask how acts on a function . The observation is that and
Example 2.22 (Stereographic Projection). Consider the inclusion map . As a formula it is just identity, but we consider it as a function between manifolds. We can ask how the vector , a coordinate vector of the chart, is pushed forward by this map to a vector in , since is mapped to . Let us calculate the tangent map in the chart . As per above, this is Jacobian of :
and at we have
In coordinates, the vector is just , so its pushforward in coordinates is
Note that this vector is tangent to in the usual sense.
An important observation should be made about the previous example. We were careful to stress that the tangent map was applied to a vector at a particular point and not a vector field. That is because the tangent map does not, as students sometimes assume, take vector fields to vector fields. There are two ways it fails to do this. We could easily generalise this example to at any point of , which would then give a vector on . But not every point of gets a vector. Only the points in the image of get a vector. Therefore the result is not a vector field on .
The other way that the tangent map can fail to transform a vector field into another vector field, which does not apply to the above example, is if the function is not injective. In that case some points in the image get multiple vectors from the tangent map. Since a vector field is a certain type of function, and functions must have exactly one output, the pushforward of a vector field may not be a function.
Exercise 2.23 (Stereographic Projection). Verify the following formula for the tangent map of in the chart.
We end this section by highlighting a distinction that we have elided until now. A tangent vector is an intrinsic concept on a manifold, it does not require the manifold to be immersed in euclidean space. And yet in the previous chapter we repeatedly talked about tangent vectors to surfaces as a vector in that was tangent in the geometrical sense. The connections between these two ideas is exactly the idea of immersion and pushforward.
We commented on this at the end of Example 2.22. Another possible example would be Example 2.16. We saw that a manifold can be mapped into a euclidean space by a function. The pushforward of its tangent vectors by this function are exactly the tangent vectors in the intuitive picture. Thus for an immersion, it is possible to consider the elements of as certain elements of , and this is unambiguous because by definition is injective.
This section will not be used later in the script. It exists just to flex how natural the construction of the tangent bundle is in our approach to manifolds.
Above we have defined the tangent space to a manifold at a point and discussed how the vectors of and should be thought of as distinct, even if and are in the same chart. But we also worked with examples of vector fields, whose values are vectors at different points. We can reconcile this tension by putting all the vectors of a manifold together into a new manifold:
Definition 2.25. Let be a manifold with atlas . We construct a new atlas , where and
The corresponding manifold is called the tangent bundle . There is a function in charts by
We tend to think of the tangent bundle of as all the vectors of , with the understanding that vectors at different points are distinct from one another. The function is called the canonical projection of the bundle. Intuitively it takes a tangent vector to its base point. Hence are all the vectors which live at the same point, in other words .
The tangent bundle allows us to speak and reason formally about tangent vectors as a whole. We can define a vector field on as a function from to with the property that . This property says that for every point the vector must have the base point , which is exactly what a vector field is.
Exercise 2.26. Check that the tangent bundle is a manifold. In particular you must check the unique limit condition.
Example 2.27 (Euclidean Space). Since the trivial atlas for an open subset has only one chart , so too does its tangent bundle . Therefore the tangent bundle is just the cartesian product. The only difference between and usual way we think about vectors in euclidean space is that the base point of the vector is important for the tangent bundle.
Example 2.28 (Glued Circle). We saw in Example 2.21 that there was a vector field on that was never zero. Because is one dimensional, every vector in it must be a scalar multiple of . This means we can define a function from to
Allowing to change gives us a function from to . Thus the tangent bundle of the circle is also a product. Tangent bundles that a products are called trivial.
Example 2.29. In this example we want to show a non-trivial tangent bundle, but it’s actually somewhat difficult to prove non-triviality. Instead we will try to convey the idea. The tangent bundle of is non-trivial. If it were trivial, that would mean there would exist a smooth bijective correspondence between and . We could use this correspondence to write any vector field on as with . Conversely there would exist a vector field with the form . This vector field is never zero.
But it is a theorem that every vector field on the sphere has at least one zero. This is called the hairy ball theorem or the hedgehog theorem. There are many intuitive interpretations of this theorem; one is that there must be somewhere on earth where the wind is not blowing. Try imagining different wind maps, and perhaps you will convince yourself of the truth of this theorem. This is one reason that the tangent bundle of cannot be trivial.
The tangent map takes its nicest form when expressed with the tangent bundle. We can collect together all the tangent maps into a single map . Using the formula in charts in terms of the Jacobian we get
This formula shows that the tangent map is the generalisation of the Jacobian, and all local properties of the Jacobian carry over to the tangent map.
Exercise 2.30. Prove that the above formula for the tangent map is well-defined (in different charts) by using the chain rule for Jacobians: for and
Exercise 2.31 (Glued Circle). Calculate the tangent map for from Example 2.16. Use this to give an alternative description of the tangent bundle by locating it as a set inside .
As you have seen, when working with vectors in charts, there are many summations, but all of them are just from to . There is a convention, called the Einstein summation convention or the Einstein rule, that allows us to omit all these sigmas.
Definition 2.32 (Einstein Summation). We apply the following notational rule: when an index occurs twice in a term, once as an upper index and once as a lower index, we sum that index over its range. For the purposes of this rule counts as a lower index.
In this notation, the chain rule reads
A vector field that has coefficients with respect to a chart can neatly be written as , and its expression with respect to a second chart might be written using the equivalences of vectors as
This convention is useful for even ordinary linear algebra. If you have a matrix we would normally write the entries . However if we instead write it with a mix of upper and lower indices then we can write matrix multiplication in the following way
Likewise matrix multiplication would be . The saying to turn matrix algebra into index algebra with summation convention is “Upper indices go up to down; lower indices go left to right”.
We know that bilinear functions can also be represented as a matrix; in usual matrix notation:
In summation convention we write this as , with the vector components having upper indices, which forces to have two lower indices. This is actually an advantage of index notation over matrix notation. Though linear transforms and bilinear transforms can both be represented as a matrix, those matrix representations behave differently under change of basis. We can see that they are different in index notation, but not in matrix notation.
Some authors do not have the condition that you need one upper and one lower index, and allow summation over any repeated index. I do not like this, because it makes it impossible to talk about the diagonal elements . If you need to sum over two indices on the same level, you can of course use a summation sign. Alternatively you can use an identity matrix, whose entries are usually called . For example
Finally, some useful rules of calculation are
We have seen examples of how vectors are a type of directional derivative of functions and that a vector field applied to a function gives another function. A natural question is to ask whether we can also differentiate a vector field using a vector field. The answer is yes: in fact vector fields can differentiate many objects on manifolds using a process called the Lie derivative , but the process complicated and we won’t explain it here. We show with the following example why the ‘obvious’ way to differentiate vector fields doesn’t work.
Example 2.33 (Polar Coordinates). You might guess that you can differentiate a vector field just by differentiating its coefficient functions:
but this doesn’t produce a well-defined vector field, because in different charts it produces inequivalent vectors.
Let us give an example. Consider the vector fields
on . For example, if we act on with this definition, in we have
But in , polar coordinates, we have
So the same calculation in polar coordinates gives
Hence this operation depends on which coordinates one uses. It is not well-defined on the manifold.
It turns out the correct way to differentiate vector fields is to take the commutator of the above guess.
Definition 2.34. The Lie derivative of a vector field with respect to a vector field , also called the Lie bracket, acts on a function by
From this definition we can ask how to write it in charts:
So even though the definition of appears to have second derivatives of , these cancel out and what remains is indeed a (first order) vector field acting on .
Example 2.35 (Polar Coordinates). Let us take the vector fields from the previous example and compute their Lie bracket. In the chart
and in the chart
Whenever we have an expression in charts, it is good to check that in different charts it produces equivalent vectors. We do so in the following lemma. This is not strictly necessary, because the definition doesn’t use charts, but good practice none-the-less.
Lemma 2.36. The above chart expression gives a vector field that is independent of the choice of chart.
Proof. If and are the expressions of two vector fields in the -chart, then the expression of these fields in the -chart is
In this chart, the th component of is
In total then, in the vector field in the -chart is
But we see that this is equivalent to the expression in the -chart. Therefore the definition produces a well-defined vector field. □
There are of course many things that can be said about the Lie bracket. The first observation is that it is -bilinear and antisymmetric: and . If you have two coordinate vector fields, then their Lie bracket is zero. Thus one interpretation of the Lie bracket is that it is a measurement of how far two vector fields are from being coordinate vector fields. The final property that we will give is a product rule: if is a function, then
Example 2.37 (Euclidean Space). We give some euclidean examples. Consider the plane . Let and . Then plugging in the definitions
Next consider . Then
Finally, set . then
1Sources differ: some use second-countable, others use the ‘Lindelöff’ property, which is equivalent in this context, and others use the more general ‘paracompact’
2If you are familiar with topology, the quotient topology makes a locally euclidean topological space.
3Another common convention is to zero-index the components the have the in the zeroth component
4It is necessary to require that has an open neighbourhood such that . If you omit this condition, it is possible to make an example where is continuous in every chart but is not continuous on as a whole. See Lee “Introduction to Smooth Manifolds” Problem 2-1 for an example.