What is a symplectic manifold, really?

I’m teaching a graduate course in symplectic geometry and GIT this semester, and am going to try to produce some posts related to lectures I’m giving there. Hopefully, this will help me think things through and put some new exposition out there on the internet.

So, obviously, the first question is “what is a symplectic manifold?” Now, wikipedia will tell you it’s a manifold equipped with a non-degenerate closed 2-form. Certainly that’s right, but it doesn’t tell a novice in symplectic geometry much. Why think about such a structure?

So let me try to put a different spin on this. This isn’t all that new of a spin (in fact, Henry Cohn wrote almost exactly the same thing here), but I don’t know of anywhere symplectic manifolds are really presented like this: I want to think of a symplectic manifold as a space where one can do a particular flavor of classical mechanics.  I’m going to define a mathematical object called a phase space. This is supposed to be a set of observable facts about a physical system (a “phase”); each point might represent a specific position and specific momentum, or it might be something coarser. Informally, we want that if we specify an energy function which only depends on the phase, then we can tell how the phase evolves with time, and this evolution is “reasonable.” More formally a phase space is a manifold M equipped with the following structure

  • If f:M\to \mathbb{R} is a smooth compactly supported function, then there is a time evolution a_f:M\times R \to M such that a_f(a_f(x,u),t)=a_f(x,u+t). Physically, we think of this as the energy function specifying how the system evolves over time.
  • Conservation of energy: f(a_f(x,t))=f(t).
  • No conserved quantities: for any two points x and y, there is a chain of energy functions and times f_i,t_i such that applying the time evolution for the f_i‘s in order for t_i goes from x and y.
  • Linearity under superposition: the flow a_f is the exponential of a vector field X_f, and we have that X_{f+g}=X_f+X_g and $X_{cf}=cX_f$ for all constants $c$.
  • Equilibrium: if x is a critical point of f, then a_f(x,t)=x for all t.
  • The assignment from energy functions to flows is equivariant under any of the flows: a_{f(a_g(-,t))}(x,u)=a_f(a_g(x,u),t).

All of these are hopefully intuitive properties for a physically system to have.

For example, if we let M=T^*N for some manifold N, we can think of this as the phase space for a single particle running around in N (or more generally n particles in Y and N=Y^n), where the covector measures momentum.  This case, we can split our position into space and momentum coordinates (x_p,x_q); the time derivative of x_p is a vector on N and the time derivative of x_q is a convector.  On the other hand, for any function E, the differential along the space coordinates \frac{dE}{dp} is a covector, and along the momentum coordinates \frac{dE}{dp} is a vector.   Hamilton’s equations rewrite Newton’s laws of motion as

\frac{dx_p}{dt}=-\frac{dE}{dq}

\frac{dx_q}{dt}=\frac{dE}{dp}

This gives a rule for obtaining X_E=(-\frac{dE}{dq},\frac{dE}{dp}), and the flow is obtained by integrating this vector field.

Now, I hope you’ve all guessed what the coming theorem is:

Theorem. A phase space is the same thing as a symplectic manifold.

So, given a phase space, how does one find the symplectic structure? Well, by the equilibrium condition, the vector $X_f$ at a point depends only on $df$: if df and dg are equal at a point, then X_f and X_g agree there too, since it is an equilibrium of $f-g$. Thus, by linearity, we have a linear map \xi:T^*M\to TM from the cotangent to the tangent bundle of M, which captures the assignment from energy functions to vector fields. By the lack of conserved quantities, this must be an isomorphism.

Of course, an isomorphism between a vector bundle and its dual can be thought of as an element of its tensor square T^*M\otimes T^*M; if x_i are coordinates in a neighborhood in M, then we have a coordinate independent 2-tensor given by

\omega=\sum dx_i\otimes\xi^{-1}(\partial/\partial x_i).

That is, if we let \xi_{ij} be the matrix coefficients of \xi and \xi^{-1}_{ij} the matrix coefficients of its inverse,

\omega=\sum \xi^{-1}_{ij}dx_i\otimes dx_j.

I’d like to show that this is a 2-form, that is, that \xi (and thus \xi^{-1}) has an anti-symmetric matrix for any basis and its dual.

So, by conservation of energy applied to the function x_i, we have that \xi_{ii}=\langle \xi(dx_i),dx_i\rangle=0. Furthermore, applied to x_i+x_j, we have that \xi_{ij}+\xi_{ji}=\langle \xi(dx_i+dx_j),dx_i+dx_j\rangle=0, so indeed \omega is a 2-form.

We’re almost to a symplectic manifold. We have a non-degenerate 2-form, we just need to know why its closed. Conveniently, we have one axiom we haven’t used: the equivariance of the assignment from energies to flows under the flows themselves. We can how restate this in terms of \omega: it says that \omega is invariant under all of the flows corresponding to functions. In terms of the vector fields X_f, we say that the Lie derivative of \omega along X_f is trivial. This can be restated more compactly: there’s a formula for the Lie derivative of a 2-form which is

\mathcal{L}_{X_f}\omega=d(\omega(X_f,-))+(d\omega)(X_f,-,-).

By definition, we have that \omega(X_f,-) though of as a 1-form is just df. In particular, this 1-form is closed, and we just have

0=\mathcal{L}_{X_f}\omega=(d\omega)(X_f,-,-),

which is the same as saying that d\omega=0.

Hooray! That finishes the proof one direction: the proof of the other direction can be found (in scattered pieces) in any text on symplectic geometry. The vector field X_f is called the Hamiltonian vector field of f, and you’ll most often find these properties phrased in terms of this or its associated Poisson bracket \{f,g\}=\mathcal{L}_{X_f}(g).  Thus, conservation of energy becomes antisymmetry \{f,f\}=0 and equivariance becomes the Jacobi identity \{f,\{g,h\}\}=\{\{f,g\},h\}+\{g,\{f,h\}\} (note to Lie algebraists: this is the identity that Jacobi actually knew.  He had no idea what a Lie algebra or group was).

This ends our first installment; I’ll continue as I come across bits of exposition that I think actually add to the exposition in Cannas da Silva.

7 thoughts on “What is a symplectic manifold, really?

  1. “No conserved quantities” is a cute name for the condition that there’s only one symplectic leaf, but I’m a little bothered by it. I would have thought a “conserved quantity” was, say, a Casimir, and there are plenty of Poisson manifolds with trivial Poisson center. I do agree that “no conserved quantities” is a reasonable physical axioms: the only way we set up experiments is by manipulating the energy functions involved, and the axiom is that we can in fact prepare any state from any other.

  2. This depends on how you interpret “quantity;” after all, if there is more than one symplectic leaf (and the space is connected), one of the leaves is lower dimensional, and there is some function that is constant on that leaf. From the perspective someone living on the smaller leaf, that observable is conserved.

  3. If I remember well symplectic manifolds and geometry were introduced by Arnold (Mathematical Methods of Classical Mechanics) and since the (only) reason to impose closedness of the symplectic form is as a slick formulation of “invariant under hamiltonian phase flows” we have the following interesting historical situation.

    Arnold was very critical of Bourbaki but in his book he states the definition of a symplectic manifold, including closedness of $\omega$ and then proves a “Theorem: The form… is an integral invariant of a hamiltonian phase flow.”.

    But nowadays, accustomed as we are now to wikipedia and mathoverflow intuitive explanations, we know this is misleading, the real theorem is that we can characterize/axiomatize symplectic forms (i.e. those yielding invariants under hamiltonian flows, in particular the volume form) as closed forms.

    So history has produced this educationally awkward situation: the presentations based on time/flow-invariance which are (it seems to me) absolutely necessary to make any sense of the “closed form” condition are not standard.

    (I am no expert and may be wrong. I apologize if I am and please correct me.)

  4. There is of course another physical explanation: (Pre-)Symplectic manifolds arise from variational principles.Since most of classical physics can be described by such a principle it is natural to consider symplectic manifolds. Along this road one will immediatly encounter moment maps and so on.

Comments are closed.