Hamiltonian systems with constraints
Audience: graduate students with prior exposure to Lagrangian/Hamiltonian mechanics and basic field theory.
Goal: learn the Dirac–Bergmann treatment of constrained Hamiltonian systems and see it in action for Maxwell theory, the Proca (massive vector) field, and general relativity (ADM).
1. Why constraints appear
Section titled “1. Why constraints appear”In ordinary (regular) Lagrangian mechanics, the Legendre map
is invertible because the Hessian
has . One can solve and define the Hamiltonian .
A constrained system arises when the Hessian is singular: . Then the momenta are not independent; they satisfy relations
called primary constraints. The symbol (“weak equality”) means the relation holds on the constraint surface in phase space, but you should not use it inside Poisson brackets until you have computed them.
In gauge theories the singularity is not an accident: it reflects redundancy in the description (gauge symmetry). In massive theories (e.g. Proca) singularity can also occur because some variables are nondynamical multipliers even without gauge symmetry.
2. Dirac–Bergmann algorithm in a nutshell
Section titled “2. Dirac–Bergmann algorithm in a nutshell”2.1 Canonical Poisson brackets for fields
Section titled “2.1 Canonical Poisson brackets for fields”For canonical fields ,
2.2 Total Hamiltonian and consistency
Section titled “2.2 Total Hamiltonian and consistency”Given a Lagrangian density :
-
Define canonical momenta
Relations among that do not determine velocities are primary constraints .
-
Canonical Hamiltonian (Legendre transform where possible)
-
Total Hamiltonian
where are Lagrange multipliers enforcing the primary constraints.
-
Consistency conditions Require constraints be preserved under time evolution:
This may:
- produce secondary constraints, tertiary, etc.; and/or
- fix some multipliers .
Continue until closure.
2.3 First-class vs second-class
Section titled “2.3 First-class vs second-class”Let be the complete set of constraints.
-
First-class constraint: for all .
These generate gauge transformations (redundancies). -
Second-class constraint: the matrix (kernel)
is (functionally) invertible on the constraint surface.
These do not generate gauge; they simply remove phase-space directions.
2.4 Dirac bracket (for second-class constraints)
Section titled “2.4 Dirac bracket (for second-class constraints)”If are second-class and is invertible with inverse ,
Then for all , so you may set strongly after switching to Dirac brackets.
2.5 Counting physical degrees of freedom
Section titled “2.5 Counting physical degrees of freedom”Let be the number of configuration fields per space point (so phase space has dimension ). Let be the number of first-class constraints and the number of second-class constraints (per point, in an appropriate local sense). Then
Equivalently, in phase space:
3. Worked example I: Maxwell theory (massless spin-1)
Section titled “3. Worked example I: Maxwell theory (massless spin-1)”3.1 Lagrangian and 3+1 split
Section titled “3.1 Lagrangian and 3+1 split”Take vacuum Maxwell in flat space:
With our signature,
Define
Then
3.2 Canonical momenta and primary constraint
Section titled “3.2 Canonical momenta and primary constraint”Define
There is no in , hence
For ,
Thus is the electric field.
3.3 Canonical Hamiltonian
Section titled “3.3 Canonical Hamiltonian”Compute the Hamiltonian density
Solve :
Since ,
Integrating by parts (dropping boundary terms),
so the canonical Hamiltonian is
3.4 Total Hamiltonian and Gauss constraint
Section titled “3.4 Total Hamiltonian and Gauss constraint”Add the primary constraint with multiplier :
Preserve :
This yields the secondary constraint
i.e. Gauss’s law in vacuum.
No further constraints arise; the multiplier remains undetermined.
3.5 Constraint algebra and first-class nature
Section titled “3.5 Constraint algebra and first-class nature”Using canonical brackets,
Thus and are first-class.
3.6 Gauge generator (one gauge function, two constraints)
Section titled “3.6 Gauge generator (one gauge function, two constraints)”A gauge parameter is a function . The appropriate generator can be written (Castellani’s algorithm) as
Then
so , the usual gauge symmetry.
3.7 Degrees of freedom
Section titled “3.7 Degrees of freedom”We have configuration fields . Constraints: first-class and . Hence
the two transverse photon polarizations.
3.8 Optional: explicit reduction in Coulomb gauge
Section titled “3.8 Optional: explicit reduction in Coulomb gauge”To see the “constraint + gauge” removal explicitly, decompose
Then Gauss’s constraint is , which removes the longitudinal momentum (up to boundary conditions). The longitudinal coordinate is removed by gauge.
If you impose Coulomb gauge , then form a second-class pair:
which is invertible (as an operator) after specifying boundary conditions. The reduced Hamiltonian becomes a Hamiltonian for the transverse fields only:
4. Worked example II: Proca (massive spin-1)
Section titled “4. Worked example II: Proca (massive spin-1)”4.1 Lagrangian and sign conventions
Section titled “4.1 Lagrangian and sign conventions”With our signature , a convenient Proca Lagrangian that leads to a positive-energy Hamiltonian is
The field equation is
and taking gives the on-shell constraint
4.2 3+1 split
Section titled “4.2 3+1 split”Using , we get
Therefore
4.3 Canonical momenta and primary constraint
Section titled “4.3 Canonical momenta and primary constraint”Exactly as in Maxwell,
and
4.4 Canonical Hamiltonian
Section titled “4.4 Canonical Hamiltonian”Compute
One finds
Integrating by parts,
4.5 Total Hamiltonian and secondary constraint
Section titled “4.5 Total Hamiltonian and secondary constraint”Total Hamiltonian:
Preserve :
So the secondary constraint is
4.6 Second-class structure (no gauge symmetry)
Section titled “4.6 Second-class structure (no gauge symmetry)”Compute the constraint bracket:
Thus are second-class: there is no gauge symmetry. Consistency now fixes the multiplier rather than leaving it arbitrary.
4.7 Eliminating and the reduced Hamiltonian
Section titled “4.7 Eliminating A0A_0A0 and the reduced Hamiltonian”The constraint is algebraic in :
Substitute into the Hamiltonian. The -dependent terms combine as
Hence the reduced Hamiltonian is
which is manifestly bounded below.
4.8 Degrees of freedom
Section titled “4.8 Degrees of freedom”Here , , . Thus
corresponding to helicities of a massive spin-1 particle.
4.9 Optional: Stückelberg trick and “gauge vs second-class”
Section titled “4.9 Optional: Stückelberg trick and “gauge vs second-class””Introduce a scalar and replace
Then the Proca mass term becomes gauge invariant under
The theory is now gauge invariant (first-class constraints reappear), but it contains an extra field . After gauge fixing (e.g. ) you recover Proca with 3 physical DOF. This is a useful conceptual bridge: second-class constraints can be viewed as gauge-fixed first-class systems (under appropriate extensions).
5. General relativity as a constrained Hamiltonian system (ADM)
Section titled “5. General relativity as a constrained Hamiltonian system (ADM)”The Hamiltonian formulation of GR is the prototype of a field theory with:
- singular Lagrangian (lapse and shift are nondynamical),
- first-class constraints (encoding diffeomorphism invariance),
- a nontrivial constraint algebra with structure functions (hypersurface deformation algebra).
We sketch the derivation carefully enough to see where each ingredient comes from.
5.1 Einstein–Hilbert action and 3+1 split
Section titled “5.1 Einstein–Hilbert action and 3+1 split”Start from the Einstein–Hilbert action (with cosmological constant )
The boundary term (e.g. Gibbons–Hawking–York) ensures a well-posed variational principle when fixing the induced metric on the boundary.
Assume spacetime is foliated by spacelike hypersurfaces , with coordinates on each . The spacetime metric can be written in ADM form:
where:
- is the induced spatial metric on ,
- is the lapse,
- is the shift (with ).
Useful identities:
where .
5.2 Extrinsic curvature and kinematics
Section titled “5.2 Extrinsic curvature and kinematics”Define the covariant derivative compatible with : .
The extrinsic curvature of embedded in spacetime is
This shows explicitly that appears linearly in , while and do not appear at all.
5.3 ADM Lagrangian density
Section titled “5.3 ADM Lagrangian density”A standard result of the Gauss–Codazzi decomposition (up to total derivatives absorbed by ) is:
Therefore, dropping total derivatives already accounted for by boundary terms, the ADM Lagrangian density is
5.4 Canonical momenta
Section titled “5.4 Canonical momenta”The canonical momentum conjugate to is
Since enters only through , and
one finds
Taking the trace gives
You can invert to express in terms of :
Primary constraints: because and do not appear in ,
Thus,
are primary constraints.
5.5 Canonical Hamiltonian and the ADM constraints
Section titled “5.5 Canonical Hamiltonian and the ADM constraints”The canonical Hamiltonian is
where should be expressed using
A key computation uses integration by parts:
(up to boundary terms). After rewriting in terms of , one arrives at the standard ADM form
Here is a boundary term (e.g. ADM energy for asymptotically flat spacetimes). The bulk constraint densities are:
Momentum (diffeomorphism) constraint
Hamiltonian (scalar) constraint
(again, up to convention-dependent signs/factors).
5.6 Total Hamiltonian and secondary constraints
Section titled “5.6 Total Hamiltonian and secondary constraints”The total Hamiltonian adds the primary constraints:
Preserving gives
hence the Hamiltonian constraint
Preserving gives
hence the momentum constraints
No new independent constraints appear beyond these (for pure GR); instead, consistency fixes nothing because the theory is gauge invariant (diffeomorphisms).
5.7 Smeared constraints and the constraint algebra
Section titled “5.7 Smeared constraints and the constraint algebra”It is cleaner to use smeared functionals:
Then (schematically) the Poisson brackets close as:
This is the hypersurface deformation algebra (often called “Dirac algebra”). It is not a Lie algebra with constant structure constants; it has structure functions involving .
The closure implies that and are first-class (together with the primary constraints ).
5.8 Gauge interpretation: diffeomorphisms
Section titled “5.8 Gauge interpretation: diffeomorphisms”- generates spatial diffeomorphisms on :
- generates normal deformations of the hypersurface (time reparametrizations / refoliations).
A precise mapping between and spacetime diffeomorphisms requires care, because the algebra closes with structure functions; nonetheless, the standard viewpoint is that these first-class constraints encode the redundancy under diffeomorphisms.
5.9 Degrees of freedom of GR
Section titled “5.9 Degrees of freedom of GR”In 3+1D, the configuration variable is a symmetric tensor: per point.
Constraints:
- primary: (1), (3),
- secondary: (1), (3).
Altogether there are 8 constraints; however, the standard DOF counting for GR focuses on the true canonical pair and treats as multipliers.
On the phase space:
- phase-space dimensions per point,
- there are independent first-class constraints .
Thus
These are the two polarizations of the graviton (gravitational waves) in 4D.
5.10 Linearized check (TT gauge intuition)
Section titled “5.10 Linearized check (TT gauge intuition)”Linearize around Minkowski: . In harmonic gauge one can reduce to transverse-traceless (TT) components satisfying a wave equation. The TT condition removes gauge redundancy and constraints, leaving two propagating modes—consistent with the Hamiltonian count above.
Details: harmonic gauge ⇒ TT wave equation and the “2 polarizations” count
Start from
Infinitesimal diffeomorphisms act as a gauge symmetry:
It is convenient to use the trace-reversed field
The harmonic (Lorenz) gauge condition is
In this gauge, the vacuum linearized Einstein equations simplify to
So the radiative degrees propagate as massless waves.
Harmonic gauge does not fully fix the gauge: it is preserved by residual transformations with
For a plane wave , the field equation gives and the gauge condition gives transversality . Using the residual gauge freedom one can impose the stronger TT conditions (for a wave moving along ):
The only nonzero components then live in the block transverse to the propagation direction and satisfy
A standard basis is
i.e. the two gravitational-wave polarizations.
This is the linearized counterpart of the Hamiltonian counting: the constraints remove non-propagating components and the gauge symmetry quotients out redundancies, leaving two physical modes (in 4D), exactly as in §5.9.
6. Conceptual comparison: Maxwell vs Proca vs GR
Section titled “6. Conceptual comparison: Maxwell vs Proca vs GR”6.1 Why first-class “removes more” than second-class
Section titled “6.1 Why first-class “removes more” than second-class”A single first-class constraint does two things:
- it restricts to the constraint surface, and
- it generates a gauge flow—points along that flow are physically equivalent.
Therefore each first-class constraint removes two phase-space dimensions (one for the surface, one for the orbit), while each second-class constraint removes only one.
- Maxwell: constraints are first-class gauge symmetry 2 physical DOF.
- Proca: constraints are second-class no gauge symmetry 3 physical DOF.
- GR: constraints are first-class diffeomorphism redundancy 2 physical DOF in 4D.
6.2 Nondynamical fields and constraints
Section titled “6.2 Nondynamical fields and constraints”Both Maxwell and Proca have because has no time derivative.
The difference is what happens next:
- Maxwell: preservation produces Gauss law (first-class).
- Proca: preservation produces (second-class with ).
In GR, lapse and shift are nondynamical: . Their preservation produces Hamiltonian/momentum constraints, all first-class.
7. Exercises
Section titled “7. Exercises”-
Maxwell with sources: Add to the Maxwell Lagrangian and show that Gauss’s law becomes . Discuss which parts of the constraint structure change.
-
Coulomb gauge Dirac bracket: In Maxwell theory, impose Coulomb gauge and compute the Dirac bracket for the transverse fields.
-
Proca Dirac bracket: Treat and as second-class and compute the Dirac bracket for and .
-
ADM constraint algebra: Verify at least one of the ADM bracket relations using smeared constraints and integration by parts.
-
GR DOF in dimensions: Generalize the ADM DOF count to spacetime dimensions and show that the number of propagating graviton DOF is .
8. References for further study
Section titled “8. References for further study”- P. A. M. Dirac, Lectures on Quantum Mechanics (constrained Hamiltonian systems).
- M. Henneaux & C. Teitelboim, Quantization of Gauge Systems (modern, systematic).
- K. Sundermeyer, Constrained Dynamics (classic).
- R. M. Wald, General Relativity (ADM, constraints, canonical structure).
- E. Poisson, A Relativist’s Toolkit (3+1 tools, extrinsic curvature).
- Arnowitt–Deser–Misner (ADM) original papers/lectures (historical source).
- 梁灿彬, 周彬. 微分几何入门与广义相对论(下册)第二版. 科学出版社.