Symmetry 2010, 2, 970-998; doi:10.3390/sym2020970
OPEN ACCESS
symmetry
ISSN 2073-8994
www.mdpi.com/journal/symmetry
Review
Noether Symmetries and Covariant Conservation Laws in Classical, Relativistic and Quantum Physics
Lorenzo Fatibene 1'2'*, Mauro Francaviglia 1'2'3 and Silvio Mercadante 12
1 Department of Mathematics, University of Torino, via C. Alberto 10, Torino, Italy
2 INFN, Sezione di Torino, Iniziativa Specifica Na12, Torino 10125, Italy
3 LCS, University of Calabria, Cosenza, Italy
* Author to whom correspondence should be addressed; E-Mail: lorenzo.fatibene@unito.it; Tel.: +39-011-670-2933.
Received: 12 January 2010; in revised form: 13 April 2010 / Accepted: 21 April 2010 / Published: 29 April 2010
Abstract: We review the Lagrangian formulation of (generalised) Noether symmetries in the framework of Calculus of Variations in Jet Bundles, with a special attention to so-called "Natural Theories" and "Gauge-Natural Theories" that include all relevant Field Theories and physical applications (from Mechanics to General Relativity, to Gauge Theories, Supersymmetric Theories, Spinors, etc.). It is discussed how the use of Poincare-Cartan forms and decompositions of natural (or gauge-natural) variational operators give rise to notions such as "generators of Noether symmetries", energy and reduced energy flow, Bianchi identities, weak and strong conservation laws, covariant conservation laws, Hamiltonian-like conservation laws (such as, e.g., so-called ADM laws in General Relativity) with emphasis on the physical interpretation of the quantities calculated in specific cases (energy, angular momentum, entropy, etc.). A few substantially new and very recent applications/examples are presented to better show the power of the methods introduced: one in Classical Mechanics (definition of strong conservation laws in a frame-independent setting and a discussion on the way in which conserved quantities depend on the choice of an observer); one in Classical Field Theories (energy and entropy in General Relativity, in its standard formulation, in its spin-frame formulation, in its first order formulation "a la Palatini" and in its extensions to Non-Linear Gravity Theories); one in Quantum Field Theories (applications to conservation laws in Loop Quantum Gravity via spin connections and Barbero-Immirzi connections).
Keywords: Noether symmetries; (gauge)-natural theories
1. Introduction
Symmetries have acquired a central role in Physics. In Theoretical Physics discrete symmetries encode most of the intriguing structure of the Standard Model for particles, in Chemistry they encode for spectroscopic and physical properties of molecules.
Lie groups of transformations encode properties (often enhancing physical interpretation) of dynamical and Lagrangian systems. For example, theoretical Relativistic Cosmology is entirely based on the symmetry ansatz of homogeneity and isotropy (which of course can be later relaxed by using techniques of Perturbation Theory). Most applications of Quantum Gravity (either loopy or stringy) to Cosmology are entirely based on symmetries since the approach in full generality is still hindered by massive technical difficulties.
In the beginning of the 20th century, Emmy Noether discovered (see [1]) a relation between (continuous families of) symmetries of Lagrangian systems and their first integrals, i.e., physical quantities which remain constant during the evolution of the system and are often related to fundamental physical quantities such as energy, momentum, angular momentum, etc.
Symmetries of mechanical systems (together with their associated conserved quantities) are the basis of the definition of integrable systems that form a class of systems for which dynamics can be determined in general by integration. Generally speaking, knowing a conserved quantity of a dynamical system allows to reduce the dimension in which the system is defined. General integration techniques, such as Hamilton-Jacobi equation, rely in fact entirely on existence of conserved quantities.
Very interesting issues arise when one extends the notion of transformation to encompass transformations depending on velocities (and more generally on accelerations, etc.). These can be seen as transformations on the infinite jet prolongation of the configuration space where the dynamical system can be seen to be prolonged. Here infinite dimensionality enters strategically to require new techniques to extend the procedures which are standard in a finite dimensional arena.
Another interesting setting where symmetries play their role is the framework of Lagrangian field theories, which are the current basis for any approach to fundamental interactions in Physics. This corresponds, loosely speaking, to consider a dynamical system with a continuous infinity of degrees of freedom. In this context, (second) Noether theorem still holds true, though conservation laws needs to be interpreted correctly and differently from what is usually done in Mechanics (in view of the so-called first Noether theorem); if the system lives on a manifold M of dimension m = dim(M)—which usually is identified with the physical spacetime, or, mathematically speaking, m is the number of independent variables of the equations—Noether theorem implies the existence of a (m — 1)-form on M called the Noether current Such a current results to be closed along solutions and hence implies a continuity equation. The conserved quantities are thence defined as the integrals of such currents on a (m — 1)-volume in M. The continuity equation holding for the Noether current relates the changes of conserved quantities to the flows at the boundary of the region (something entering or escaping the region) and some residual at singularities. This setting is particularly suitable for physical interpretation; it was in fact developed, e.g., to define electric charges in Electromagnetism. This setting results to go far beyond Electromagnetism and to be the typical situations for (at least) all fundamental interactions.
In our approach the difference between first and second Noether theorem that is often drawn in literature is quite evanescent. In fact, on one hand in field theory the so-called global transformations cannot in general be defined in a gauge covariant way and only so-called local transformations are intrinsically defined, unless configuration bundle is trivialized (as it happens in Mechanics). In any event, both local and global transformations are generated by a vector field on the configuration bundle of the system. On the other hand, Mechanics can be considered as a particular case of field theory. In fact, in standard (Newtonian) Mechanics m = l (since M = R) and Noether currents are functions; being closed they are constant along solutions and the situation introduced above is obtained as a special case.
Of course, in Mechanics one can also consider local transformations (e.g., in relativistic point particle invariance with respect to reparametrizations is considered); in this case the Lagrangian is degenerate and constraints on momenta appear. Such constraints are known to be hidden in some (consequences) of the Lagrange equations though currently the most efficient tool to analyze the situation is based on canonical Hamiltonian formalism (see [2-7]). However, let us here only mention that these constraints are somehow trivial as long as conservation laws are considered since their associated currents vanish on-shell (once initial conditions obey constraints). The meaning of these conservation laws is very different depending if one is considering first of second Noether theorem but we shall discuss these differences below when discussing the hole argument. Hereafter, we are interested in conservation laws and hence we shall mainly consider general transformations and their relation to continuity equations.
In Field Theory the role of conservation laws, though certainly important as a support to the physical interpretation, appears to be weaker than in Mechanics since the mechanical geometrical picture is lost (we mean, the picture of first integrals each of which determines a level hypersurface of configuration space so that motion is constrained on the intersection of all these hypersurfaces and first integrals define a reduced system living on it). Noether currents are (m — l)-forms and define no level hypersurface; on the other hand a Field Theory has infinitely many degrees of freedom and it is not obvious how to extend mechanical techniques to use conservation laws with the purpose of reducing the systems accordingly; however, though extra care is due to infinite dimensionality, such a reduction can be done; see [8].
On the other hand, the history of Field Theory is strongly entangled with symmetries. All the theory of continuity equations was developed to account in Field Theory of charge conservation. When Einstein proposed equations to describe the gravitational field in General Relativity (GR) their form was obtained on the basis of conservation of energy—momentum tensor describing matter (as well as, independently, by Hilbert on the basis of his variational principle).
GR itself is a source of fantastic examples of the role of symmetries in Field Theories. One can prove general theorems to show that Noether currents are not only closed forms, but even exact forms along solutions. This introduces a superpotential U for each Noether current and conserved quantities are obtained by surface integrals of the superpotential. Generalized Stokes theorem in this context establishes a bridge between conserved quantities and (co)homology of forms (and hence with topology and global structure of the spacetime manifold) which happens to be central in the physical interpretation of the model.
Similar situations can be recognized in other areas of Physics such as in Gauge Theories; the same structures of superpotential forms can be generically recognized in Gauge Theories as well as back in
Mechanics when one wants to use (or needs to use, as it happens for the relativistic material points) homogeneous formalism (i.e., treating time and space on equal footing).
Hereafter we shall show some of the relations among symmetries and conservation laws in different areas, preferring a unifying languages that stress similarities.
2. Geometrical Setting
We shall here present notation for a geometrical setting for Lagrangian systems which encompasses both Field Theories and a suitable setting for Mechanics. We refer the reader to [9] for further and deeper details.
Let M be an (orientable, connected, paracompact) manifold of dimension m = dim(M) with local coordinates xp that will be eventually considered as independent variables in the Variational Calculus.
Let C = (C, M, n, F) be a bundle over M with projection n and standard fiber F; see [9]. Let yi be local coordinates on F that will eventually represent the dependent variables of the variational principle representing the dynamics of the Lagrangian system under considerat ion.
A vector w E TC is vertical iff Tn(w) = 0, i.e., is expressed in any fibered coordinate system (xp, y%) as w = w%di. The set of all vertical vectors at p E C is denoted by Vp; the union V(n) = UpVp C TC is a sub-bundle of TC.
This setting is general enough to encode Field Theories and Mechanics; in particular in Field Theory M is assumed to represent spacetime (hence usually m = 4) while in Mechanics one has M = R (hence m =1) and base coordinates xp usually reduce to t. Configurations of the system are locally given by assigning the dependent variables as functions of the independent ones. In Field Theories this means yi (x), while in Mechanics yi(t) locally represent curves in F (which in this case is called the configuration space of the system). Of course global structure of C (or F) encodes the global properties of the system; in particular in Field Theories one considers transformation rules of fields with respect to changes of fibered coordinates; these transformation rules encode in particular the geometrical character of fields, i.e., for a Riemannian metric one has for example
ix ' p = x' p(x)
' = p -a (1)
= Jp 9pa Jv
where Jp denotes the anti-Jacobian of the coordinate change on spacetime, namely of x= x' p(x).
Moreover, these transformation rules encode also global properties, in the technical sense that once they have been fixed one can prove that there exists a unique bundle C (modulo isomorphisms) having those functions as transition functions. This is quite satisfactory from the physical viewpoint since local descriptions given by local observers in terms of local fibered coordinates, together with transformation rules (which dictate how one can deduce the readings of an observer knowing the readings of any nearby observer) allow to uniquely describe the physical situation in an observer-independent and global way.
Global configurations are global sections of the bundle C, i.e., maps a : M ^ C such that n oa = idM. They are locally expressed by functions yi(x) and transformation rules account for global properties. Usually only local sections exist (i.e., sections on open subsets U C M) and the existence of global sections of C implies topological restrictions and/or obstructions.
Once the configuration bundle C is given one can define a new bundle, canonically (i.e., functorially) associated to C, which accounts for derivatives of dependent variables with respect to independent variables up to some finite order k. This new bundle is called the k-order prolongation of C and it is denoted by JkC. If (x^,y%) are fibered coordinates on C, then (xyl,yll, y^,...) are fibered coordinates on JkC, where y% stands for the first derivatives, ylilv stands for the second derivatives (and are accordingly assumed to be symmetric in the lower indices (^v)), and so on.
Any JkC is a bundle over any other Jk-sC (for any integer s > 0). All prolongations of a bundle define a projective family and one can define its inverse limit, that is denoted by J^C. (This is a bundle in a broader sense, since it is infinite dimensional).
A Lagrangian of order k is a bundle map L : JkC m Am(M) where Am(M) is the bundle of m-forms over M. Equivalently, the Lagrangian can be seen as a horizontal m-form over JkC. The action is the functional defined as
Ad [a]=/ L o jk a (2)
for any m-region D c M and any (local) configuration (i.e., section) a. Hamilton stationary action principle is in this framework a definition for critical sections, i.e., sections that are critical points of the action functional with respect to a canonical class of deformations. A deformation is a vertical vector field X on C. Let us denote by $s its flow so that we can drag any configuration a along X defining a l-parameter family of configurations as = $s o a. The variation of the action along the deformation X is defined as
Sx Ad [a] = J d L ◦ jka.s) |s=o (3)
Hamilton stationary action principle: a configuration a is critical iff for any compact m-region D c M and deformation X with supp(X) c D one has
Sx Ad [a] = 0 (4)
Equivalently, one can also consider more general deformations only requiring that X vanishes on the boundary dD together with its derivatives up to order k — l. Critical sections can be shown to obey Euler-Lagrangian equations and physically represent allowed configurations (i.e., configurations which satisfy field equations). This framework reduces locally to the usual Variational Calculus.
As we said, in Mechanics one has M = R and m = l. Being R a contractible manifold the bundle C is necessarily trivial, i.e., diffeomorphic to a Cartesian product R x F; see [10]. Nevertheless, this diffeomorphism is not canonical, but it depends on a reference frame which is realized mathematically by a connection on the bundle C. A connection is a family of hyperplanes Hp c TpC such that H = UpHp is a sub-bundle in TC and at each point TpC = Vp ®Hp (by the way, since the curvature of the connection is skew and in this case m = l, any connection on C is flat). A connection in this case is represented by a distribution Hp of rank l on C; being of rank l it is involutive and, by Frobenious theorem, integrable. Hence one has defined a foliation in curves (trajectories, i.e. unparametrized curves, to be precise) of C. The leaves yp are nowhere vertical and establish diffeomorphisms between any pair of fibers ^-1(i0) and n-1(t1); hence they induce a particular diffeomorphism t : C ^ R x F, i.e., a global trivialization, in which yp : t m- (t, f0) for some constant f0 E F. These sections represent rest for the global observer associated to the trivialization t defined above. One has thence two possible frameworks for Mechanics:
one on R x F, that is a framework for a fixed observer (or reference frame); and another on C, that is potentially independent of the observer. In both cases the framework is suitable for describing in particular any holonomic, possibly time-dependent, Lagrangian system.
Since the Lagrangian is used only to define the action (and then the action itself is used in Hamilton principle), in all Field Theories one could add to the Lagrangian terms which do not affect the value of the action functional. In fact there are forms, called contactforms, which vanish when computed along each configuration of the system; in fact, if a form is contact it factorizes terms such as := dyi — y\dxx, := dyp — ylilXdxx, and so on.
A Poincare-Cartan form for a Lagrangian system is a form 6L = L + Q on J2k-1C that differs from the Lagrangian L by a contact form Q. The action can be written also in terms of the Poincare-Cartan form as
Ad[a]= / L o jka = i (jka)*0L (5)
The contact term Q, which does not affect field equations, solutions and so on, can then be tuned to enhance the properties related to conservation laws and symmetries.
One requires (besides some other technical requirements; see [9]) that for all vertical fields X(2k-1) of J2k-1 C the form iX dOL is contact. Here iX denotes the usual duality between forms and vector fields. In the next Section we shall see how Poincare-Cartan forms are adapted to conservation laws and symmetries. Below we shall see explicit examples for coordinate expressions of Poincare-Cartan form.
The theory of Poincare-Cartan form was fully developed in the late '70s-early '80s (see [11-15]). In Mechanics one can prove that there exists a unique Poincaree-Cartan form at each order. In Field Theory, there is a unique Poincaree-Cartan form in theories of order 1 , there is a canonical choice for theories of order 2, while for Field Theories of order k > 3 there is a Poincare-Cartan form for each connection on the base manifold (one needs integration by parts to define it, and one needs covariant integration by parts to control globality; different connections define then different Poincaree-Cartan forms when k > 3). This non-trivial (and to some extent unexpected) structure on uniqueness issue is very beautiful and inspiring, for example for issues related to Hamiltonian formalisms.
2.1. Field Equations
Most of Variational Calculus is in fact related (if not completely encoded) in how deformations defined on the configuration bundle C prolong to higher order jet prolongations JkC.
A (projectable) vector field E = £pdp + £idi (here projectable refers to the fact that the components £p are functions of the independent variables only, i.e., £p(x)) can be prolonged to jet bundles to define vector fields j kX on each JkC.
In general, let : C C be the flow of X which projects onto the flow tps : M ^ M of the vector field £ = £pdp = (n)*E on M. For any configuration a one can drag it along the flow by defining as = $s o a o p-1; these can be easily checked to be configurations as well. Accordingly, one can define prolonged flows jk$s : JkC — JkC : jXa — jks(x)as together with their infinitesimal generators j k X = j $s|s=Q. S
For example for k = l one has that
j 1x = ^+?dt + de — y) dt (6)
is a good (i.e., global) vector field on J1C. Here dt = dt + ytdi + y^xdi +... denotes the total derivative operator for (local) functions on JkC.
If one restricts to deformations (i.e., vertical vector fields) prolongations are
X = Xidi
j 1x = x jdj + dX idt
j 2X = Xjdj + dXjd? + dtv X jdtv (7)
Field equations of a (k-order) Lagrangian L are the differential equations which a configuration should obey in order to be critical. The (k-order) Lagrangian form is locally expressed as
L = C(xx,yi,yix,... )ds (8)
where ds is the local canonical basis of m-forms on M induced by coordinates xt (i.e., local volume). The deformation is in the form X = Xjdi = Syidi.
By suitable integration by parts one can split the Lagrangian form in two parts, so that
SxAd [a]= / EiXids + / FfXidst (9)
JD JdD
where dst is the local canonical basis of (m — l)-forms on M induced by coordinates xt. This splitting is encoded in global bundle morphisms
E : J2kC M V*(C) 0 Am(M) F : J2k-1C M V*(Jk-1C) 0 Am(M)
where V*(C) are dual to vertical vectors on C, while V*(Jk-1C) are dual to vertical vectors on Jk-1C. Denoting the dualities by < | > one has local expressions
< E|X >= EiXids
< F|jk-1 X >= FfXidst For a first order Lagrangian one has for instance
E- = — — d —
i dyi (l2)
Ft = ()
fi dyj,
While the Euler-Lagrange morphism E is directly related to field equations (and it is unique at all orders) the Poincare-Cartan morphism F, that is not unique in general because of the non-uniqueness of the Poincare-Cartan form, is more properly related to conservation laws.
Globally one can characterize the morphisms E and F by the so-called first variation formula, that once integrated gives (9)
< 5L\jkX >=< E\X > +Div < F|jk-lX > (13)
where Div is the divergence operator that corresponds, after evaluation along a configuration, to exterior differential on forms on M.
Let us remark that in Field Theory most Lagrangians are highly degenerate, which means that they do not depend on all derivatives of fields but just on some (suitable) combinations of them. Usually, these combinations are chosen to define some geometrical object with simple transformation laws. In this case it is often convenient to proceed by covariant integration by parts, in order to control globality of each single term in the equations instead of controlling only the globality of the whole equation. Examples will be considered below.
Before turning our attention to symmetries let us mention that the role played by flows above can be replaced by more general objects. Let us consider a family of maps : JlC — C that are locally expressed by
x = x' V(x)
y'* = Y *(xx,yl,y\)
and represent a sort of field transformation depending on the derivatives of field (as it happens in general for supersymmetries, Backlund transformations, etc.). Also in this more general case—in which the transformation can no longer be interpreted as a geometrical transformation on the manifold C—one can still define an infinitesimalgenerator
E = e(xX)d, + C(x\yl ,y\)di (15)
However, this object is no longer a vector field on C (since its components are not local functions on C). Objects like this are called generalized vector fields and can be seen as sections E of the bundle (nk)*(TC) — JkC that is defined by pull-back as follows:
(nk )*(TC)-► TC
Jk C-C
By the usual abuse of language the section E has local expression (15).
One can then extend the prolongation to generalized vector fields by formally using the same formulae (6) and (7). In the projective limit these families generate well-defined vector fields on J^C, while they are, although in a not yet formalized way, generalized at each finite level JkC.
We shall see below examples of such generalized vector fields when they represent symmetries and hence they define conservation laws. Let us stress that Emmy Noether was in fact the first to consider these examples just in her early studies aimed at showing a sort of inverse theorem; each conservation law can be generated by a suitable generalized symmetry.
3. Noether Theorem
Since field equations are mainly encoded in the geometry of jet prolongations (see [16]) Noether theorem can be understood in terms of Lie derivatives (see [17]).
Let S = £tdt + £idi be a (projectable) vector field on the configuration bundle C that projects onto a vector field £ = £tdt on M and let a be a configuration; let us define the Lie derivative of a with respect to S to be
£sa = (£t(x)yt — £i(x,y))di (l6)
This is a (generalized) vertical vector field and it accounts for the change of the configuration when dragged along the flow of S. The same expression holds true when S is a generalized vector field by itself.
This Lie derivative is natural in the sense that it preserves commutators, i.e.,
%1,S2] = [£Sx , £»2 ] (l7)
Of course this is true when commutators are considered as the commutators of (possibly generalized) vector fields on C. It is essential to notice here that configurations can be in principle dragged along the flow of vector fields on C only. In general, there is no dragging in C along vector fields on the base M, as one is instead "physically" used to expect in many physically relevant cases (e.g. in relativistic applications). Accordingly, there is no reason to expect that in general one could define Lie derivative of configurations in C along vector fields on M, nor that this can be done in such a way that they are natural, i.e., they preserve commutators.
There are however specific bundles in which one could naturally associate a vector field £ on C to each vector field £ on M (as, e.g., natural bundles or, in a sense, gauge-natural bundles; see [9].) For naturality one has that
[£>< r=[£,C] (l8)
One classical example of this situation is on tangent bundles C = TM; on any tangent bundle one has in fact a tangent lift of vector fields which preserves commutators. We have to stress that the existence of such a natural lift is a property of the bundle C. When such a lift is defined the bundle C is in fact called natural; one could prove that natural bundles are associated to (some finite higher order) frame bundle Ls(M) on the base manifold; see [18]. The tangent bundles are in fact always associated to the standard frame bundle L(M).
Only on natural bundles one can define dragging along £ and the corresponding Lie derivative is defined as
£ a := £a (l9)
which is in turn defined as above. Since the lift £ M £ is natural one can prove that this Lie derivative is also natural, i.e.,
%,Z]a = %Y a = [£S> £C]a = [£' £z]a (20)
We shall present below various examples of this and similar situations. We wish to stress here that the naturality (17) of Lie derivatives on C is essential for applications to conservation laws, while the naturality of the lift (18)—as well as the naturality (20) of the Lie derivatives on M—is not. These
are "good news," since not all Field Theories have a natural lift; examples will be presented in Gauge Theories and Spinor Theories, in which one could not define a gauge covariant lift that is natural while conservation laws are still perfectly defined.
Noether theorem asserts a correspondence between Lagrangian symmetries and conservation laws. A Lagrangian symmetry is a (possibly generalized) vector field E on the configuration bundle C such that
< 5L \ jk£Bo >= Div L+ < a\jr£Ba >) (21)
for some bundle morphism a : Jk-lC — V*(JrC) ® Am-i(M).
By using the first variation formula on the l.h.s. one can easily prove Noether theorem in the form
< E \ £sa> +Div (< F \ jk-l£sa >) = Div iL+ < a\jr£sa >) (22)
that can be recasted as
Div (< F \ jk-l£sa > -i^L- < a\jr>) = - < E \ £Ba > (23)
By direct inspection, the Noether current defined as E =< F \ jk-l£^a > —i?L— < a\jr£Ba > is closed along critical configurations, which manifestly annihilate the r.h.s. Of course the difficulty in finding conservation laws is here replaced by the difficulties in finding Lagrangian symmetries E. Examples will also be given below.
Symmetries can be described and treated directly also in terms of the Poincare-Cartan form. Condition (21) can be written in an equivalent form as follows
£j2k-i S6L = da (24)
for some (m — 1)-form a; i.e., a Lagrangian symmetry leaves the Poincare-Cartan form invariant. This characterization of symmetries is beautiful and geometric, and this is a motivation to define Poincare-Cartan form.
Noether theorem is then simply obtained in the usual form for Lie derivatives of forms, namely
£j2k-iE0L = Div(ij2k-iE0L) + ij2k-isDiv6L (25)
The term ij2k-isDiv6L = 0 is nothing but field equations and the Noether current is given as
E = ij2k-iB0L — a (26)
which, of course, is closed on-shell.
Conservation laws are thence expressed as (on-shell) closures of Noether currents. If one chooses coordinates x^ = (t, x%) adapted to a spacelike foliation of M, to mimic what one is used to do when choosing Cartesian coordinates in Minkowski space, the conservation law is expressed as
p + divj = 0 (27)
where we set p = E and j1 = E and • denotes derivative with respect to time t. These kind of equations are called continuity equations since p is a density and the quantity
Q = i P = i E jd jd
(for any spacelike (m — l)-region D in M) is conserved, in the sense that its variations are controlled by the flow of j through the boundary of D. Having said that, conservation laws dE =0 are nothing but the covariant form of continuity equations.
For a wide class of Field Theories (including Natural and Gauge-Natural Theories) one can also show that Noether currents are not only closed forms on-shell, but they are also exact; see [9]. One can in fact show, by defining an explicit and algorithmic procedure of covariant integration by parts, that in those cases the Noether current can be (globally) recasted as
E = E + DivU (29)
The (m — l)-form E is called the reduced current and it vanishes on-shell, while the (m — l)-form U is called the superpotential. Examples will be presented below. Accordingly, the Noether current is written on-shell as the differential of the superpotential.
The corresponding conserved quantities are thence defined as surface integrals of the superpotential Q = fQ U on (m — 2)-regions Q c M. This establishes a deep connection between conserved quantities and cohomology of the spacetime manifold M.
The conservation laws involving Noether current (i.e., dE = 0) hold on on-shell, i.e., along critical configurations, and therefore they are called weak conservation laws. When one defines the superpotential U then an equivalent conservation law can be written as d(E — E) = d(DivU); this holds true for each single configuration (also non-critical). For such a reason these conservation laws induced by superpotentials are also called strong conservation laws.
In the case of Mechanics one can use the augmented de-Rham sequence
0 M R M A0(M) m A1(M) M ... (30)
and the Noether current is an element in A0(M) that is closed on-shell. In view of exactness at the relevant level, Noether currents in Mechanics are therefore constant on-shell and they are called first integrals.
Finally, let us remark that everything we said above applies also to Lagrangian symmetries S; equivalently, one has a flow of transformations that leaves the Poincaree-Cartan form invariant, namely the flow of S, i.e., a l-parameter group of symmetries. In many interesting cases, some of which will be discussed below, Field Theories have a symmetry group of dimension higher than l.
Noether current, which depends on the symmetry generator, can also be considered as a map from the Lie algebra of (infinitesimal) symmetries to (on-shell) closed (m — l)-forms. Such a map is usually called momentum map (see [2]) and the naturality with respect to Lie derivation is the remnant of the group structure of symmetries and their preservation.
Let us stress that symmetries are required to form a group of transformations of configurations, not necessarily a group of transformations of spacetime. Sometimes a group of spacetime transformations naturally induces a group of transformations on configurations (e.g., when configurations are represented in terms of spacetime tensors or more generally by geometrical objects) but in general it does not. From this viewpoint, Lie derivatives of fields with respect to spacetime vector fields is an additional (and not necessary) structure; one somehow needs to require naturality (17) while the lift (19) and its naturality (18) is not always available (nor, in fact, necessary).
4. Applications to Mechanics
As examples in Mechanics we shall briefly review some standard examples with the aim of fixing notation.
The standard framework for first order Mechanics is based on a configuration manifold Q with points that correspond to system configurations; local coordinates qi are also known as Lagrangian coordinates for the system. The histories of the system are encoded by (parametrized) curves 7 : R — Q in the configuration space Q.
Equivalently, one can define, as discussed above, the configuration bundle C; by means of a reference frame (i.e., a connection on C) one can define a global trivialization C ~ R x Q. We shall denote fibered coordinates on C by (t,qi).
In this setting histories are in 1-to-1 correspondence with sections of the configuration bundle, that will be denoted by an abuse of language again by 7. Since we are restricting to the first order Mechanics, the Lagrangian should be given on the first jet prolongation J*(R x Q). By means of the reference frame we have a special set of motions singled out in the configuration bundle; these denote "rest motions" and can be prolonged (as we saw for any motion) to first jet bundle. They define a global isomorphism J*(R x Q) ~ R x TQ since they define what one has to understand for "zero velocity" (of course depending on the reference frame).
Accordingly, we can use fibered coordinates (t, qi, u%) on R x TQ, where ui denote the Lagrangian velocities. The dynamics is described by a Lagrangian
L = L(t,qi,ui)dt (31)
or equivalently by a Poincaré-Cartan form
0L = L(t,qi,ui )dt + PiJ (32)
where we set pi = dL (t, q, u) and = dqi — uidt for the relevant contact 1-form on R x TQ.
The Noether theorem presented in general above is generalized in order to encompass the standard one for:
i) a symmetry that is a vertical vector field on the configuration bundle, i.e., 5 = £i(q)di such that its
tangent prolongation 5 leaves the Lagrangian invariant (5(L) = 0);
ii) a symmetry that is a (non-vertical) vector field 5 on the configuration bundle, e.g., if the Lagrangian
density is independent of time t then 5 = dt is in fact a symmetry;
iii) vector fields that leave essentially invariant the system (e.g., when in the notation introduced above
one has a = 0);
iv) generalized vector fields to obtain first integrals that are not simply linear in the momenta, as for
example the Runge-Lenz vector in Kepler's motion.
4.1. Case (i)
One can always change fibered coordinates to new fibered coordinates (t,qi) in which S = d1. That is a symmetry iff the Lagrangian does not depend on q1, i.e., it is cyclic. In other words these cases correspond to all cases of ignorable coordinates, though S is a symmetry in all coordinate systems, while q1 is cyclic only when S = d1. Thence these cases are equivalent to a coordinate free notation for cyclic coordinates.
The corresponding first integral is
E = ^ <33>
that in adapted coordinates corresponds in fact to the momentum p1 conjugated to the cyclic coordinate q1.
For example, a particle in a constant gravitational field is described by the Lagrangian
L = m (x2 + y2 + z2) — mgz (34)
in Cartesian coordinates (x, y, z).
The coordinate x (y, respectively) is cyclic and the corresponding momentum px = mx (py = my, respectively) is a first integral that corresponds to linearmomentum.
The vector field S = xdy — ydx is a symmetry and it corresponds to the fact that in cylindrical coordinates (r, 9, z) the angular coordinate 9 is cyclic. The corresponding first integral
E = m (xy — yx) (35)
corresponds to the z-component of angularmomentum.
In cylindrical coordinates one has in fact S = de and the corresponding first integral is given by E = mr29.
4.2. Case (ii)
When the Lagrangian is independent of time t (e.g., whenever holonomic constraints are imposed with no explicit time dependence) the vector field S = dt is a symmetry and the corresponding first integral is given by
H = pU — L (36)
that corresponds to mechanical energy.
In the case of particles in a constant gravitational field one has
H = m {x2 + y2 + z2) + mgz (37)
4.3. Case (iii)
For the Lagrangian (34) the vector field S = dz fails to leave the Lagrangian invariant; in fact one has Lie derivatives £sx = £sy = 0 and £Bz = — l. The l.h.s. of the covariance identity (21) reads then as
< SL | jk£sa >= —mg = — (—mgt) (38)
that fails to vanish though it is easily recognized to be the total derivative of a quantity a = -mgt. This is for sure physically expected since, in view of gravitational field constancy, the system is unchanged if everything is translated up along the z-axis. If this is physically trivial, one should notice that mathematically one needs to consider generalized symmetries even to encompass these simple examples. Noether theorem hence applies and the corresponding first integral is
F = m (—z + gt) (39)
In this case we know the general solution of the equations of motion (x = Xq ++ Ugt, y — Xq ~~+ Wqt, z = xQ + u^t — 1 gt2) and it is easy to show that the quantity F = —mW3 is in fact constant on-shell.
4.4. Case (iv)
For Kepler system
L = 1 (r2 + r202) + K (40)
one can consider the following generalized vector fields
51 = —r20 sin 0dr — (2r0 cos 0 + r sin 0)do
52 = r20 cos 0dr + (—2r0sin 0 + r cos 0)de (41) These are symmetries; in fact, the l.h.s. of the covariance identity for the first vector reads as
< 5L | jk£Sla >= d (r2 sin 0r0 + r3^2 cos 0 + k cos ^ (42)
while for the second vector it reads as
< 5L I jk£B2a >= — dt (r2 cos 0r0 — r302 sin 0 — k cos ^ (43)
The corresponding first integrals RA = — ^Q — a (A = 1, 2) are
Ri = r302 cos 0 + r2r0 sin 0 — k cos 0
Rq = r302 sin 0 — r2r0 cos 0 — k sin 0 (44)
One can easily check that these are the two components of the vector field
R = Ridx + R2dy = v x (r x v) — ndr (45)
which is called Laplace vector or Runge-Lenz vector. In other words the vector R is constant on-shell. This vector was known in Kepler problems (or, equivalently, in Coulomb electrostatics) to be related to the fact that perihelia are fixed (and when perturbations are introduced it relates to the precession of perihelia).
Notice that the vector R is quadratic in the Lagrangian velocities. Since momenta are linear in the Lagrangian velocities this means that R is quadratic in the momenta. Hence, it could not be obtained by ordinary Noether theorem which produces only first integrals linear in the momenta.
5. GR and Natural Theories
General Relativity (GR) is based on the principle of general covariance (together with the equivalence principle). General covariance is a symmetry requirement; one assumes that spacetime diffeomorphisms Diff(M) act on configurations (i.e., fields are geometrical objects, e.g., tensor fields) and all these transformations induced by spacetime diffeomorphisms are symmetries for the dynamics.
This assumption combined with Noether theorem has plenty of consequences. The configuration bundle is a natural bundle, the Lie derivatives are defined with respect to spacetime vector fields that are all symmetries and they all generate conservation laws. Moreover, one can show (see [9]) that Noether currents always admit superpotentials and conservation laws are always defined a la Gauss by surface integrals.
Moreover, in such kind of theories the whole set of conservation laws are equivalent to the dynamics of the system. In fact, one has a k-order Lagrangian L which defines field equations E = 0 via first variation formula
< SLjkX >=< E|X > +Div < F|jk-1X > (46)
First variation formula holds in particular for the Lie derivative along each spacetime vector field X = L and gives conservation laws
DivE = — < E|£o > (47)
for the Noether currents E =< Fjk-1£% o > —i%L. Then one can define superpotentials U
E = E + DivU (48)
The reduced current is always a combination of field equations. Thus if one knows Noether currents of a Natural Theory and conserved quantities by means of their superpotentials, then one can compute the reduced currents (i.e., field equations) purely in terms of conservation laws.
For example, let us consider "standard GR", which is defined as a second order theory on the configuration bundle Lor(M) of Lorentian metrics on spacetime M, with coordinates (x^,g^v). Dynamics is defined by the Hilbert Lagrangian
LH = ^Rds (49)
where R is the scalar curvature of the metric g and ^fg denotes the square root of the absolute value of the determinant of the metric g^v.
The variation of this Lagrangian along the deformation X = 5ga^ dap
< 5Lh \j2X >= — 1 Rga^j ögaßds + VA ggaß5uxaß) ds (50)
where u^ß = {gj^ß — ö(a{g}ß)e and {g}aß denotes the Levi-Civita connection of the metric g, i.e. its Christoffel symbols. Thence we have
< E\X >= ^g(Rafi — 2Rga^j ögaßds < F\jlX >= ^~ggaßöuxaßdsx (51)
Let us stress that each spacetime vector field £—and not only Killing vectors as sometimes erroneously claimed in the literature—generates conservation laws. By expanding the Lie derivatives in terms of the spacetime vectors field one obtains Noether current as
e = Vg ( (2Raa — R£a + {gMK — ga(Yv*dsa (52)
which by suitable covariant integration by parts can be recasted as E = E + DivU, where we set
E = 2Vg (ffa — 2R^) CXdsa U = £adsal3 (53)
The superpotential U is called Komarpotential in honor of Komar who first proposed it, though originally restricted to a timelike Killing vector £, while here £ is instead any spacetime vector field.
As we discussed above, one can deduce field equations Rai — 1 Rgap = 0 from Noether current E and Komar superpotential U by computing reduced current as
E = E — DivU (54)
just noticing that this quantity has to vanish for all solutions and all symmetry generators £.
This situation, in fact, is completely general for any Natural Theory of any order and any matter coupling, since it is based on general covariance principle only. Any Natural Theory comes with a huge symmetry group, namely Diff(M), which identifies intrinsically (i.e., in an observer independent way) a huge set of conservation laws and conserved quantities.
Problems start when one wants to identify some physically relevant quantity—e.g., the energy, the momentum, the angular momentum,... —within this intrinsic set of conservation laws. This is already a problem in Newtonian Mechanics (see [19,20]) where it should be clear from the very beginning that there is no intrinsic notion as the energy of a system, in the sense that different observers (even inertial observers) do in fact measure different energies, momenta, . . . for the same system. This obvious circumstance is often undervalued (or even ignored) in current literature on Mechanics, with the consequence of generating a number of misunderstandings about conservation laws that reverberate and amplify in Field Theories. This would indeed be a trivial remark if it were not used sometimes in the literature to argue that conserved quantities in GR must have a non-covariant genesis. Such arguments come down to (at least) two different main points:
i) covariant conservation laws are not conservation laws;
ii) covariant conserved quantities would not depend on the observer as physically expected and as
pseudotensorial prescriptions do.
Both these points have a long history and can be somehow traced back to Einstein himself. However, after almost one century of investigations they can be today shown to be flat wrong at least in some sense.
5.1. Covariant Conservation Laws
Item (i) comes historically from the observation that covariant conservation laws
VE = 0
would not reduce to continuity equations (which is what one usually means for "conservation") due to terms depending on the connection {g used to define the covariant derivatives. This would be certainly the case if E^ were components of a vector field. Then covariant conservation laws would be conservation laws tout court only for those observers (here identified with coordinate systems) in which such terms vanish (i.e., when {g}^ = 0). Only in these cases covariant conservation laws are genuine conservation laws and they are intrinsically non-covariant since they break down general covariance.
This argument is true for all currents E except in one single case: the case in which E^ is a vector density of weight l. In this case there are two terms depending on the connection and they cancel out. In other words, when E^ is a vector density of weight l the covariant divergence is automatically identical to the ordinary divergence, i.e.,
VE = dE (56)
and they always define true continuity equations for any observer.
Of course, Noether theorem in the form presented here dictates for Noether currents to be (m — l)-forms, i.e., their components E^ are in fact vector densities of weight l. Hence it always produces authentic conservation laws which are at the same time covariant.
5.2. Observers
Item (ii) comes from the belief that covariant conservation laws would necessarily define conserved quantities independent of the observer. This is certainly true for the whole set of conservation quantities; it is defined covariantly and all observers agree on it.
However, each observer can then be asked to identify within the set of all conserved quantities which one represents a physically relevant quantity, e.g., the energy. It is not the notion of first integral to be observer dependent; a quantity either changes or not during the evolution of the system and all observers agree on it. But it is rather the energy that is related (in Newtonian Mechanics, but also in Special Relativity) to time translations; since in SR time depends on the observer, the notion of energy depends on the observer, accordingly.
In GR each observer (which is identified with a local coordinate system together with a protocol to synchronize clocks at different points, that foliates spacetime with leaves representing synchronous event sets; such foliations are known as ADM-foliations, see [21]) comes with a timelike vector field £. The Noether quantity associated to such a timelike vector field (no request about it being Killing; see [22]) is defined to be the energy for the observer for which timeflow is given by £.
Since the prescription is covariant, any other observer agrees that this energy is the Noether quantity associated to that timeflow £; however, different observers might disagree on £ being the timeflow and consequently that such a conserved quantity has to be considered to be the energy of the system. On the contrary, they use their own timeflow £' and define their own energy. In this way, which is the only reasonable way to extend to GR what one is used to do in SR, despite the prescription is covariant still the energy is not and still it depends on the observer.
It is therefore an open issue whether we really need to know what is the energy or we could have a fundamentally good description of the physical world by just using the set of conserved quantities. In other words, if energy is just something we are used from Newtonian Physics, or rather one can produce
a fundamental description of the physical world without singling out special conserved quantities to be given special meanings.
One could also ask whether it is useful to have covariant prescriptions for conserved quantities, since sooner or later covariance have to be broken in favor of observer-dependent quantities. Why one should not be satisfied with coordinate-depending prescriptions such as the ones based on pseudotensors? Let us counter-argue that on manifolds there is nothing defined as coordinate dependent integrals to define conserved quantities from pseudotensors. Pseudotensors are coordinate dependent quantities, often their genesis from geometric objects is not clear, they are known in some coordinate system but they are often unspecified in other coordinates. Moreover—and for obvious reasons—they lack of any coherent and (mathematically and physically) intrinsic meaning, if any may even exist!
We do not claim that pseudotensors should be completely forbidden in Physics, since they appear to be often "useful"; but we believe that for a reasonable, global and covariant interpretation their genesis from some geometric object must be always made explicit and, in any case, they have to be treated cum grano salis. Pseudotensors should be defined starting from geometric objects by fixing coordinates in order to neglect some term. This scenario has two good features to be noticed: it implicitly defines the coordinate systems in which pseudotensors can be used and it defines its integrals by means of the integral of the original geometric object. Let us refer to [22] for an example of this coherent strategy.
5.3. Extended Theories of Gravitation
As we said above, the structure of GR is not peculiar of the Hilbert Lagrangian (49); most of this structure is determined by the symmetry group, i.e., spacetime diffeomorphisms, and it is a feature of any generally covariant theory.
Let us here briefly consider the class of Extended Theories of Gravitation (ETG); see [23-25]. This class of theories is used in Cosmology and Astrophysics in order to model phenomena and observations that are usually related to dark matter and dark energy; [26].
Let us first consider a Lorentzian metric g^v and a connection as fundamental fields. The configuration bundle is the product Lor(M) x C(M) of the bundle of Lorentzian metrics and the bundle of connections on M (here assumed for simplicity with no torsion). Here connections are in principle independent of the metric field.
Let us denote by Rae^v (Rev) the Riemann (Ricci) tensor of the connection Г and by R = gev Rpv the scalar curvature which depends on both the metric and the connection. These curvatures are not to be confused with the curvatures , (gR@v, ^-R of the metric g alone. This double affine structure
(the one induced by Г and the one induced by g) opens a number of issues about physical interpretation; see [27].
In the notation here introduced let us consider the Lagrangian
Lf = /gf (R) ds (57)
where f is a generic analytic function of the scalar curvature. For the specific case f (R) = R this reduces to standard Hilbert-Einstein Gravity (49).
When varying, the metric and connection are deformed independently and there are field equations from variation of both fields:
f (R)%v) — 2 fg^v = 0 (5g)
Va (^gf'g^) = 0
The second equation is solved defining a new metric field = f (R)g^v conformal to the original one; the connection is then forced to be the Levi-Civita connection of the metric h, i.e., P^ = {h}^.
The trace (by ) of the first equation f (R)R — 2R = 0 is called the master equation and (once f is considered a fixed function) it is algebraic in the scalar curvature R. In general the master equation allows to solve the scalar curvature as a function of the matter sources; here we are considering the vacuum case and R is constant (on M) and determined by the zeroes of the master equation (which are finite and generically simple). When this information is plugged into the first field equation one can show that the metric h obeys modified Einstein equations; the function f can be chosen so that the modified dynamics accounts for dark matter/dark energy phenomenology with no need to introduce exotic matter sources, which currently cannot be observed directly.
Again these models are generally covariant and any spacetime diffeomorphism is a symmetry of the theory; they are Natural Theories. Any spacetime vector field £ is a symmetry generator and its Noether current is given by
E = V~g (((R) gap <p — £af (R)) dsA
=2Vh(^(h)RAe — 4(h)RSA^J CdsA + V^VhV[e£A]) dsA (59)
where we used the master equation, r = {h} and the curvature tensors of the conformal metric h are denoted by (h)Ra^v, (h)Rpv and (h)R.
This corresponds to define the reduced current and the superpotential as:
EE = 2Vh ^(h)RAe — l(h)R8A^ £edsA U = VhVl"£A]dsAe (60)
Notice once again that the reduced current E identically vanishes on-shell and it is equivalent to field equations.
ETG are also considered in the purely metric formalism, in which one considers the connection r to be r = {g} from the beginning, i.e., at the kinematical level. The configuration bundle is simply Lor(M) and the Lagrangian (57) (now considered as a function of the metric g alone) is second order. This leads to forth order equations with modified effects with respect to standard GR.
From this point of view standard GR is shown to be degenerate (it has second order field equations) with respect to the extensions considered here (which generically have forth order field equations). Let us remark how the structure of symmetry group and conservation laws is instead similar in standard GR and ETG. It does not depend in fact on the particular dynamics considered but just on the principle of general covariance.
6. Gauge Theories
Natural Theories are defined starting from a spacetime manifold M together with its diffeomorphisms that encode symmetries. It is well-known that this is suitable for some physical systems (specifically,
Relativistic Theories) while there are more general physical systems (e.g., Gauge Theories) which need bigger symmetry groups. Gauge Theories are needed today to describe some fundamental interactions and together with GR they are enough for Fundamental Physics.
A Gauge Theory depends on a principal bundle P with a structure Lie group G over spacetime M. Principal automorphisms Aut(P) form a transformation group; this group projects onto Diff(M), but Diff(M) is not in general realised as a subgroup of Aut(P). Automorphisms $ e Aut(P) are realised locally as changes of coordinates together with an element of G acting at all spacetime points (which in Physics is called a local action of G).
A Gauge-Natural Theory is a Field Theory in which Aut(P) acts on configurations generating Noether symmetries of the dynamics. This request forces the configuration bundle to be a gauge-natural bundle (see [9,28,29]); all fundamental theories of Physics can be cast in this form.
As in Natural Theories, also in Gauge-Natural Theories one can associate a Noether current to any right-invariant vector field on P (i.e., to each generator of principal automorphisms). Noether currents always allow superpotentials (see [9]) and field equations can be obtained purely from conservation laws. Yang-Mills theories will be discussed below.
6.1. Yang-Mills Theory
Let us briefly present Yang-Mills theories as a paradigm for Gauge(-Natural) Theories. Also Dirac spinors share most of the characteristics of Gauge Theories and can in fact be formulated as a Gauge-Natural Theory; see [9].
Let us consider a semisimple Lie group G and denote by 5 the Cartan-Killing (ad-invariant) metric on its Lie algebra g. Let P be a principal bundle with structure group G and C(P) be the bundle of principal connections on P. Automorphisms act on connections so that C(P) is a gauge-natural bundle. Once one fixes a 5-orthonormal basis TA in the Lie algebra 0 a connection is expressed by coefficients x); the curvature is defined by Ffv = dfuA — dvuA + cabcuB^, where cabc are "structure constants" of the group G. An automorphism on P is locally expressed as
x = x' f(x)
g = t(x) • 9
for some local map tp : M ^ G, • being the product in the group; it acts on the curvature as
FfA = adB (p)F£ JJ (62)
where Jp is the anti-Jacobian of the coordinate change and adA is the adjoint action of the group G on its Lie algebra 0.
The Yang-Mills Lagrangian
Lym = — 4 V95ab FAv FBa 9ßP9v° ds (63)
defines the dynamics and gauge transformations (i.e., automorphisms in P) are symmetries.
The first variation formula of the Lagrangian with respect to the deformation X = 5gaßdaß + 5uAdA defines the morphisms
< E\X >= {Vf ) 5uB + VgHßv5gfV) ds < F|X >= 5uBdsß (64)
where we set F^V := SabFBag^pgva and H^ := FAAFAvA — 4F&FAag^v. The tensor H^ is also known as the energy-momentum tensor of the Yang-Mills field. If the metric is freezed, then field equations are V^y/gFAV) = 0, i.e., they are the Yang-Mills equations. In general, the Yang-Mills Lagrangian is coupled to gravity and H^V acts as a source for the gravitational field.
Given a pointwise basis pA of vertical right-invariant vector fields on P, symmetry generators are in the form S = £^(x)d^ + £A(x)pA. The corresponding Noether current is
E = —J~g (HV£ + Fv/V,£a) ds„ (65)
where we set £A := £A + uAfor the vertical part of S with respect to the dynamical connection u. The superpotential is given by
U = — £A ds,v (66)
In the special case G = U(l) (i.e., Maxwell's Electromagnetism) the group is commutative and the adjoint representation is trivial; the curvature is then a 2-form on spacetime and it is invariant with respect to (proper) gauge transformations (i.e., vertical automorphisms). In this case Yang-Mills theory reduces to standard Electromagnetism and the connection u^ represents the quadripotential of electromagnetic field. Field equations reduce to Maxwell equations and gauge charges reduce to electric charges.
6.2. Hole Argument
There is another important crosspoint among GR, Gauge Theories and their symmetries. It is called the hole argument and it was first discovered in GR by Einstein, though it persists in Gauge Theories as well. It is originated when among symmetries of the system there are compact supported symmetries, i.e., symmetries that vanish (or became the identity) out of a compact sub-domain of spacetime M.
In GR one can easily define compact supported spacetime diffeomorphisms (and vector fields) on any paracompact spacetime manifold M. In Gauge Theories one can define compact supported automorphisms. Of course in Mechanics it is difficult to find examples of compact supported symmetries (e.g., there are no compact supported rotations in space). The only well-known exception is curve reparametrization in SR; there Physics is totally represented in terms of trajectories in spacetime (the so-called worldlines) and their parametrizations are introduced for technical reasons only, since they are not endowed with any physical meaning. Equations of motion should however induce equations on trajectories; hence they must be invariant with respect to general reparametrizations, which are thence required to be symmetries of the system. In this case, it is easy to produce compact support reparametrizations since the parameter lives in R which is paracompact.
The hole argument is based on the fact that symmetries transform solutions into solutions (since they leave the action invariant). One can show in an elementary way that compact supported symmetries contradict directly and essentially Cauchy theorem (i.e., determinism). In fact, given a solution a and acting on it by a compact supported symmetry $ one obtains a new solution a' = $*a. The two solutions a and a' are identical everywhere except within the compact support D of $. One has thence two different solutions which are exactly the same far in the past (in particular on a spacelike hypersurface where initial conditions can be imposed and which does not intersect D c M) while they differ at some point (in D).
This is a very general property of possible dynamics with symmetry groups that contain compact supported symmetries. Since in Physics one does not want to give up determinism too easily the only other possible way out is to assume that configurations which differ by compact supported symmetries (e.g., a and a' above) represent in fact the same physical situation. One physical situation can be represented by many different mathematical representations, i.e., physical situations are not in 1-to-1 correspondence with configurations but with equivalence classes of configurations. In other words, physical situations are realized by some suitable quotient of configuration space.
Dynamics must therefore respect the quotient (that is equivalent to assume that a change of representative in equivalence class is a dynamical symmetry). On the other way, just because of the structure of the symmetry group, one cannot hope to observe any difference among different representatives of the same class of equivalence. In other words, symmetries not only constrain physically reasonable dynamics but they also impose strong constraints on what can be physically observed. Accordingly, in all cases that are relevant to fundamental interaction Physics, the dynamics is degenerate.
This is, to the best of our knowledge, the widest definition of what one could mean by Gauge Theories. A Gauge Theory could be defined as a Field Theory with a symmetry group that contains enough compact supported symmetries so to be able to act non-trivially on any compact hole in the base manifold M (being trivial outside the holes, as above). [Of course if M is compact then any symmetry is compactly supported; here we mean the support is small, i.e., disjoint from the Cauchy surface where initial conditions are set.]. Accordingly, its dynamics is degenerate and it induces a deterministic dynamics on the space of gauge classes of configurations that are identified with physical situations. Of course, one cannot obtain every feature of standard gauge theories under these very mild hypotheses. For example, one has to ensure the symmetry group to be infinite dimensional (or, equivalently, parametrized by functions) to allow reduction of the gauge degrees of freedom. However, these are the mildest conditions for which field equations are underdetermined and hole argument apply. If one wants to keep be compliant to the standard nomenclature these could be called Underdetermined Field Theories or Generalized Gauge Theories.
For this reason physical gravitational fields in GR are not described in terms of "metrics" but rather in terms of "geometries" (i.e., equivalence classes of metrics with respect to spacetime diffeomorphisms). In Gauge Theories Yang-Mills fields are identified with gauge classes of connections on P. In SR motions are unparametrized worldlines.
7. Frame-Affine Formalism for GR
In order to provide an example of concrete application of our formalism here introduced in action we shall here consider the application to the so called Holst's action principle (see [30]) which is used as an equivalent formulation of GR suitable for developing LQG through the use of the Barbero-Immirzi connection (see [31-34] as well as references quoted therein).
Let M be a m-dimensional manifold (which will be required to allow global metrics of signature n = (r, s), with m = r + s). Let us denote by x^ local coordinates on M, which induce a basis d^ of
tangent spaces; let L(M) denote the general frame bundle of M and set (x,, VJ,) for fibered coordinates on L(M). We can define a right-invariant basis for vertical vectors on L(M)
p^ = V ,
rv v a
The general frame bundle is natural (see [28]), hence any spacetime vector field £ = d^ defines a natural lift on L(M)
£ = e d, + d, £v p,
We stress that the lift vector field £ is global whenever £ is global. A connection on L(M) is denoted by Pg, and it defines a lift
r : TM ^ TL(M) : £"d, ^ £* (d, - ra,pi)
This lift does not in general preserve commutators, unless the connection is flat.
Let now (£,M,n, SO(n)) be a principal bundle over the manifold M and let (x^,S%) be (overdetermined) fibered "coordinates" on the principal bundle £. We can define a right-invariant pointwise basis aab for vertical vectors on £ by setting
&ab = Vd[aPb]
pb = Sc
„d _ nd_
Sc dSb
where nab is the canonical diagonal matrix of signature n = (r, s) and square brackets denote skew-symmetrization over indices. A connection on £ is in the form
u = dx, 0 (d, - ufajb)
Also in this case the connection on £ induces connections on any associated bundle and there defines covariant derivatives of sections.
A frame is a bundle map e : £ ^ L(M) which preserves the right action, i.e., such that
- L(M )
i.e., e o RS = Ri(S) o e, where R denotes the relevant canonical right actions defined on both principal bundles £ and L(M) and where i : SO(n) ^ GL(m) is the canonical group inclusion. We stress that on any M which allows global metrics of signature n the bundle £ can always be chosen so that there exist global frames; see [35]. Locally the frame is represented by invertible matrices and it defines a spacetime metric = nab evh which is called the induced metric.
As for the Levi-Civita connection, a frame defines a connection on £ (called the spin-connection of the frame) given by
eaa (r^ + d,eba)
where {g}^ denote Christoffel symbols of the induced metric. The spin-connection is compatible with the frame in the sense that
V,eva = d%eva + {g}x»eXa - "ca%< = 0 (74)
In general the (natural) lift £ to L(M) of a spacetime vector field £ is not adapted to the image e(£) C L(M) and thence it does not define a vector field on £. With this notation the Kosmann lift of
£ = £%d% is defined by £K = £%d% + £aboab (see [36]) where we set:
tb = eb% - tfe (75)
and where ea% = nace% while ebv denote the inverse frame matrix.
Let us stress that despite appearing so, the Kosmann lift (75) does not in fact depend on the connection, but just on the frame and its first derivatives. The same lift can be written as
£.ab = v[b£a - u%b£% where we set £a = £%e% since one can prove that
Vb£a = ea V%£v e% (76)
Another useful equivalent expression for the Kosmann lift is giving the vertical part of the lift with respect to the spin connection (see [9], pp. 288-290), namely
£ab) := + "ab£% = eb% = V[b£a (77)
This last expression is useful since it expresses a manifestly covariant quantity.
We have to stress that the Kosmann lift does not preserve commutators. In fact if one considers two spacetime vectors £ and Z and computes the Kosmann lift of the commutator [£, Z] one can easily prove that
[£, Z] k = [£k, IK] + 2eakgaX£ZgxpebOab (78)
Thence only if one restricts to Killing vectors (i.e., £%g = 0) one recovers that the lift preserves commutators.
Let us now consider tetrad-affine formulation of GR: the fundamental fields are a Lorentz connection r%b and a vielbein ea = e% dx%. The connection defines the curvature form Rab = 2 Rab%v dx% A dxv. Let us also set e = det |e%|, Ra% = Rab%vevh and R = Rab%ve%evh; here evh denotes the inverse frame matrix of ebv. The frame also defines a metric g%v = e%nabebv which in turn defines its Levi-Civita spacetime connection ra%.
On a spacetime of dimension 4 let us consider the Lagrangian
LtA = Rab a ec A ed tabcd (79)
By variation we obtain
6LtA = - 2ee^Ra% - 2 Re%e% 6ed% - eabcdV% (ecpeda) 6ra%b
+ eabcdV% (ecped ¿r%b) (80)
Thus one obtains field equations in the form
Ra _ 1 Rea
R t 2 t
Vit (ePea]
The second field equation forces the connection to be the connection induced by the frame, i.e., Tf
then the first equation forces the induced metric to obey Einstein equations.
This Field Theory is dynamically equivalent to standard GR, in the sense that it obeys equivalent field equations. However, the theory is in fact richer in its physical interpretation, since the use of different variables and action principles generate larger symmetry and extra conservation laws. In fact, this theory has a bigger symmetry group being both generally covariant and Lorentz covariant.
Noether theorem implies then conservation of the current
= 4eeiX fjf - £V LtA
along any Lorentz gauge generator S = £Vdt + £abcab. The Lie derivative of a connection is given by
£ xRab\v + V £a
where we set iab = £ab + £xTf. Hence one obtains
= 4eet (R% - 2R£" - 4Vv (ee*evb) £ab + 4Vv (ee^£ab)
The first and second terms vanish on-shell; hence one obtains
= 4Vv (eeteV f)
Let us stress that this current depends only on the Lorentz generator £ab.
Here is the issue with physical interpretation: we have two equivalent formulations of Einstein GR. Noether currents in one case depend on spacetime vector fields, while in tetrad-affine formulation Noether currents depend on Lorentz generators that a priori have nothing to do with spacetime transformations. Let us stress of course that unless the spacetime is Minkowski, there is no class of (global) spacetime diffeomorphisms representing Lorentz transformations.
Considering the dynamical equivalence at the level of field equations and solution space, one would like this equivalence to be extended at the level of conservation laws. Moreover, some of the conserved quantities in standard GR are known to be related to physical quantities such as energy, momentum and angular momentum, while one would wish to be able to identify the corresponding quantities also in the second formulation. Kosmann lift is in fact essential to relate Lorentz generators to spacetime diffeomorphisms and the corresponding conservation laws.
The Noether current (85) can be restricted by setting S = £K, so that one obtains
4Vv (eVt£v)
which corresponds to the standard conserved quantity associated to spacetime diffeomorphisms in GR written in terms of Komar superpotential. This (and only this) restores the equivalence between standard GR and tetrad-affine formulation at the level of conservation laws. As a further example let us consider the covariant Lagrangian:
Lh = LtA + pRab A ea A eb (87)
which is known as Holst's Lagrangian. By variations one obtains equations
e"d R - ¡Re;) el - /3Rdp,v= 0
V7 ( Ie d] A n (88)
V[„, el0e\) = 0
[% ea]
The second equation still imposes {g}%b = u%b; this in turns implies Ra[p%v] = 0 (first Bianchi identity) and hence Einstein equations. This shows how Holst's Lagrangian provides an equivalent formulation of standard GR as well.
It is interesting to check if also in this case the equivalence is preserved also at level of conservation laws. The Noether current is
E%H = 4ee%eV^ + ee%cevdecd^b£Brab - £%LH (89)
As in the previous case this can be recasted modulo terms vanishing on-shell as follows
EH - E%A = Vv (ee%eVe^t*) (90)
Again this has nothing to do with spacetime symmetries and in general it would affect conserved quantities. When Kosmann lift is again inserted into these conservation laws one obtains
EH -EA = Vv (Vp£CT e%vP.) (91)
that vanishes being the divergence of a divergence. Hence once again the correspondence at the level of conservation laws is preserved when the Kosmann lift is used.
8. Conclusions and Perspectives
The kinematics and dynamics of most fundamental interaction Physics can be defined purely in terms of symmetries. Noether theorem and conservation laws define a considerable part of its physical interpretation.
The role of strong conservation laws is still unclear. One can imagine some role in the Hamiltonian framework that is however, still unclear and it deserves further investigations.
From a fundamental viewpoint one should also investigate whether one could describe Nature in terms of the whole set of conservation laws, without selecting special conservation laws to be endowed with a special meanings. In this direction we can mention the approach to Noether theorem in terms of momentum maps; see [2].
Acknowledgements
The authors wish to thank C. Rovelli for discussions.
This work is partially supported by MIUR: PRIN 2005 on Leggi di conservazione e termodinamica in meccanica dei continui e teorie di campo. We also acknowledge the contribution of INFN (Iniziativa Specifica NA12) and the local research funds of Dipartimento di Matematica of Torino University.
This paper is published despite the effects of the Italian law 133/08. This law drastically reduces public funds to public Italian universities, which is particularly dangerous for free scientific research, and it will prevent young researchers from getting a position, either temporary or tenured, in Italy (http://groups.google.it/group/scienceaction). The authors are protesting against this law to obtain its cancellation.
References
1. Noether, E. Invariante variationsprobleme. Nachr. d. Knig. Gesellsch. d. Wiss. zu Gttingen, Math-phys. Klasse 1918, 235-257.
2. Gotay, M.J.; Isenberg, J.; Marsden, J.E.; Montgomery, R. Momentum maps and classical relativistic fields - Part I: covariant field theory. math-ph. 2004, arXiv: physics/9801019.
3. Batlle, C.; Gomis, J.; Pons, J.M.; Roman-Roy, N. Equivalence between the Lagrangian and Hamiltonian formalism for constrained systems. J. Math. Phys. 1986, 27, 2953-2962.
4. Batlle, C.; Gomis, J.; Gracia, X.; Pons, J.M. Noethers theorem and gauge transformations: Application to the bosonic string and CP2,n-1) model. J. Math. Phys. 1989, 30, 1345-1350.
5. Gomis, J.; Henneaux, M.; Pons, J.M. Existence theorem for gauge symmetries in Hamiltonian constrained systems. Class. Quantum Grav. 1990, 7, 1089-1096.
6. Cariñena, J.F. Theory of Singular Lagrangians. Forts. der Phys. 1990, 38, 641-679.
7. de Leon, M.; Marrero, J.C.; Marín, J.; Muñoz, M.C.; Roman, N. Singular Lagrangians systems on Jet bundles. Forts. der Phys. 2002, 50, 105-169.
8. Castrillon Lopez, M.; Marsden, J.E. Covariant and dynamical reduction for principal fiber bundles field theories. Ann. Global Anal. Geom. 2008, 30, 263-285.
9. Fatibene, L.; Francaviglia, M. Natural and Gauge Natural Formalism for Classical Field Theories: A Geometric Perspective Including Spinors and Gauge Theories; Kluver Academic Publisher: Dordrecht, The Netherlands, 2003.
10. Steenrod, N. Topology of Fibre Bundles; Princeton University Press: Princeton, NJ, USA, 1951.
11. Kolar, I. A geometrical version of the higher order Hamilton formalism in fibred manifolds. J. Geom. Phys. 1984,1, 127-137.
12. García, P.L. The Poincaré-Cartan invariant in the calculusof variations. Symposia Math. 1976, 14, 219.
13. García, P.L.; Muñoz, J. On the geometrical structure of higher order variational calculus. In Proceedings of the IUTAM-ISIMM Symposium on Modern Developments in Analytical Mechanics—Torino, Italy, 7-11 July 1982; Benenti, S., Francaviglia, M., Lichnerowicz, A., Eds.; Tecnoprint: Bologna, Italy, 1983; p. 127.
14. Ferraris, M. Fibered connections and global poincar cartan forms in higher order calculus of variations. In Proceedings of the Conference on Differential Geometry and its Applications, 1983, Nov Msto na Morav, Czech Republic; Krupka, D., Ed.; J.E. Purkyne University: Brno, Czech Republic; 1984, pp. 61-91.
15. Ferraris, M.; Francaviglia, M. On the theory of formal connections and fibered connections in fibered manifolds. In Quaderni di Matematica, Universita di Torino, Quaderno n.86; Department of Mathematics, University of Torino: Torino, Italy, 1984.
16. Saunders, D.J. The geometry of jet bundles; Cambridge University Press: Cambridge, UK, 1989.
17. Trautman, A. Invariance of lagrangian sytems. In Papers in Honour ofJ.L. Synge; Clarenden Press: Oxford, UK, 1972; pp. 85-99.
18. Palais, R.S.; Terng, C.L. Natural bundles have finite order. Topology 1977,16, 271-277.
19. Fatibene, L.; Ferraris, M.; Francaviglia, M. Augmented variational principles and relative conservation laws in classical field theory. Int. J. Geom. Meth. Mod. Phys. 2005, 2, 373-392.
20. Carini, M.; Fatibene, L.; Francaviglia, M. Covariant energy in mechanics. Int. J. Geom. Methods Mod. Phys. 2007, 4, 907-918.
21. Arnowitt, R.; Deser, S.; Misner, C. Dynamical structure and definition of energy in general relativity. Phys. Rev. 1959,116, 1322-1330.
22. Fatibene, L.; Ferraris, M.; Francaviglia, M.; Lusanna, L. ADM Pseudotensors, conserved quantities and covariant conservation laws in general relativity, (in preparation).
23. Magnano, G.; Ferraris, M.; Francaviglia, M. Legendre transformation and dynamical structure of higher-derivative gravity. Class. Quant. Grav. 1990, 7, 557-570.
24. Capozziello, S.; Francaviglia, M. Extended theories of gravitation and their cosmological and astrophysical applications. Gen. Rel. Grav. 2008, 40, 357-420.
25. Sotiriou, T.P.; Faraoni, V. f (R)-theories of gravity. Rev. Mod. Phys. 2010, 82, 451-497.
26. Capozziello, S.; De Laurentis, M.; Faraoni, V. A bird's eye view of f (R)-gravity. gr-qc. 2009, arXiv:0909.4672.
27. Capozziello, S.; De Laurentis, M.; Francaviglia, M.; Mercadante, S. From dark energy and dark matter to dark metric. Found. Phys. 2009, 39, 1161-1176.
28. Kolar, I.; Michor, P.W.; Slovak, J. Natural Operations in Differential Geometry; Springer-Verlag: New York, NY, USA, 1993.
29. Eck, D.J. Gauge-natural bundles and generalized gauge theories. Mem. Amer. Math. Soc. 1981,33, 247.
30. Holst, S. Barbero's Hamiltonian derived from a generalized Hilbert-Palatini action. Phys. Rev. 1996, D53, 5966.
31. Barbero, F. Real Ashtekar variables for Lorentzian signature space-time. Phys. Rev. 1996, D51, 5507.
32. Immirzi, G. Quantum gravity and regge calculus. Nucl. Phys. Proc. Suppl. 1997, 57, 65-72.
33. Fatibene, L.; Francaviglia, M.; Rovelli, C. On a Covariant formulation of the Barbero-Immirzi Connection. Class. Quant. Grav. 2007, 24, 3055-3066.
34. Fatibene, L.; Francaviglia, M.; Rovelli, C. Lagrangian formulation of Ashtekar-Barbero-Immirzi Gravity. Class. Quant. Grav. 2007, 24, 4207-4217.
35. Fatibene, L.; Francaviglia, M. Deformations of spin structures and gravity. Acta Phys. Pol. B 1998, 29, 915-928.
36. Fatibene, L.; Ferraris, M.; Francaviglia, M.; Godina, M. A geometric definition of lie derivative for spinor fields. In Proceedings of the 6th International Conference on Differential Geometry and its Applications, 28 August-1 September, 1995; Kolar, I., Ed.; MU University: Brno, Czech Republic, 1996; p. 549.
© 2010 by the authors; licensee MDPI, Basel, Switzerland. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license http://creativecommons.Org/licenses/by/3.0/.