Dynamics


From Encyclopedia Britannica (11th edition, 1910)

EncycloReader

Dynamics (from Gr. δύναμις, strength), the name of a branch of the science of Mechanics (q.v.). The term was at one time restricted to the treatment of motion as affected by force, being thus opposed to Statics, which investigated equilibrium or conditions of rest. In more recent times the word has been applied comprehensively to the action of force on bodies either at rest or in motion, thus including “dynamics” (now termed kinetics) in the restricted sense and “statics.”

Analytical Dynamics.—The fundamental principles of dynamics, and their application to special problems, are explained in the articles Mechanics and Motion, Laws of, where brief indications are also given of the more general methods of investigating the properties of a dynamical system, independently of the accidents of its particular constitution, which were inaugurated by J.L. Lagrange. These methods, in addition to the unity and breadth which they have introduced into the treatment of pure dynamics, have a peculiar interest in relation to modern physical speculation, which finds itself confronted in various directions with the problem of explaining on dynamical principles the properties of systems whose ultimate mechanism can at present only be vaguely conjectured. In determining the properties of such systems the methods of analytical geometry and of the infinitesimal calculus (or, more generally, of mathematical analysis) are necessarily employed; for this reason the subject has been named Analytical Dynamics. The following article is devoted to an outline of such portions of general dynamical theory as seem to be most important from the physical point of view.

1. General Equations of Impulsive Motion.

The systems contemplated by Lagrange are composed of discrete particles, or of rigid bodies, in finite number, connected (it may be) in various ways by invariable geometrical relations, the fundamental postulate being that the position of every particle of the system at any time can be completely specified by means of the instantaneous values of a finite number of independent variables q1, q2, ... qn, each of which admits of continuous variation over a certain range, so that if x, y, z be the Cartesian co-ordinates of any one particle, we have for example

x = ƒ(q1, q2, ... qn), y = &c., z = &c.,

(1)

where the functions ƒ differ (of course) from particle to particle. In modern language, the variables q1, q2, ... qn are generalized co-ordinates serving to specify the configuration of the system; their derivatives with respect to the time are denoted by q˙1, q˙2, ... q˙n, and are called the generalized components of velocity. The continuous sequence of configurations assumed by the system in any actual or imagined motion (subject to the given connexions) is called the path.

For the purposes of a connected outline of the whole subject it is convenient to deviate somewhat from the historical order of development, and to begin with the consideration of Impulsive motion. impulsive motion. Whatever the actual motion of the system at any instant, we may conceive it to be generated instantaneously from rest by the application of proper impulses. On this view we have, if x, y, z be the rectangular co-ordinates of any particle m,

mẋ = X′, mẏ = Y′, mz˙ = Z′,

(2)

where X′, Y′, Z′ are the components of the impulse on m. Now let δx, δy, δz be any infinitesimal variations of x, y, z which are consistent with the connexions of the system, and let us form the equation

Σm(ẋδx + ẏδy + z˙δz) = Σ(X′δx + Y′δy + Z′δz),

(3)

where the sign Σ indicates (as throughout this article) a summation extending over all the particles of the system. To transform (3) into an equation involving the variations δq1, δq2, ... of the generalized co-ordinates, we have

ẋ = ∂x 1 + ∂x 2 + ..., &c., &c.
∂q1 ∂q2
(4)
δx = ∂x δq1 + ∂x δq2 + ..., &c., &c.
∂q1 ∂q2
(5)

and therefore

Σm(ẋδx + ẏδy + z˙δz) = A111 + A122 + ...)δq1 + (A211 + A222 + ...)δq2 + ...,

(6)

where

Arr = Σm { ( ∂x ) ² + ( ∂y ) ² + ( ∂z ) ² },
∂qr   ∂qr   ∂qr  
(7)
Ars = Σm { ∂x ∂x + ∂y ∂y + ∂z ∂z } = Asr.
∂qr ∂qs ∂qr ∂qs ∂qr ∂qs

If we form the expression for the kinetic energy Τ of the system, we find

2Τ = Σm(ẋ² + ẏ² + z˙²) = A111² + A222² ... 2A1212 + ...

(8)

The coefficients A11, A22, ... A12, ... are by an obvious analogy called the coefficients of inertia of the system; they are in general functions of the co-ordinates q1, q2,.... The equation (6) may now be written

Σm(ẋδx + ẏδy + z˙δz) = ∂Τ δq1 + ∂Τ δq2 + ...
∂q˙1 ∂q˙2
(9)

This maybe regarded as the cardinal formula in Lagrange’s method. For the right-hand side of (3) we may write

Σ(X′δx + Y′δy + Z′δz) = Q′1δq1 + Q′2δq2 + ... ,

(10)

where

Q′r = Σ(X′ ∂x + Y′ ∂y + Z′ ∂z ).
∂qr ∂qr ∂qr
(11)

The quantities Q1, Q2, ... are called the generalized components of impulse. Comparing (9) and (10), we have, since the variations δq1, δq2,... are independent,

∂Τ = Q′1, ∂Τ = Q′2, ...
∂q˙1 ∂q˙2
(12)

These are the general equations of impulsive motion.

It is now usual to write

pr = ∂Τ .
∂q˙r
(13)

The quantities p1, p2, ... represent the effects of the several component impulses on the system, and are therefore called the generalized components of momentum. In terms of them we have

Σm(ẋδx + ẏδy + z˙δz) = p1δq1 + p2δq2 + ...

(14)

Also, since Τ is a homogeneous quadratic function of the velocities q˙1, q˙2 ...,

2Τ = p11 + p22 + ...

(15)

This follows independently from (14), assuming the special variations δx = ẋdt, &c., and therefore δq1 = q˙1dt, δq2 = q˙2dt, ...

Again, if the values of the velocities and the momenta Reciprocal theorems. in any other motion of the system through the same configuration be distinguished by accents, we have the identity

p1q˙′1 + p2q˙′2 + ... = p′11 + p′22 + ...,

(16)

each side being equal to the symmetrical expression

A111q˙′1 + A222q˙′2 + ... + A12(q˙1q˙′2 + q˙′12) + ...

(17)

The theorem (16) leads to some important reciprocal relations. Thus, let us suppose that the momenta p1, p2, ... all vanish with the exception of p1, and similarly that the momenta p′1, p′2, ... all vanish except p′2. We have then p1q˙′1 = p′22, or

2 : p1 = q˙′1 : p′2

(18)

The interpretation is simplest when the co-ordinates q1, q2 are both of the same kind, e.g. both lines or both angles. We may then conveniently put p1 = p′2, and assert that the velocity of the first type due to an impulse of the second type is equal to the velocity of the second type due to an equal impulse of the first type. As an example, suppose we have a chain of straight links hinged each to the next, extended in a straight line, and free to move. A blow at right angles to the chain, at any point P, will produce a certain velocity at any other point Q; the theorem asserts that an equal velocity will be produced at P by an equal blow at Q. Again, an impulsive couple acting on any link A will produce a certain angular velocity in any other link B; an equal couple applied to B will produce an equal angular velocity in A. Also if an impulse F applied at P produce an angular velocity ω in a link A, a couple Fa applied to A will produce a linear velocity ωa at P. Historically, we may note that reciprocal relations in dynamics were first recognized by H.L.F. Helmholtz in the domain of acoustics; their use has been greatly extended by Lord Rayleigh.

The equations (13) determine the momenta p1, p2,... as linear functions of the velocities q˙1, q˙2,... Solving these, we can express q˙1, q˙2 ... as linear functions of p1, p2,... The resulting equations give us the velocities produced by any given Velocities in terms of momenta. system of impulses. Further, by substitution in (8), we can express the kinetic energy as a homogeneous quadratic function of the momenta p1, p2,... The kinetic energy, as so expressed, will be denoted by Τ`; thus

2Τ` = A`11p1² + A`22p2² + ... + 2A`12p-p2 + ...

(19)

where A`11, A`22,... A`12,... are certain coefficients depending on the configuration. They have been called by Maxwell the coefficients of mobility of the system. When the form (19) is given, the values of the velocities in terms of the momenta can be expressed in a remarkable form due to Sir W.R. Hamilton. The formula (15) may be written

p11 + p22 + ... = Τ + Τ`,

(20)

where Τ is supposed expressed as in (8), and Τ` as in (19). Hence if, for the moment, we denote by δ a variation affecting the velocities, and therefore the momenta, but not the configuration, we have

p1δq˙1 + q˙1δp + p2δq˙2 + q˙2δp2 + ... = δΤ + δΤ`

   = ∂Τ δq˙1 + ∂Τ δq˙2 + ... + ∂Τ` δp1 + ∂Τ` δp2 + ...
∂q˙1 ∂q˙2 ∂p1 ∂p2
(21)

In virtue of (13) this reduces to

1δp1 + q˙2δp2 + ... = ∂Τ` δp1 + ∂Τ` δp2 + ...
∂p1 ∂p2
(22)

Since δp1, δp2, ... may be taken to be independent, we infer that

1 = ∂Τ` ,    q˙2 = ∂Τ` , ...
∂p1 ∂p2
(23)

In the very remarkable exposition of the matter given by James Clerk Maxwell in his Electricity and Magnetism, the Hamiltonian expressions (23) for the velocities in terms of the impulses are obtained directly from first principles, and the formulae (13) are then deduced by an inversion of the above argument.

An important modification of the above process was introduced by E.J. Routh and Lord Kelvin and P.G. Tait. Instead of expressing the kinetic energy in terms of the velocities alone, or in terms of the momenta alone, we may express it in Routh’s modification. terms of the velocities corresponding to some of the co-ordinates, say q1, q2, ... qm, and of the momenta corresponding to the remaining co-ordinates, which (for the sake of distinction) we may denote by χ, χ′, χ″, .... Thus, Τ being expressed as a homogeneous quadratic function of q˙1, q˙2, ... q˙m, χ˙, χ˙′, χ˙″, ..., the momenta corresponding to the co-ordinates χ, χ′, χ″, ... may be written

κ = ∂Τ ,   κ′ = ∂Τ ,   κ″ = ∂Τ , ...
∂χ˙ ∂χ˙′ ∂χ˙″
(24)

These equations, when written out in full, determine χ˙, χ˙′, χ˙″, ... as linear functions of q˙1, q˙2, ... q˙m, κ, κ′, κ″,... We now consider the function

R = Τ − κχ˙ − κ′χ˙′ − κ″χ˙″ − ... ,

(25)

supposed expressed, by means of the above relations in terms of q˙1, q˙2, ... q˙m, κ, κ′, κ″, ... Performing the operation δ on both sides of (25), we have

∂R δq˙1 + ... + ∂R δκ + ... = ∂Τ δq˙1 + ... + ∂Τ δχ˙ + ... − κ∂χ˙ − χ˙δκ − ... ,
∂q˙1 ∂κ ∂q˙1 ∂χ˙
(26)

where, for brevity, only one term of each type has been exhibited. Omitting the terms which cancel in virtue of (24), we have

∂R δq˙1 + ... + ∂R δκ + ... = ∂Τ δq˙1 + ... − χ˙δκ − ...
∂q˙1 ∂κ ∂q˙1
(27)

Since the variations δq1, δq2, ... δqm, δκ, δκ′, δκ″, ... may be taken to be independent, we have

p1 = ∂Τ = ∂R ,   p2 = ∂Τ = ∂R , ...
∂q˙1 ∂q˙1 ∂q˙2 ∂q˙2
(28)

and

χ˙ = − ∂R ,   χ˙′ = − ∂R ,   χ˙″ = − ∂R , ...
∂κ ∂κ′ ∂κ″
(29)

An important property of the present transformation is that, when expressed in terms of the new variables, the kinetic energy is the sum of two homogeneous quadratic functions, thus

Τ = ⅋ + K,

(30)

where ⅋ involves the velocities q˙1, q˙2, ... q˙m alone, and K the momenta κ, κ′, κ″, ... alone. For in virtue of (29) we have, from (25),

Τ = R − (κ ∂R + κ′ ∂R + κ″ ∂R + ... ),
∂κ ∂κ′ ∂κ″
(31)

and it is evident that the terms in R which are bilinear in respect of the two sets of variables q˙1, q˙2, ... q˙m and κ, κ′, κ″, ... will disappear from the right-hand side.

It may be noted that the formula (30) gives immediate proof of two important theorems due to Bertrand and to Lord Kelvin respectively. Let us suppose, in the first place, that the system is started by given impulses of certain types, Maximum and minimum energy. but is otherwise free. J.L.F. Bertrand’s theorem is to the effect that the kinetic energy is greater than if by impulses of the remaining types the system were constrained to take any other course. We may suppose the co-ordinates to be so chosen that the constraint is expressed by the vanishing of the velocities q˙1, q˙2, ... q˙m, whilst the given impulses are κ, κ′, κ″,... Hence the energy in the actual motion is greater than in the constrained motion by the amount ⅋.

Again, suppose that the system is started with prescribed velocity components q˙1, q˙2, ... q˙m, by means of proper impulses of the corresponding types, but is otherwise free, so that in the motion actually generated we have κ = 0, κ′ = 0, κ″ = 0, ... and therefore K = 0. The kinetic energy is therefore less than in any other motion consistent with the prescribed velocity-conditions by the value which K assumes when κ, κ′, κ″, ... represent the impulses due to the constraints.

Simple illustrations of these theorems are afforded by the chain of straight links already employed. Thus if a point of the chain be held fixed, or if one or more of the joints be made rigid, the energy generated by any given impulses is less than if the chain had possessed its former freedom.

2. Continuous Motion of a System.

We may proceed to the continuous motion of a system. The Lagrange’s equations. equations of motion of any particle of the system are of the form

mẍ = X,   mÿ = Y,   mz¨ = Z

(1)

Now let x + δx, y + δy, z + δz be the co-ordinates of m in any arbitrary motion of the system differing infinitely little from the actual motion, and let us form the equation

Σm (ẍδx + ÿδy + z¨δz) = Σ (Xδx + Yδy + Zδz)

(2)

Lagrange’s investigation consists in the transformation of (2) into an equation involving the independent variations δq1, δq2, ... δqn.

It is important to notice that the symbols δ and d/dt are commutative, since

δẋ = d (x + δx) − dx = d δx, &c.
dt dt dt
(3)

Hence

Σm(ẍδx + ÿδy + z¨δz) = d Σm (ẋδx + ẏδy + z˙δz) − Σm (ẋδẋ + ẏδẏ + z˙δz˙)
dt
= d (p1δq1 + p2δq2 + ...) − δΤ,
dt
(4)

by § 1 (14). The last member may be written

1δq1 + p1δq˙1 + ṗ2δq2 + p2δq˙2 + ... − ∂Τ δq˙1 ∂Τ δq1 ∂Τ δq˙2 ∂Τ δq2 − ...
∂q˙1 ∂q1 ∂q˙2 ∂q2
(5)

Hence, omitting the terms which cancel in virtue of § 1 (13), we find

Σm(ẍδx + ÿδy + z¨δz) = (1 ∂Τ ) δq1 + (2 ∂Τ ) δq2 + ...
∂q1 ∂q2
(6)

For the right-hand side of (2) we have

Σ(Xδx + Yδy + Zδz) = Q1δq1 + Q2δq2 + ... ,

(7)

where

Qr = Σ (X ∂x + Y ∂y + Z ∂z ).
∂qr ∂qr ∂qr
(8)

The quantities Q1, Q2, ... are called the generalized components of force acting on the system.

Comparing (6) and (7) we find

1 ∂Τ = Q1,   ṗ2 ∂Τ = Q2, ... ,
∂q˙1 ∂q˙2
(9)

or, restoring the values of p1, p2, ...,

d ( ∂Τ ) ∂Τ = Q1,   d ( ∂Τ ) ∂Τ = Q2, ...
dt ∂q˙1 ∂q1 dt ∂q˙2 ∂q2
(10)

These are Lagrange’s general equations of motion. Their number is of course equal to that of the co-ordinates q1, q2, ... to be determined.

Analytically, the above proof is that given by Lagrange, but the terminology employed is of much more recent date, having been first introduced by Lord Kelvin and P.G. Tait; it has greatly promoted the physical application of the subject. Another proof of the equations (10), by direct transformation of co-ordinates, has been given by Hamilton and independently by other writers (see Mechanics), but the variational method of Lagrange is that which stands in closest relation to the subsequent developments of the subject. The chapter of Maxwell, already referred to, is a most instructive commentary on the subject from the physical point of view, although the proof there attempted of the equations (10) is fallacious.

In a “conservative system” the work which would have to be done by extraneous forces to bring the system from rest in some standard configuration to rest in the configuration (q1, q2, ... qn) is independent of the path, and may therefore be regarded as a definite function of q1, q2, ... qn. Denoting this function (the potential energy) by V, we have, if there be no extraneous force on the system,

Σ (Xδx + Yδy + Zδz) = − δV,

(11)

and therefore

Q1 = − ∂V ,   Q2 = − ∂V , ....
∂q1 ∂q2
(12)

Hence the typical Lagrange’s equation may be now written in the form

d ( ∂Τ ) ∂Τ = − ∂V ,
dt ∂q˙r ∂qr ∂qr
(13)

or, again,

r = − (V − Τ).
∂qr
(14)

It has been proposed by Helmholtz to give the name kinetic potential to the combination V − Τ.

As shown under Mechanics, § 22, we derive from (10)

= Q11 + Q22 + ... ,
dt
(15)

and therefore in the case of a conservative system free from extraneous force,

d (Τ + V) = 0 or Τ + V = const.,
dt
(16)

which is the equation of energy. For examples of the application of the formula (13) see Mechanics, § 22.

3. Constrained Systems.

It has so far been assumed that the geometrical relations, if any, which exist between the various parts of the system Case of varying relations. are of the type § 1 (1), and so do not contain t explicitly. The extension of Lagrange’s equations to the case of “varying relations” of the type

x = ƒ(t, q1, q2, ... qn), y = &c., z = &c.,

(1)

was made by J.M.L. Vieille. We now have

ẋ = ∂x + ∂x 1 + ∂x 2 + ..., &c., &c.,
∂t ∂q1 ∂q2
(2)
∂x = ∂x δq1 + ∂x δq2 + ..., &c., &c.,
∂q1 ∂q2
(3)

so that the expression § 1 (8) for the kinetic energy is to be replaced by

2Τ = α0 + 2α11 + 2α22 + ... + A111² + A222² + ... + A1212 + ...,

(4)

where

α0 = Σm { ( ∂x ) ² + ( ∂y ) ² + ( ∂z ) ² },
∂t   ∂t   ∂t  
(5)
αr = Σm { ∂x   ∂x + ∂y   ∂y + ∂z   ∂z },
∂t ∂qr ∂t ∂qr ∂t ∂qr

and the forms of Arr, Ars are as given by § 1 (7). It is to be remembered that the coefficients α0, α1, α2, ... A11, A22, ... A12 ... will in general involve t explicitly as well as implicitly through the co-ordinates q1, q2,.... Again, we find

Σm (ẋδx + ẏδy + z˙δz) = (α1 + A111 + A122 + ...) δq1 + (α2 + A211 + A222 + ...) ∂q2 + ...

= ∂Τ δq1 + ∂Τ δq2 + ... = p1δq1 + p2δq2 + ...,
∂q˙1 ∂q˙2
(6)

where pr is defined as in § 1 (13). The derivation of Lagrange’s equations then follows exactly as before. It is to be noted that the equation § 2 (15) does not as a rule now hold. The proof involved the assumption that Τ is a homogeneous quadratic function of the velocities q˙1, q˙2....

It has been pointed out by R.B. Hayward that Vieille’s case can be brought under Lagrange’s by introducing a new co-ordinate (χ) in place of t, so far as it appears explicitly in the relations (1). We have then

2Τ = α0χ˙² + 2(α11 + α22 + ...) χ˙ + A111² + A222² + ... + 2A1212 + ....

(7)

The equations of motion will be as in § 2 (10), with the additional equation

d   ∂Τ ∂Τ = X,
dt ∂χ˙ ∂χ
(8)

where X is the force corresponding to the co-ordinate χ. We may suppose X to be adjusted so as to make χ¨ = 0, and in the remaining equations nothing is altered if we write t for χ before, instead of after, the differentiations. The reason why the equation § 2 (15) no longer holds is that we should require to add a term Xχ˙ on the right-hand side; this represents the rate at which work is being done by the constraining forces required to keep χ˙ constant.

As an example, let x, y, z be the co-ordinates of a particle relative to axes fixed in a solid which is free to rotate about the axis of z. If φ be the angular co-ordinate of the solid, we find without difficulty

2Τ = m (ẋ² + ẏ² +z˙²) + 2φ˙m (xẏ − yẋ) + {I + m (x² + y²)} φ˙²,

(9)

where I is the moment of inertia of the solid. The equations of motion, viz.

d   ∂Τ ∂Τ = X,   d   ∂Τ ∂Τ = Y,   d   ∂Τ ∂Τ = Z,
dt ∂ẋ ∂x dt ∂ẏ ∂y dt ∂z˙ ∂z
(10)

and

d   ∂Τ ∂Τ = Φ,
dt ∂φ˙ ∂φ
(11)

become

m (ẍ − 2φ˙ẏ − xφ˙² − yφ¨) = X, m (ÿ + 2φ˙ẋ − yφ˙² + xφ¨) = Y, mz¨ = Z,

(12)

and

d [{I + m (x² + y²)} φ˙ + m (xẏ − yẋ)] = Φ.
dt
(13)

If we suppose Φ adjusted so as to maintain φ¨ = 0, or (again) if we suppose the moment of inertia I to be infinitely great, we obtain the familiar equations of motion relative to moving axes, viz.

m (ẍ − 2ωẏ − ω²x) = X, m (ÿ + 2ωẋ − ω²y) = Y, mz¨ = Z,

(14)

where ω has been written for φ. These are the equations which we should have obtained by applying Lagrange’s rule at once to the formula

2Τ = m (ẋ² + ẏ² + z˙²) + 2mω (xẏ − yẋ) + mω² (x² + y²),

(15)

which gives the kinetic energy of the particle referred to axes rotating with the constant angular velocity ω. (See Mechanics, § 13.)

More generally, let us suppose that we have a certain group of co-ordinates χ, χ′, χ″, ... whose absolute values do not affect the expression for the kinetic energy, and that by suitable forces of the corresponding types the velocity-components χ˙, χ˙′, χ˙″, ... are maintained constant. The remaining co-ordinates being denoted by q1, q2, ... qn, we may write

2T = ⅋ + T0 + 2(α11 + α22 + ...) χ˙ + 2(α′11 + α′22 + ...) χ˙′ + ...,

(16)

where ⅋ is a homogeneous quadratic function of the velocities q˙1, q˙2, ... q˙n of the type § 1 (8), whilst Τ0 is a homogeneous quadratic function of the velocities χ˙, χ˙′, χ˙″, ... alone. The remaining terms, which are bilinear in respect of the two sets of velocities, are indicated more fully. The formulae (10) of § 2 give n equations of the type

d ( ∂⅋ ) ∂⅋ + (r, 1) q˙1 + (r, 2) q˙2 + ... − ∂T0 = Qr
dt ∂qr ∂qr ∂qr
(17)

where

(r, s) = ( ∂αr ∂αs )χ˙ + ( ∂α′r ∂α′s )χ˙′ + ....
∂qs ∂qr ∂qs ∂qr
(18)

These quantities (r, s) are subject to the relations

(r, s) = −(s, r), (r, r) = 0

(19)

The remaining dynamical equations, equal in number to the co-ordinates χ, χ′, χ″, ..., yield expressions for the forces which must be applied in order to maintain the velocities χ˙, χ˙′, χ˙″, ... constant; they need not be written down. If we follow the method by which the equation of energy was established in § 2, the equations (17) lead, on taking account of the relations (19), to

d (⅋ − T0) = Q11 + Q22 + ... + Qnn,
dt
(20)

or, in case the forces Qr depend only on the co-ordinates q1, q2, ... qn and are conservative,

⅋ + V − T0 = const.

(21)

The conditions that the equations (17) should be satisfied by zero values of the velocities q˙1, q˙2, ... q˙n are

Qr = − ∂T0 ,
∂qr
(22)

or in the case of conservative forces

(V − T0) = 0,
∂qr
(23)

i.e. the value of V − Τ0 must be stationary.

We may apply this to the case of a system whose configuration relative to axes rotating with constant angular velocity (ω) is defined by means of the n co-ordinates q1, q2, ... qn. Rotating axes. This is important on account of its bearing on the kinetic theory of the tides. Since the Cartesian co-ordinates x, y, z of any particle m of the system relative to the moving axes are functions of q1, q2, ... qn, of the form § 1 (1), we have, by (15)

2⅋ = Σm (ẋ² + ẏ² + z˙²),   2Τ0 = ω²Σm (x² + y²),

(24)
αr = Σm (x ∂y − y ∂x ),
∂qr ∂qr
(25)

whence

(r, s) = 2ω·Σm ∂(x, y) .
∂(qs, qr)
(26)

The conditions of relative equilibrium are given by (23).

It will be noticed that this expression V − T0, which is to be stationary, differs from the true potential energy by a term which represents the potential energy of the system in relation to fictitious “centrifugal forces.” The question of stability of relative equilibrium will be noticed later (§ 6).

It should be observed that the remarkable formula (20) may in the present case be obtained directly as follows. From (15) and (14) we find

dT = d (⅋ + T0) + ω·Σm (xÿ − yẍ) = d (⅋ − T0) + ω·Σ (xY − yX).
dt dt dt
(27)

This must be equal to the rate at which the forces acting on the system do work, viz. to

ωΣ (xY − yX) + Q11 + Q22 + ... + Qnn,

where the first term represents the work done in virtue of the rotation.

We have still to notice the modifications which Lagrange’s equations undergo when the co-ordinates q1, q2, ... qn Constrained systems. are not all independently variable. In the first place, we may suppose them connected by a number m (< n) of relations of the type

A (t, q1, q2, ... qn) = 0,   B (t, q1, q2, ... qn) = 0, &c.

(28)

These may be interpreted as introducing partial constraints into a previously free system. The variations δq1, δq2, ... δqn in the expressions (6) and (7) of § 2 which are to be equated are no longer independent, but are subject to the relations

∂A δq1 + ∂A δq2 + ... = 0,   ∂B δq1 + ∂B δq2 + ... = 0, &c.
∂q1 ∂q2 ∂q1 ∂q2
(29)

Introducing indeterminate multipliers λ, μ, ..., one for each of these equations, we obtain in the usual manner n equations of the type

d   ∂T ∂T = Qr + λ ∂A + μ ∂B + ...,
dt ∂q˙r ∂qr ∂qr ∂qr
(30)

in place of § 2 (10). These equations, together with (28), serve to determine the n co-ordinates q1, q2, ... qn and the m multipliers λ, μ, ....

When t does not occur explicitly in the relations (28) the system is said to be holonomic. The term connotes the existence of integral (as opposed to differential) relations between the co-ordinates, independent of the time.

Again, it may happen that although there are no prescribed relations between the co-ordinates q1, q2, ... qn, yet from the circumstances of the problem certain geometrical conditions are imposed on their variations, thus

A1δq1 + A2δq2 + ... = 0,   B1δq1 + B2δq2 + ... = 0, &c.,

(31)

where the coefficients are functions of q1, q2, ... qn and (possibly) of t. It is assumed that these equations are not integrable as regards the variables q1, q2, ... qn; otherwise, we fall back on the previous conditions. Cases of the present type arise, for instance, in ordinary dynamics when we have a solid rolling on a (fixed or moving) surface. The six co-ordinates which serve to specify the position of the solid at any instant are not subject to any necessary relation, but the conditions to be satisfied at the point of contact impose three conditions of the form (31). The general equations of motion are obtained, as before, by the method of indeterminate multipliers, thus

d   ∂T ∂T = Qr + λAr + μBr + ...
dt ∂q˙r ∂qr
(32)

The co-ordinates q1, q2, ... qn, and the indeterminate multipliers λ, μ, ..., are determined by these equations and by the velocity-conditions corresponding to (31). When t does not appear explicitly in the coefficients, these velocity-conditions take the forms

A11 + A22 + ... = 0,   B11 + B22 + ... = 0, &c.

(33)

Systems of this kind, where the relations (31) are not integrable, are called non-holonomic.

4. Hamiltonian Equations of Motion.

In the Hamiltonian form of the equations of motion of a conservative system with unvarying relations, the kinetic energy is supposed expressed in terms of the momenta p1, p2, ... and the co-ordinates q1, q2, ..., as in § 1 (19). Since the symbol δ now denotes a variation extending to the co-ordinates as well as to the momenta, we must add to the last member of § 1 (21) terms of the types

∂T δq1 + ∂T` δq2 + ....
∂q1 ∂q2
(1)

Since the variations δp1, δp2, ... δq1, δq2, ... may be taken to be independent, we infer the equations § 1 (23) as before, together with

∂T = − ∂T` ,   ∂T = − ∂T` , ...,
∂q1 ∂q1 ∂q2 ∂q2
(2)

Hence the Lagrangian equations § 2 (14) transform into

1 = − (T` + V),   ṗ2 = − (T` + V), ...
∂q1 ∂q2
(3)

If we write

H = T` + V,

(4)

so that H denotes the total energy of the system, supposed expressed in terms of the new variables, we get

1 = − ∂H ,   ṗ2 = − ∂H , ...
∂q1 ∂q2
(5)

If to these we join the equations

1 = ∂H ,   q˙2 = ∂H , ...,
∂p1 ∂p2
(6)

which follow at once from § 1 (23), since V does not involve p1, p2, ..., we obtain a complete system of differential equations of the first order for the determination of the motion.

The equation of energy is verified immediately by (5) and (6), since these make

dH = ∂H 1 + ∂H 2 + ... + ∂H 1 + ∂H 2 + ... = 0.
dt ∂p1 ∂p2 ∂q1 ∂q2
(7)

The Hamiltonian transformation is extended to the case of varying relations as follows. Instead of (4) we write

H = p11 + p22 + ... − T + V,

(8)

and imagine H to be expressed in terms of the momenta p1, p2, ..., the co-ordinates q1, q2, ..., and the time. The internal forces of the system are assumed to be conservative, with the potential energy V. Performing the variation δ on both sides, we find

δH = q˙1δp1 + ... − ∂T δq1 + ∂V δq + ...,
∂q1 ∂q1
(9)

terms which cancel in virtue of the definition of p1, p2, ... being omitted. Since δp1, δp2, ..., δq1, δq2, ... may be taken to be independent, we infer

1 = ∂H ,   q˙2 = ∂H , ...,
∂p1 ∂p2
(10)

and

(T − V) = − ∂H ,   (T − V) = − ∂H , ....
∂q1 ∂q1 ∂q2 ∂q2
(11)

It follows from (11) that

1 = − ∂H ,   ṗ2 = − ∂H , ....
∂q1 ∂q2
(12)

The equations (10) and (12) have the same form as above, but H is no longer equal to the energy of the system.

5. Cyclic Systems.

A cyclic or gyrostatic system is characterized by the following properties. In the first place, the kinetic energy is not affected if we alter the absolute values of certain of the co-ordinates, which we will denote by χ, χ′, χ″, ..., provided the remaining co-ordinates q1, q2, ... qm and the velocities, including of course the velocities χ˙, χ˙′, χ˙″, ..., are unaltered. Secondly, there are no forces acting on the system of the types χ, χ′, χ″, .... This case arises, for example, when the system includes gyrostats which are free to rotate about their axes, the co-ordinates χ, χ′, χ″, ... then being the angular co-ordinates of the gyrostats relatively to their frames. Again, in theoretical hydrodynamics we have the problem of moving solids in a frictionless liquid; the ignored co-ordinates χ, χ′, χ″, ... then refer to the fluid, and are infinite in number. The same question presents itself in various physical speculations where certain phenomena are ascribed to the existence of latent motions in the ultimate constituents of matter. The general theory of such systems has been treated by E.J. Routh, Lord Kelvin, and H.L.F. Helmholtz.

If we suppose the kinetic energy Τ to be expressed, as in Lagrange’s method, in terms of the co-ordinates and Routh’s equations. the velocities, the equations of motion corresponding to χ, χ′, χ″, ... reduce, in virtue of the above hypotheses, to the forms

d   ∂Τ = 0,   d   ∂Τ = 0,   d   ∂Τ = 0, ...,
dt ∂χ˙ dt ∂χ˙′ dt ∂χ˙″
(1)

whence

∂Τ = κ,   ∂Τ = κ′,   ∂Τ = κ″, ...,
∂χ˙ ∂χ˙′ ∂χ˙″
(2)

where κ, κ′, κ″, ... are the constant momenta corresponding to the cyclic co-ordinates χ, χ′, χ″, .... These equations are linear in χ˙, χ˙′, χ˙″, ...; solving them with respect to these quantities and substituting in the remaining Lagrangian equations, we obtain m differential equations to determine the remaining co-ordinates q1, q2, ... qm. The object of the present investigation is to ascertain the general form of the resulting equations. The retained co-ordinates q1, q2, ... qm may be called (for distinction) the palpable co-ordinates of the system; in many practical questions they are the only co-ordinates directly in evidence.

If, as in § 1 (25), we write

R = T − κχ˙ − κ′χ˙′ − κ″χ˙″ − ...,

(3)

and imagine R to be expressed by means of (2) as a quadratic function of q˙1, q˙2, ... q˙m, κ, κ′, κ″, ... with coefficients which are in general functions of the co-ordinates q1, q2, ... qm, then, performing the operation δ on both sides, we find

∂R δq˙1 + ... + ∂R δκ + ... + ∂R δq1 + ... = ∂T δq˙1 + ... + ∂T δq1 + ...
∂q˙1 ∂κ ∂q1 ∂q˙1 ∂q1
+ ∂T δχ˙ + ... + ∂T δq1 + ... − κδχ˙ − χ˙δκ − ....
∂χ˙ ∂χ1
(4)

Omitting the terms which cancel by (2), we find

∂T = ∂R ,   ∂T = ∂R , ...,
∂q˙1 ∂q˙1 ∂q˙2 ∂q˙2
(5)
∂T = ∂R ,   ∂T = ∂R , ...,
∂q1 ∂q1 ∂q2 ∂q2
(6)
χ˙ = − ∂R ,   χ˙′ = − ∂R ,   χ˙″ = − ∂R , ...
∂κ ∂κ′ ∂κ″
(7)

Substituting in § 2 (10), we have

d   ∂R ∂R = Q1,   d   ∂R ∂R = Q2, ...
dt ∂q˙1 ∂q1 dt ∂q˙2 ∂q2
(8)

These are Routh’s forms of the modified Lagrangian equations. Equivalent forms were obtained independently by Helmholtz at a later date.

The function R is made up of three parts, thus

R = R2, 0 + R1, 1 + R0, 2, ...

(9)

where R2, 0 is a homogeneous quadratic function of q˙1, q˙2, ... q˙m, R0, 2 is Kelvin’s equations. a homogeneous quadratic function of κ, κ′, κ″, ..., whilst R1, 1 consists of products of the velocities q˙1, q˙2, ... q˙m into the momenta κ, κ′, κ″.... Hence from (3) and (7) we have

T = R − (κ ∂R + κ′ ∂R + κ″ ∂R + ...) = R2, 0 − R0, 2.
∂κ ∂κ′ ∂κ″
(10)

If, as in § 1 (30), we write this in the form

Τ = ⅋ + K,

(11)

then (3) may be written

R = ⅋ − K + β11 + β22 + ...,

(12)

where β1, β2, ... are linear functions of κ, κ′, κ″, ..., say

βr = αrκ + α′rκ′ + α″rκ″ + ...,

(13)

the coefficients αr, α′r, α″r, ... being in general functions of the co-ordinates q1, q2, ... qm. Evidently βr denotes that part of the momentum-component ∂R / ∂q˙r which is due to the cyclic motions. Now

d   ∂R = d ( ∂⅋ + βr) = d   ∂⅋ + ∂βr 1 + ∂βr 2+ ...,
dt ∂q˙r dt ∂q˙r dt ∂q˙r ∂q1 ∂q2
(14)
∂R = ∂⅋ ∂K + ∂β1 1 + ∂β2 2 + ....
∂qr ∂qr ∂qr ∂qr ∂qr
(15)

Hence, substituting in (8), we obtain the typical equation of motion of a gyrostatic system in the form

d   ∂⅋ ∂⅋ + (r, 1) q˙1 + (r, 2) q˙2 + ... + (r, s) q˙s + ... + ∂K = Qr,
dt ∂q˙r ∂qr ∂qr
(16)

where

(r, s) = ∂βr ∂βs .
∂qs ∂qr
(17)

This form is due to Lord Kelvin. When q1, q2, ... qm have been determined, as functions of the time, the velocities corresponding to the cyclic co-ordinates can be found, if required, from the relations (7), which may be written

χ˙ = ∂K − α11 − α22 − ...,
∂κ
(18)
χ˙′ = ∂K − α′11 − α′22 − ...,
∂κ′

&c., &c.

It is to be particularly noticed that

(r, r) = 0, (r, s) = −(s, r).

(19)

Hence, if in (16) we put r = 1, 2, 3, ... m, and multiply by q˙1, q˙2, ... q˙m respectively, and add, we find

d (⅋ + K) = Q11 + Q22 + ...,
dt
(20)

or, in the case of a conservative system

⅋ + V + K = const.,

(21)

which is the equation of energy.

The equation (16) includes § 3 (17) as a particular case, the eliminated co-ordinate being the angular co-ordinate of a rotating solid having an infinite moment of inertia.

In the particular case where the cyclic momenta κ, κ′, κ″, ... are all zero, (16) reduces to

d   ∂⅋ ∂⅋ = Qr.
dt ∂q˙r ∂qr
(22)

The form is the same as in § 2, and the system now behaves, as regards the co-ordinates q1, q2, ... qm, exactly like the acyclic type there contemplated. These co-ordinates do not, however, now fix the position of every particle of the system. For example, if by suitable forces the system be brought back to its initial configuration (so far as this is defined by q1, q2, ..., qm), after performing any evolutions, the ignored co-ordinates χ, χ′, χ″, ... will not in general return to their original values.

If in Lagrange’s equations § 2 (10) we reverse the sign of the time-element dt, the equations are unaltered. The motion is therefore reversible; that is to say, if as the system is passing through any configuration its velocities q˙1, q˙2, ..., q˙m be all reversed, it will (if the forces be the same in the same configuration) retrace its former path. But it is important to observe that the statement does not in general hold of a gyrostatic system; the terms of (16), which are linear in q˙1, q˙2, ..., q˙m, change sign with dt, whilst the others do not. Hence the motion of a gyrostatic system is not reversible, unless indeed we reverse the cyclic motions as well as the velocities q˙1, q˙2, ..., q˙m. For instance, the precessional motion of a top cannot be reversed unless we reverse the spin.

The conditions of equilibrium of a system with latent cyclic motions Kineto-statics. are obtained by putting q˙1 = 0, q˙2 = 0, ... q˙m = 0 in (16); viz. they are

Q1 = ∂K ,   Q2 = ∂K , ...
∂q1 ∂q2
(23)

These may of course be obtained independently. Thus if the system be guided from (apparent) rest in the configuration (q1, q2, ... qm) to rest in the configuration q1 + δq1, q2 + δq2, ..., qm + δqm, the work done by the forces must be equal to the increment of the kinetic energy. Hence

Q1δq1 + Q2δq2 + ... = δK,

(24)

which is equivalent to (23). The conditions are the same as for the equilibrium of a system without latent motion, but endowed with potential energy K. This is important from a physical point of view, as showing how energy which is apparently potential may in its ultimate essence be kinetic.

By means of the formulae (18), which now reduce to

χ˙ = ∂K ,   χ˙′ = ∂K ,   χ˙″ = ∂K , ...,
∂κ ∂κ′ ∂κ″
(25)

K may also be expressed as a homogeneous quadratic function of the cyclic velocities χ˙, χ˙′, χ˙″,... Denoting it in this form by Τ0, we have

δ (T0 + K) = 2δK = δ (κχ˙ + κ′χ˙′ + κ″χ˙″ + ...)

(26)

Performing the variations, and omitting the terms which cancel by (2) and (25), we find

∂Τ0 = − ∂K ,   ∂Τ0 = − ∂K , ...,
∂q1 ∂q1 ∂q2 ∂q2
(27)

so that the formulae (23) become

Q1 = − ∂Τ0 ,   Q2 = − ∂Τ0 , ...
∂q1 ∂q2
(28)

A simple example is furnished by the top (Mechanics, § 22). The cyclic co-ordinates being ψ, φ, we find

2⅋ = Aθ˙²,   2K = (μ − ν cos θ)² + ν² ,
A sin² θ C

0 = A sin² θψ˙² + C (φ˙ + ψ cos θ)²,

(29)

whence we may verify that ∂Τ0 / ∂θ = −∂K / ∂θ in accordance with (27). And the condition of equilibrium

∂K = − ∂V
∂θ ∂θ
(30)

gives the condition of steady precession.

6. Stability of Steady Motion.

The small oscillations of a conservative system about a configuration of equilibrium, and the criterion of stability, are discussed in Mechanics, § 23. The question of the stability of given types of motion is more difficult, owing to the want of a sufficiently general, and at the same time precise, definition of what we mean by “stability.” A number of definitions which have been propounded by different writers are examined by F. Klein and A. Sommerfeld in their work Über die Theorie des Kreisels (1897-1903). Rejecting previous definitions, they base their criterion of stability on the character of the changes produced in the path of the system by small arbitrary disturbing impulses. If the undisturbed path be the limiting form of the disturbed path when the impulses are indefinitely diminished, it is said to be stable, but not otherwise. For instance, the vertical fall of a particle under gravity is reckoned as stable, although for a given impulsive disturbance, however small, the deviation of the particle’s position at any time t from the position which it would have occupied in the original motion increases indefinitely with t. Even this criterion, as the writers quoted themselves recognize, is not free from ambiguity unless the phrase “limiting form,” as applied to a path, be strictly defined. It appears, moreover, that a definition which is analytically precise may not in all cases be easy to reconcile with geometrical prepossessions. Thus a particle moving in a circle about a centre of force varying inversely as the cube of the distance will if slightly disturbed either fall into the centre, or recede to infinity, after describing in either case a spiral with an infinite number of convolutions. Each of these spirals has, analytically, the circle as its limiting form, although the motion in the circle is most naturally described as unstable.

A special form of the problem, of great interest, presents itself in the steady motion of a gyrostatic system, when the non-eliminated co-ordinates q1, q2, ... qm all vanish (see § 5). This has been discussed by Routh, Lord Kelvin and Tait, and Poincaré. These writers treat the question, by an extension of Lagrange’s method, as a problem of small oscillations. Whether we adopt the notion of stability which this implies, or take up the position of Klein and Sommerfeld, there is no difficulty in showing that stability is ensured if V + K be a minimum as regards variations of q1, q2, ... qm. The proof is the same as that of Dirichlet for the case of statical stability.

We can illustrate this condition from the case of the top, where, in our previous notation,

V + K = Mgh cos θ + (μ − νcos θ)² + ν² .
2A sin² θ 2C
(1)

To examine whether the steady motion with the centre of gravity vertically above the pivot is stable, we must put μ = ν. We then find without difficulty that V + K is a minimum provided ν² ≥ 4AMgh. The method of small oscillations gave us the condition ν² > 4AMgh, and indicated instability in the cases ν² ≤ 4AMgh. The present criterion can also be applied to show that the steady precessional motions in which the axis has a constant inclination to the vertical are stable.

The question remains, as before, whether it is essential for stability that V + K should be a minimum. It appears that from the point of view of the theory of small oscillations it is not essential, and that there may even be stability when V + K is a maximum. The precise conditions, which are of a somewhat elaborate character, have been formulated by Routh. An important distinction has, however, been established by Thomson and Tait, and by Poincaré, between what we may call ordinary or temporary stability (which is stability in the above sense) and permanent or secular stability, which means stability when regard is had to possible dissipative forces called into play whenever the co-ordinates q1, q2, ... qm vary. Since the total energy of the system at any instant is given (in the notation of § 5) by an expression of the form ⅋ + V + K, where ⅋ cannot be negative, the argument of Thomson and Tait, given under Mechanics, § 23, for the statical question, shows that it is a necessary as well as a sufficient condition for secular stability that V + K should be a minimum. When a system is “ordinarily” stable, but “secularly” unstable, the operation of the frictional forces is to induce a gradual increase in the amplitude of the free vibrations which are called into play by accidental disturbances.

There is a similar theory in relation to the constrained systems considered in § 3 above. The equation (21) there given leads to the conclusion that for secular stability of any type of motion in which the velocities q˙1, q˙2, ... q˙n are zero it is necessary and sufficient that the function V − Τ0 should be a minimum.

The simplest possible example of this is the case of a particle at the lowest point of a smooth spherical bowl which rotates with constant angular velocity (ω) about the vertical diameter. This position obviously possesses “ordinary” stability. If a be the radius of the bowl, and θ denote angular distance from the lowest point, we have

V − Τ0 = mga(1 − cos θ) − ½mω²a² sin² θ;

(2)

this is a minimum for θ = 0 only so long as ω² < g/a. For greater values of ω the only position of “permanent” stability is that in which the particle rotates with the bowl at an angular distance cos−1 (g/ω²a) from the lowest point. To examine the motion in the neighbourhood of the lowest point, when frictional forces are taken into account, we may take fixed ones, in a horizontal plane, through the lowest point. Assuming that the friction varies as the relative velocity, we have

ẍ = −p²x − k (ẋ + ωy),
ÿ = −p²y − k (ẏ − ωx),

(3)

where p² = g/a. These combine into

z¨ + kz˙ + (p² − ikω) z = 0,

(4)

where z = x + iy, i = √−1. Assuming z = Ceλt, we find

λ = −½k(1 ∓ ω/p) ± ip,

(5)

if the square of k be neglected. The complete solution is then

x + iy = C1e−β1t eipt + + C2e−β2t e−ipt,

(6)

where

β1 = ½k (1 − ω/p),   β2 = ½k (1 + ω/p).

(7)

This represents two superposed circular vibrations, in opposite directions, of period 2π/p. If ω < p, the amplitude of each of these diminishes asymptotically to zero, and the position x = 0, y = 0 is permanently stable. But if ω > p the amplitude of that circular vibration which agrees in sense with the rotation ω will continually increase, and the particle will work its way in an ever-widening spiral path towards the eccentric position of secular stability. If the bowl be not spherical but ellipsoidal, the vertical diameter being a principal axis, it may easily be shown that the lowest position is permanently stable only so long as the period of the rotation is longer than that of the slower of the two normal modes in the absence of rotation (see Mechanics, § 13).

7. Principle of Least Action.

The preceding theories give us statements applicable to the system at any one instant of its motion. We now come to a series of theorems relating to the whole motion of the system between any two configurations through which it passes, Stationary Action. viz. we consider the actual motion and compare it with other imaginable motions, differing infinitely little from it, between the same two configurations. We use the symbol δ to denote the transition from the actual to any one of the hypothetical motions.

The best-known theorem of this class is that of Least Action, originated by P.L.M. de Maupertuis, but first put in a definite form by Lagrange. The “action” of a single particle in passing from one position to another is the space-integral of the momentum, or the time-integral of the vis viva. The action of a dynamical system is the sum of the actions of its constituent particles, and is accordingly given by the formula

A = Σ mvds = Σ mv²dt = 2 Τdt.

(1)

The theorem referred to asserts that the free motion of a conservative system between any two given configurations is characterized by the property

δA = 0,

(2)

provided the total energy have the same constant value in the varied motion as in the actual motion.

If t, t′ be the times of passing through the initial and final configurations respectively, we have

δA = δ t′t Σm (ẋ² + ẏ² + z˙²) dt

= t′t δΤdt + 2Τ′δt′ + 2Τδt,

(3)

since the upper and lower limits of the integral must both be regarded as variable. This may be written

δA = t′t δΤdt + t′t Σm (ẋδẋ + ẏδẏ + z˙δz˙) dt + 2Τ′δt′ − 2Τδt

= t′t δΤdt + [ Σm (ẋδx + ẏδy + z˙δz)]t′t

t′t Σm (ẍδx + ÿδy + z¨δz) dt + 2Τ′δt′ − 2Τδt.

(4)

Now, by d’Alembert’s principle,

Σm (ẍδx + ÿδy + z¨δz) = −δV,

(5)

and by hypothesis we have

δ(Τ + V) = 0.

(6)

The formula therefore reduces to

δA = [Σm (ẋδx + ẏδy + z˙δz)]t′t + 2Τ′δt′ − 2Τδt.

(7)

Since the terminal configurations are unaltered, we must have at the lower limit

δx + ẋδt = 0,   δy + ẏδt = 0,   δz + z˙δt = 0,

(8)

with similar relations at the upper limit. These reduce (7) to the form (2).

The equation (2), it is to be noticed, merely expresses that the variation of A vanishes to the first order; the phrase stationary action has therefore been suggested as indicating more accurately what has been proved. The action in the free path between two given configurations is in fact not invariably a minimum, and even when a minimum it need not be the least possible subject to the given conditions. Simple illustrations are furnished by the case of a single particle. A particle moving on a smooth surface, and free from extraneous force, will have its velocity constant; hence the theorem in this case resolves itself into

δ ds = 0,

(9)

i.e. the path must be a geodesic line. Now a geodesic is not necessarily the shortest path between two given points on it; for example, on the sphere a great-circle arc ceases to be the shortest path between its extremities when it exceeds 180°. More generally, taking any surface, let a point P, starting from O, move along a geodesic; this geodesic will be a minimum path from O to P until P passes through a point O′ (if such exist), which is the intersection with a consecutive geodesic through O. After this point the minimum property ceases. On an anticlastic surface two geodesics cannot intersect more than once, and each geodesic is therefore a minimum path between any two of its points. These illustrations are due to K.G.J. Jacobi, who has also formulated the general criterion, applicable to all dynamical systems, as follows:—Let O and P denote any two configurations on a natural path of the system. If this be the sole free path from O to P with the prescribed amount of energy, the action from O to P is a minimum. But if there be several distinct paths, let P vary from coincidence with O along the first-named path; the action will then cease to be a minimum when a configuration O′ is reached such that two of the possible paths from O to O′ coincide. For instance, if O and P be positions on the parabolic path of a projectile under gravity, there will be a second path (with the same energy and therefore the same velocity of projection from O), these two paths coinciding when P is at the other extremity (O′, say) of the focal chord through O. The action from O to P will therefore be a minimum for all positions of P short of O′. Two configurations such as O and O′ in the general statement are called conjugate kinetic foci. Cf. Variations, Calculus of.

Before leaving this topic the connexion of the principle of stationary action with a well-known theorem of optics may be noticed. For the motion of a particle in a conservative field of force the principle takes the form

δ vds = 0.

(10)

On the corpuscular theory of light v is proportional to the refractive index μ of the medium, whence

δ μds = 0.

(11)

In the formula (2) the energy in the hypothetical motion is prescribed, whilst the time of transit from the initial to the final configuration Hamiltonian principle. is variable. In another and generally more convenient theorem, due to Hamilton, the time of transit is prescribed to be the same as in the actual motion, whilst the energy may be different and need not (indeed) be constant. Under these conditions we have

δ t′t (T − V)dt = 0,

(12)

where t, t′ are the prescribed times of passing through the given initial and final configurations. The proof of (12) is simple; we have

δ t′t (T − V)dt = t′t (δΤ − δV)dt = t′t {Σm (ẋδẋ + ẏδẏ + z˙δz˙) − δV} dt

= [ Σm (ẋδx + ẏδy + z˙δz)]t′tt′t {Σm (ẍδx + ÿδy + z¨δz) + δV} dt.

(13)

The integrated terms vanish at both limits, since by hypothesis the configurations at these instants are fixed; and the terms under the integral sign vanish by d’Alembert’s principle.

The fact that in (12) the variation does not affect the time of transit renders the formula easy of application in any system of co-ordinates. Thus, to deduce Lagrange’s equations, we have

t′t (δΤ − δV) dt = t′t { ∂T δq˙1 + ∂T δq1 + ... − ∂V δq1 − ... } dt
∂q˙1 ∂q1 ∂q1
= [p1δq1 + p2δq2 + ...]t′tt′t { [1 ∂T + ∂V ) δq1 + (2 ∂T + ∂V ) δq2 + ...} dt.
∂q1 ∂q1 ∂q2 ∂q2
(14)

The integrated terms vanish at both limits; and in order that the remainder of the right-hand member may vanish it is necessary that the coefficients of δq1, δq2, ... under the integral sign should vanish for all values of t, since the variations in question are independent, and subject only to the condition of vanishing at the limits of integration. We are thus led to Lagrange’s equation of motion for a conservative system. It appears that the formula (12) is a convenient as well as a compact embodiment of the whole of ordinary dynamics.

The modification of the Hamiltonian principle appropriate to Extension to cyclic systems. the case of cyclic systems has been given by J. Larmor. If we write, as in § 1 (25),

R = Τ − κχ˙ − κ′χ˙′ − κ″χ˙″ − ...,

(15)

we shall have

δ t′t (R − V) dt = 0,

(16)

provided that the variation does not affect the cyclic momenta κ, κ′, κ″, ..., and that the configurations at times t and t′ are unaltered, so far as they depend on the palpable co-ordinates q1, q2, ... qm. The initial and final values of the ignored co-ordinates will in general be affected.

To prove (16) we have, on the above understandings,

δ t′t (R − V) dt = t′t (δT − κδχ˙ − ... − δV) dt

= t′t ( ∂T δq˙1 + ... + ∂T δq1 + ... − δV) dt,
∂q˙1 ∂q1
(17)

where terms have been cancelled in virtue of § 5 (2). The last member of (17) represents a variation of the integral

t′t (T − V) dt

on the supposition that δX = 0, δX′ = 0, δX″ = 0, ... throughout, whilst δq1, δq2, δqm vanish at times t and t′; i.e. it is a variation in which the initial and final configurations are absolutely unaltered. It therefore vanishes as a consequence of the Hamiltonian principle in its original form.

Larmor has also given the corresponding form of the principle of least action. He shows that if we write

A = (2T − κχ˙ − κ′χ˙′ − κ″χ˙″ − ...) dt,

(18)

then

δA = 0,

(19)

provided the varied motion takes place with the same constant value of the energy, and with the same constant cyclic momenta, between the same two configurations, these being regarded as defined by the palpable co-ordinates alone.

§ 8. Hamilton’s Principal and Characteristic Functions.

In the investigations next to be described a more extended meaning is given to the symbol δ. We will, in the first instance, denote by it an infinitesimal variation of the most Principal function. general kind, affecting not merely the values of the co-ordinates at any instant, but also the initial and final configurations and the times of passing through them. If we put

S = t′t (Τ − V) dt,

(1)

we have, then,

δS = (T′ − V′) δt′ − (T − V) δt + t′t (δΤ − δV) dt

= (T′ − V′) δt′ − (T − V) δt + [Σm (ẋδx + ẏδy + z˙δz)]t′t.

(2)

Let us now denote by x′ + δx′, y′ + δy′, z′ + δz′, the final co-ordinates (i.e. at time t′ + δt′) of a particle m. In the terms in (2) which relate to the upper limit we must therefore write δx′ − ẋ′δt′, δy′ − ẏ′δt′, δz′ − z˙′δt′ for δx, δy, δz. With a similar modification at the lower limit, we obtain

δS = −Hδτ + Σm (ẋ′δx′ + ẏ′δy′ + z˙′δz′) − Σm (ẋδx + ẏδy + z˙δz),

(3)

where H (= T + V) is the constant value of the energy in the free motion of the system, and τ (= t′ − t) is the time of transit. In generalized co-ordinates this takes the form

δS = −Hδτ + p′1δq′1 + p′2δq′2 + ... − p1δq1 − p2δq2 − ....

(4)

Now if we select any two arbitrary configurations as initial and final, it is evident that we can in general (by suitable initial velocities or impulses) start the system so that it will of itself pass from the first to the second in any prescribed time τ. On this view of the matter, S will be a function of the initial and final co-ordinates (q1, q2, ... and q′1, q′2, ...) and the time τ, as independent variables. And we obtain at once from (4)

p′1 = ∂S ,   p′2 = ∂S , ... ,
∂q′1 ∂q′2
(5)
p1 = − ∂S ,   p2 = − ∂S , ... ,
∂q1 ∂q2

and

H = − ∂S .
∂τ
(6)

S is called by Hamilton the principal function; if its general form for any system can be found, the preceding equations suffice to determine the motion resulting from any given conditions. If we substitute the values of p1, p2, ... and H from (5) and (6) in the expression for the kinetic energy in the form T′ (see § 1), the equation

T¹ + V = H

(7)

becomes a partial differential equation to be satisfied by S. It has been shown by Jacobi that the dynamical problem resolves itself into obtaining a “complete” solution of this equation, involving n + 1 arbitrary constants. This aspect of the subject, as a problem in partial differential equations, has received great attention at the hands of mathematicians, but must be passed over here.

There is a similar theory Characteristic function. for the function

A = 2 Tdt = S + Hτ

(8)

It follows from (4) that

δA = τδH + p′1δq′1 + p′2δq′2 + ... − p1δq1 − p2δq2 − ....

(9)

This formula (it may be remarked) contains the principle of “least action” as a particular case. Selecting, as before, any two arbitrary configurations, it is in general possible to start the system from one of these, with a prescribed value of the total energy H, so that it shall pass through the other. Hence, regarding A as a function of the initial and final co-ordinates and the energy, we find

p′1 = ∂A ,   p′2 = ∂A , ... ,
∂q′1 ∂q′2
(10)
p1 = − ∂A ,   p2 = − ∂A , ... ,
∂q1 ∂q2

and

τ = ∂A .
∂H
(11)

A is called by Hamilton the characteristic function; it represents, of course, the “action” of the system in the free motion (with prescribed energy) between the two configurations. Like S, it satisfies a partial differential equation, obtained by substitution from (10) in (7).

The preceding theorems are easily adapted to the case of cyclic systems. We have only to write

S = t′t (R − V) dt = t′t (T − κχ˙ − κ′χ˙′ − ... − V) dt

(12)

in place of (1), and

A = (2T − κχ˙ − κ′χ˙′ − ...) dt,

(13)

in place of (8); cf. § 7 ad fin. It is understood, of course, that in (12) S is regarded as a function of the initial and final values of the palpable co-ordinates q1, q2, ... qm, and of the time of transit τ, the cyclic momenta being invariable. Similarly in (13), A is regarded as a function of the initial and final values of q1, q2, ... qm, and of the total energy H, with the cyclic momenta invariable. It will be found that the forms of (4) and (9) will be conserved, provided the variations δq1, δq2, ... be understood to refer to the palpable co-ordinates alone. It follows that the equations (5), (6) and (10), (11) will still hold under the new meanings of the symbols.

9. Reciprocal Properties of Direct and Reversed Motions.

We may employ Hamilton’s principal function to prove a very remarkable formula connecting any two slightly disturbed Lagrange’s formula. natural motions of the system. If we use the symbols δ and Δ to denote the corresponding variations, the theorem is

d Σ (δpr·Δqr − Δpr·δqr) = 0;
dt
(1)

or integrating from t to t′,

Σ (δp′r·Δq′r − Δq′r·δq′r) = Σ (δpr·Δqr − Δpr·δqr).

(2)

If for shortness we write

(r, s) = ∂²S ,   (r, s′) = ∂²S ,
∂qr∂qs ∂qr∂q′s
(3)

we have

∂pr = −Σs (r, s) δqs − Σs (r, s′) δq′s

(4)

with a similar expression for Δpr. Hence the right-hand side of (2) becomes

− Σrs(r, s) δqs + Σs(r, s′) δq′s} Δqr + Σrs(r, s)Δqs + Σs(r, s′) Δq′s} δqr

= ΣrΣs(r, s′) {δqr·Δq′s − Δqr·δq′s}.

(5)

The same value is obtained in like manner for the expression on the left hand of (2); hence the theorem, which, in the form (1), is due to Lagrange, and was employed by him as the basis of his method of treating the dynamical theory of Variation of Arbitrary Constants.

The formula (2) leads at once to some remarkable reciprocal relations which were first expressed, in their complete form, by Helmholtz. Consider any natural motion of a conservative system between two configurations O and O′ Helmholtz’s reciprocal theorems. through which it passes at times t and t′ respectively, and let t′ − t = τ. As the system is passing through O let a small impulse δpr be given to it, and let the consequent alteration in the co-ordinate qs after the time τ be δq′s. Next consider the reversed motion of the system, in which it would, if undisturbed, pass from O′ to O in the same time τ. Let a small impulse δp′s be applied as the system is passing through O′, and let the consequent change in the co-ordinate qr after a time τ be δqr. Helmholtz’s first theorem is to the effect that

δqr : δp′s = δq′s : δpr.

(6)

To prove this, suppose, in (2), that all the δq vanish, and likewise all the δp with the exception of δpr. Further, suppose all the Δq′ to vanish, and likewise all the Δp′ except Δp′s, the formula then gives

δpr·Δqr = −Δp′s·δq′s,

(7)

which is equivalent to Helmholtz’s result, since we may suppose the symbol Δ to refer to the reversed motion, provided we change the signs of the Δp. In the most general motion of a top (Mechanics, § 22), suppose that a small impulsive couple about the vertical produces after a time τ a change δθ in the inclination of the axis, the theorem asserts that in the reversed motion an equal impulsive couple in the plane of θ will produce after a time τ a change δψ, in the azimuth of the axis, which is equal to δθ. It is understood, of course, that the couples have no components (in the generalized sense) except of the types indicated; for instance, they may consist in each case of a force applied to the top at a point of the axis, and of the accompanying reaction at the pivot. Again, in the corpuscular theory of light let O, O′ be any two points on the axis of a symmetrical optical combination, and let V, V′ be the corresponding velocities of light. At O let a small impulse be applied perpendicular to the axis so as to produce an angular deflection δθ, and let β′ be the corresponding lateral deviation at O′. In like manner in the reversed motion, let a small deflection δθ′ at O′ produce a lateral deviation β at O. The theorem (6) asserts that

β = β′β′ ,
V′δθ′ Vδθ
(8)

or, in optical language, the “apparent distance” of O from O′ is to that of O′ from O in the ratio of the refractive indices at O′ and O respectively.

In the second reciprocal theorem of Helmholtz the configuration O is slightly varied by a change δqr in one of the co-ordinates, the momenta being all unaltered, and δq′s is Helmholtz’s second reciprocal theorem. the consequent variation in one of the momenta after time τ. Similarly in the reversed motion a change δp′s produces after time τ a change of momentum δpr. The theorem asserts that

δp′s : δqr = δpr : δq′s

(9)

This follows at once from (2) if we imagine all the δp to vanish, and likewise all the δq save δqr, and if (further) we imagine all the Δp′ to vanish, and all the Δq′ save Δq′s. Reverting to the optical illustration, if F, F′, be principal foci, we can infer that the convergence at F′ of a parallel beam from F is to the convergence at F of a parallel beam from F′ in the inverse ratio of the refractive indices at F′ and F. This is equivalent to Gauss’s relation between the two principal focal lengths of an optical instrument. It may be obtained otherwise as a particular case of (8).

We have by no means exhausted the inferences to be drawn from Lagrange’s formula. It may be noted that (6) includes as particular cases various important reciprocal relations in optics and acoustics formulated by R.J.E. Clausius, Helmholtz, Thomson (Lord Kelvin) and Tait, and Lord Rayleigh. In applying the theorem care must be taken that in the reversed motion the reversal is complete, and extends to every velocity in the system; in particular, in a cyclic system the cyclic motions must be imagined to be reversed with the rest. Conspicuous instances of the failure of the theorem through incomplete reversal are afforded by the propagation of sound in a wind and the propagation of light in a magnetic medium.

It may be worth while to point out, however, that there is no such limitation to the use of Lagrange’s formula (1). In applying it to cyclic systems, it is convenient to introduce conditions already laid down, viz. that the co-ordinates qr are the palpable co-ordinates and that the cyclic momenta are invariable. Special inference can then be drawn as before, but the interpretation cannot be expressed so neatly owing to the non-reversibility of the motion.

Authorities.—The most important and most accessible early authorities are J.L. Lagrange, Mécanique analytique (1st ed. Paris, 1788, 2nd ed. Paris, 1811; reprinted in Œuvres, vols. xi., xii., Paris, 1888-89); Hamilton, “On a General Method in Dynamics,” Phil. Trans. 1834 and 1835; C.G.J. Jacobi, Vorlesungen über Dynamik (Berlin, 1866, reprinted in Werke, Supp.-Bd., Berlin, 1884). An account of the extensive literature on the differential equations of dynamics and on the theory of variation of parameters is given by A. Cayley, “Report on Theoretical Dynamics,” Brit. Assn. Rep. (1857), Mathematical Papers, vol. iii. (Cambridge, 1890). For the modern developments reference may be made to Thomson and Tait, Natural Philosophy (1st ed. Oxford, 1867, 2nd ed. Cambridge, 1879); Lord Rayleigh, Theory of Sound, vol. i. (1st ed. London, 1877; 2nd ed. London, 1894); E.J. Routh, Stability of Motion (London, 1877), and Rigid Dynamics (4th ed. London, 1884); H. Helmholtz, “Über die physikalische Bedeutung des Prinzips der kleinsten Action,” Crelle, vol. c., 1886, reprinted (with other cognate papers) in Wiss. Abh. vol. iii. (Leipzig, 1895); J. Larmor, “On Least Action,” Proc. Lond. Math. Soc. vol. xv. (1884); E.T. Whittaker, Analytical Dynamics (Cambridge, 1904). As to the question of stability, reference may be made to H. Poincaré, “Sur l’équilibre d’une masse fluide animée d’un mouvement de rotation” Acta math. vol. vii. (1885); F. Klein and A. Sommerfeld, Theorie des Kreisels, pts. 1, 2 (Leipzig, 1897-1898); A. Lioupanoff and J. Hadamard, Liouville, 5me série, vol. iii. (1897); T.J.I. Bromwich, Proc. Lond. Math. Soc. vol. xxxiii. (1901). A remarkable interpretation of various dynamical principles is given by H. Hertz in his posthumous work Die Prinzipien der Mechanik (Leipzig, 1894), of which an English translation appeared in 1900.

(H. Lb.)