Theoretical Physics
Solution Manual
Ian D Lawrie
September 2012
This document can be downloaded from http://www.unifiedgrandtours.org. ⃝
c Ian D Lawrie 2012
It may be freely shared, but may not be altered or sold.
, Chapter 2
Chapter 2
Geometry
Exercise 2.1
Consider a Cartesian coordinate system S and and a second one, S ′ , which
is obtained by giving S a velocity v, without rotating its axes. Then the
origin of S ′ moves with constant velocity v relative to S, and we take the
two origins to coincide at t = t′ = 0. Assume that the relation between the
two sets of coordinates is linear and that space is isotropic. The most general
form of the transformation law can then be written as
[ ] [ ]
x′ = α (1 − λv 2 )x + (λv · x − βt)v t′ = γ t − (δ/c2 )v · x
where α, β, γ, δ and λ are functions of v 2 . For the case that v is in the
positive x direction, write out the transformations for the four coordinates.
Write down the trajectory of the S ′ origin as seen in S and that of the S origin
as seen in S ′ and show that β = 1 and α = γ. Write down the trajectories seen
in S and S ′ of a light ray emitted from the origin at t = t′ = 0 that travels
in the positive x direction, assuming that it is observed to travel with speed
c in each case. Show that δ = 1. The transformation from S ′ to S should be
the same as the transformation from S to S ′ , except for the replacement of
v by −v. Use this to complete the derivation of the Lorentz transformation
[2.2] by finding γ and λ.
Solution
First, a few words about the way this question is set up. Once we have learned
enough about the geometry of Minkowski spacetime, the best way of arriving
2
This document can be downloaded from http://www.unifiedgrandtours.org. ⃝
c Ian D Lawrie 2012
It may be freely shared, but may not be altered or sold.
, Chapter 2
at the Lorentz transformation is to ask about coordinate transformations that
preserve the metric or, equivalently, the form of the proper time interval [2.6].
Thus, an inertial Cartesian frame of reference is a set of coordinates such that
′
c2 (dτ )2 = c2 (dt)2 − (dx)2 − (dy)2 − (dz)2 and, using xµ for (ct, x, y, z) and xµ
′ ′ ′
for (ct′ , x′ , y ′ , z ′ ), we look for a constant matrix Λµ µ such that, if xµ = Λµ µ xµ ,
then
c2 (dt′ )2 − (dx′ )2 − (dy ′ )2 − (dz ′ )2 = c2 (dt)2 − (dx)2 − (dy)2 − (dz)2 .
By studying the most general matrix that satisfies this requirement, we find
that the change of coordinates can be interpreted as a combination of a rela-
tive velocity of the origins and a rotation of the spatial axes. (There is some
further discussion in §3.5.) For the purposes of this question, we are taking a
more primitive point of view, by simply trying to find a transformation rule
that works, without any insight into its geometrical meaning. For that rea-
son, the terminology needs to be considered carefully in the light of the later
theory. In particular, a ‘rotation of spatial axes’ turns out to mean different
things in two frames of reference that are in relative motion.
In setting up the problem, I assumed that space (more accurately, space-
time) is isotropic. That means that there is no naturally-occurring vector
that distinguishes one direction from any other direction. Consequently, the
new 3-dimensional vector x′ must be constructed from the only vectors we
have to hand, namely x and v. That is, x′ = Ax + Bv. The coefficients A
and B can depend only on scalar quantities that are unchanged by spatial
rotations, namely t and the dot products of vectors, x · x = |x|2 , v · v = v 2
and v · x. Since we also assume that the transformation is linear in x and
t, we find that A can be a function only of v 2 , while B can only have the
form B = B1 (v 2 )v · x + B2 (v 2 )t. Similarly, t′ , which is a scalar from the 3-
dimensional point of view, can only have the form t′ = C1 (v 2 )t + C2 (v 2 )v · x.
I traded in the five functions A, B1 , B2 , C1 and C2 for five other functions α,
β, γ, δ and λ because I happen to know that this will simplify the algebra.
Now for the problem itself. Say that v = (v, 0, 0). Then the Lorentz
transformation given above reads explicitly
x′ = α[(1 − λv 2 )x + (λvx − βt)v] = α(x − βvt) (2.1)
y′ = α(1 − λv 2 )y (2.2)
z′ = α(1 − λv 2 )z (2.3)
t′ = γ(t − δvx/c2 ). (2.4)
3
This document can be downloaded from http://www.unifiedgrandtours.org. ⃝
c Ian D Lawrie 2012
It may be freely shared, but may not be altered or sold.