Rotation Matrix - Wikipedia
Rotation Matrix - Wikipedia
org/wiki/Rotation_matrix
In linear algebra, a rotation matrix is a matrix that is used to perform a rotation in Euclidean space. For example,
using the convention below, the matrix
rotates points in the xy-Cartesian plane counter-clockwise through an angle θ about the origin of the Cartesian
coordinate system. To perform the rotation using a rotation matrix R, the position of each point must be represented
by a column vector v, containing the coordinates of the point. A rotated vector is obtained by using the matrix
multiplication Rv.
Rotation matrices also provide a means of numerically representing an arbitrary rotation of the axes about the origin,
without appealing to angular specification. These coordinate rotations are a natural way to express the orientation of a
camera, or the attitude of a spacecraft, relative to a reference axes-set. Once an observational platform's local X-Y-Z
axes are expressed numerically as three direction vectors in world coordinates, they together comprise the columns of
the rotation matrix R (world → platform) that transforms directions (expressed in world coordinates) into equivalent
directions expressed in platform-local coordinates.
The examples in this article apply to active rotations of vectors counter-clockwise in a right-handed coordinate
system by pre-multiplication. If any one of these is changed (e.g. rotating axes instead of vectors, i.e. a passive
transformation), then the inverse of the example matrix should be used, which coincides precisely with its transpose.
Since matrix multiplication has no effect on the zero vector (the coordinates of the origin), rotation matrices can only
be used to describe rotations about the origin of the coordinate system. Rotation matrices provide an algebraic
description of such rotations, and are used extensively for computations in geometry, physics, and computer graphics.
Rotation matrices are square matrices, with real entries. More specifically, they can be characterized as orthogonal
matrices with determinant 1; that is, a square matrix R is a rotation matrix if and only if RT = R−1 and det R = 1.
In some literature, the term rotation is generalized to include improper rotations, characterized by orthogonal matrices
with determinant −1 (instead of +1). These combine proper rotations with reflections (which invert orientation). In
other cases, where reflections are not being considered, the label proper may be dropped. This convention is followed
in this article.
The set of all orthogonal matrices of size n with determinant +1 forms a group known as the special orthogonal group
SO(n). The most important special case is that of the rotation group SO(3). The set of all orthogonal matrices of size
n with determinant +1 or -1 forms the (general) orthogonal group O(n).
1 In two dimensions
1.1 Non-standard orientation of the coordinate system
1.2 Common rotations
2 In three dimensions
2.1 Basic rotations
2.2 General rotations
2.3 Conversion from and to axis-angle
2.3.1 Determining the axis
2.3.2 Determining the angle
2.3.3 Rotation matrix from axis and angle
3 Properties of a rotation matrix
4 Examples
5 Geometry
6 Multiplication
7 Ambiguities
8 Decompositions
8.1 Independent planes
8.2 Sequential angles
8.3 Nested dimensions
8.4 Skew parameters via Cayley's formula
8.5 Decomposition into shears
9 Group theory
9.1 Lie group
9.2 Lie algebra
9.3 Exponential map
9.4 Baker–Campbell–Hausdorff formula
9.5 Spin group
9.6 Infinitesimal rotations
10 Conversions
10.1 Quaternion
10.2 Polar decomposition
10.3 Axis and angle
10.4 Euler angles
11 Uniform random rotation matrices
12 See also
13 Remarks
14 Notes
15 References
16 External links
The rotation matrix can be derived from the Euler Formula, by considering
how a complex number is transformed under multiplication by another
complex number of unit modulus. See (Familton, 2015)[1] for a complete
derivation.
.
A counterclockwise rotation of a
vector through angle θ. The vector is
So the coordinates (x',y') of the point (x,y) after rotation are initially aligned with the x-axis.
,
.
The direction of vector rotation is counterclockwise if θ is positive (e.g. 90°), and clockwise if θ is negative (e.g.
−90°). Thus the clockwise rotation matrix is found as
Note that the two-dimensional case is the only non-trivial (i.e. not 1-dimensional) case where the rotation matrices
group is commutative, so that it does not matter in which order multiple rotations are performed. An alternative
convention uses rotating axes,[2] and the above matrices also represent a rotation of the axes clockwise through an
angle θ.
See below for other alternative conventions which may change the sense of
the rotation produced by a rotation matrix.
Common rotations
A rotation through angle θ with
Particularly useful are the matrices for 90°, 180°, and 270° rotations, non-standard axes.
Basic rotations
A basic rotation (also called elemental rotation) is a rotation about one of the axes of a Coordinate system. The
following three basic rotation matrices rotate vectors by an angle θ about the x, y, or z axis, in three dimensions, using
the right-hand rule—which codifies their alternating signs. (The same matrices can also represent a clockwise rotation
of the axes.[nb 1])
A positive 90° rotation around the y-axis (left) after one around the z-axis (middle) gives a 120°
rotation around the main diagonal (right).
In the top left corner are the rotation matrices, in the bottom right corner are the corresponding
permutations of the cube with the origin in its center.
For column vectors, each of these basic vector rotations appears counter-clockwise when the axis about which they
occur points toward the observer, the coordinate system is right-handed, and the angle θ is positive. Rz, for instance,
would rotate toward the y-axis a vector aligned with the x-axis, as can easily be checked by operating with Rz on the
vector (1,0,0):
This is similar to the rotation produced by the above mentioned 2-D rotation matrix. See below for alternative
conventions which may apparently or actually invert the sense of the rotation produced by these matrices.
General rotations
Other rotation matrices can be obtained from these three using matrix multiplication. For example, the product
represents a rotation whose yaw, pitch, and roll angles are α, β and γ, respectively. More formally, it is an intrinsic
rotation whose Tait-Bryan angles are α, β, γ, about axes z, y, x respectively. Similarly, the product
represents an extrinsic rotation whose (improper) Euler angles are α, β, γ about axes x, y, z.
These matrices produce the desired effect only if they are used to pre-multiply column vectors, and (since in general
matrix multiplication isn't commutative) only if they are applied in the specified order (see Ambiguities for more
details).
Every rotation in three dimensions is defined by its axis (a vector along this axis is unchanged by the rotation), and its
angle — the amount of rotation about that axis (Euler rotation theorem).
There are several methods to compute the axis and angle from a rotation matrix (see also axis-angle). Here, we only
describe the method based on the computation of the eigenvectors and eigenvalues of the rotation matrix. It is also
possible to use the trace of the rotation matrix.
since the rotation of u around the rotation axis must result in u. The equation
above may be solved for u which is unique up to a scalar factor unless R = I.
To find the angle of a rotation, once the axis of the rotation is known, select a vector v perpendicular to the axis. Then
the angle of the rotation is the angle between v and Rv.
A more direct method, however, is to simply calculate the trace, i.e., the sum of the diagonal elements of the rotation
matrix. Care should be taken to select the right sign for the angle θ to match the chosen axis:
For some applications, it is helpful to be able to make a rotation with a given axis. Given a unit vector u = (ux, uy, uz),
where ux2 + uy2 + uz2 = 1, the matrix for a rotation by an angle of θ about an axis in the direction of u is[4]
A derivation of this matrix from first principles can be found in section 9.2. here.[5]
where is the cross product matrix of u, is the tensor product and I is the Identity matrix, or alternatively as
where is the Levi-Civita symbol with . This is a matrix form of Rodrigues' rotation formula, (or the
equivalent, differently parameterized Euler–Rodrigues formula) with[6]
If the 3D space is right-handed, this rotation will be counterclockwise when u points towards the observer
(Right-hand rule). Rotations in the counterclockwise (anticlockwise) direction are considered positive rotations.
Note the striking merely apparent differences to the equivalent Lie-algebraic formulation below, #Exponential map.
It follows that:
A rotation is termed proper if and improper if . For even dimensions (n even), the
eigenvalues of a rotation matrix occur as pairs of complex conjugates which are roots of unity and may be written
. Therefore, there may be no set of vectors which are unaffected by the rotation, and thus no axis of rotation. If
there are any real eigenvalues, they will equal unity and will occur in pairs, and the axis of rotation will be an even
dimensional subspace of the whole space. For odd dimensions, there will be an odd number of such eigenvalues, with
at least one eigenvalue being unity, and the axis of rotation will be an odd dimensional subspace of the whole space.
For example, in 2-space (n=2), there are either two complex eigenvalues or two real eigenvalues equal to unity. In the
case of the two unit eigenvalues, the rotation is a null rotation, but otherwise, there is no axis of rotation. In 3-space
(n=3), there will be an axis of rotation (a 1-D manifold, or a line through the origin) or the rotation will be null. In
4-space (n=4), there may be no axes of rotation, or there may be a 2 dimensional axis, a plane through the origin,
called the "axis plane". As always, when all eigenvalues are unity, the rotation is a null rotation.
The trace of a rotation matrix will be equal to the sum of its eigenvalues. For n=2 the two eigenvalues are and
the trace will be where is the rotation angle about the origin. For n=3 the three eigenvalues are 1 and
where is the rotation angle about the axis-line. The trace will be . For n=4, the four eigenvalues are of
the form and and the trace will be . If one of the angles, say , is equal to zero, then the
rotation will be a "simple" rotation, with two unit eigenvalues and the other angle ( ) will be the angle of rotation
about the axis-plane spanned by the two eigenvectors with eigenvalues of unity. Otherwise, there will be no
axis-plane of rotation. If (an "isoclinic" rotation), the eigenvalues will be repeated twice, and every vector
from the origin will be rotated through the angle . The trace will be .
is a rotation matrix, as is the matrix of any even the next two axes 180°, and leaves the last
permutation, and rotates through 120° about the axis coordinate axis unmoved.
x = y = z.
In Euclidean geometry, a rotation is an example of an isometry, a transformation that moves points without changing
the distances between them. Rotations are distinguished from other isometries by two additional properties: they
leave (at least) one point fixed, and they leave "handedness" unchanged. By contrast, a translation moves every point,
a reflection exchanges left- and right-handed ordering, and a glide reflection does both.
A rotation that does not leave "handedness" unchanged is an improper rotation or a rotoinversion.
If a fixed point is taken as the origin of a Cartesian coordinate system, then every point can be given coordinates as a
displacement from the origin. Thus one may work with the vector space of displacements instead of the points
themselves. Now suppose (p1,…,pn) are the coordinates of the vector p from the origin, O, to point P. Choose an
orthonormal basis for our coordinates; then the squared distance to P, by Pythagoras, is
A geometric rotation transforms lines to lines, and preserves ratios of distances between points. From these properties
it can be shown that a rotation is a linear transformation of the vectors, and thus can be written in matrix form, Qp.
The fact that a rotation preserves, not just ratios, but distances themselves, is stated as
or
Because this equation holds for all vectors, p, one concludes that every rotation matrix, Q, satisfies the
orthogonality condition,
Rotations preserve handedness because they cannot change the ordering of the axes, which implies the special
matrix condition,
Equally important, it can be shown that any matrix satisfying these two conditions acts as a rotation.
The inverse of a rotation matrix is its transpose, which is also a rotation matrix:
Noting that any identity matrix is a rotation matrix, and that matrix multiplication is associative, we may summarize
all these properties by saying that the n×n rotation matrices form a group, which for n > 2 is non-abelian, called a
special orthogonal group, and denoted by SO(n), SO(n,R), SOn, or SOn(R), the group of n × n rotation matrices
is isomorphic to the group of rotations in an n-dimensional space. This means that multiplication of rotation matrices
corresponds to composition of rotations, applied in left-to-right order of their corresponding matrices.
represents a counterclockwise rotation of a vector v by an angle θ, or a rotation of CS by the same angle but in
the opposite direction (i.e. clockwise). Alibi and alias transformations are also known as active and passive
transformations, respectively.
Pre-multiplication or post-multiplication
The same point P can be represented either by a column vector v or a row vector w. Rotation matrices can
either pre-multiply column vectors (Rv), or post-multiply row vectors (wR). However, Rv produces a rotation
in the opposite direction with respect to wR. Throughout this article, rotations produced on column vectors are
described by means of a pre-multiplication. To obtain exactly the same rotation (i.e. the same final coordinates
of point P), the row vector must be post-multiplied by the transpose of R (wRT).
Independent planes
so that
Two features are noteworthy. First, one of the roots (or eigenvalues) is 1, which tells us that some direction is
unaffected by the matrix. For rotations in three dimensions, this is the axis of the rotation (a concept that has no
meaning in any other dimension). Second, the other two roots are a pair of complex conjugates, whose product is 1
(the constant term of the quadratic), and whose sum is 2 cos θ (the negated linear term). This factorization is of
interest for 3×3 rotation matrices because the same thing occurs for all of them. (As special cases, for a null rotation
the "complex conjugates" are both 1, and for a 180° rotation they are both −1.) Furthermore, a similar factorization
holds for any n×n rotation matrix. If the dimension, n, is odd, there will be a "dangling" eigenvalue of 1; and for any
dimension the rest of the polynomial factors into quadratic terms like the one here (with the two special cases noted).
We are guaranteed that the characteristic polynomial will have degree n and thus n eigenvalues. And since a rotation
matrix commutes with its transpose, it is a normal matrix, so can be diagonalized. We conclude that every rotation
matrix, when expressed in a suitable coordinate system, partitions into independent rotations of two-dimensional
subspaces, at most n∕2 of them.
The sum of the entries on the main diagonal of a matrix is called the trace; it does not change if we reorient the
coordinate system, and always equals the sum of the eigenvalues. This has the convenient implication for 2×2 and
3×3 rotation matrices that the trace reveals the angle of rotation, θ, in the two-dimensional (sub-)space. For a 2×2
matrix the trace is 2 cos(θ), and for a 3×3 matrix it is 1+2 cos(θ). In the three-dimensional case, the subspace consists
of all vectors perpendicular to the rotation axis (the invariant direction, with eigenvalue 1). Thus we can extract from
any 3×3 rotation matrix a rotation axis and an angle, and these completely determine the rotation.
Sequential angles
The constraints on a 2×2 rotation matrix imply that it must have the form
with a2+b2 = 1. Therefore we may set a = cos θ and b = sin θ, for some angle θ. To solve for θ it is not enough to look
at a alone or b alone; we must consider both together to place the angle in the correct quadrant, using a two-argument
arctangent function.
Although a2+b2 will probably not equal 1, but some value r2 < 1, we can use a slight variation of the previous
computation to find a so-called Givens rotation that transforms the column to
zeroing b. This acts on the subspace spanned by the x and y axes. We can then repeat the process for the xz subspace
to zero c. Acting on the full matrix, these two rotations produce the schematic form
Shifting attention to the second column, a Givens rotation of the yz subspace can now zero the z value. This brings the
full matrix to the form
entries below the diagonal to zero. We can zero them by extending the same idea of stepping through the columns
with a series of rotations in a fixed sequence of planes. We conclude that the set of n×n rotation matrices, each of
which has n2 entries, can be parameterized by n(n−1)/2 angles.
In three dimensions this restates in matrix form an observation made by Euler, so xzxw xzyw xyxw xyzw
mathematicians call the ordered sequence of three angles Euler angles. However, the
situation is somewhat more complicated than we have so far indicated. Despite the small yxyw yxzw yzyw yzxw
dimension, we actually have considerable freedom in the sequence of axis pairs we use;
zyzw zyxw zxzw zxyw
and we also have some freedom in the choice of angles. Thus we find many different
conventions employed when three-dimensional rotations are parameterized for physics, or xzxb yzxb xyxb zyxb
medicine, or chemistry, or other disciplines. When we include the option of world axes or
body axes, 24 different sequences are possible. And while some disciplines call any yxyb zxyb yzyb xzyb
sequence Euler angles, others give different names (Euler, Cardano, Tait-Bryan,
zyzb xyzb zxzb yxzb
Roll-pitch-yaw) to different sequences.
One reason for the large number of options is that, as noted previously, rotations in three dimensions (and higher) do
not commute. If we reverse a given sequence of rotations, we get a different outcome. This also implies that we
cannot compose two rotations by adding their corresponding angles. Thus Euler angles are not vectors, despite a
similarity in appearance as a triple of numbers.
Nested dimensions
This is no illusion; not just one, but many, copies of n-dimensional rotations are found within (n+1)-dimensional
rotations, as subgroups. Each embedding leaves one direction fixed, which in the case of 3×3 matrices is the rotation
axis. For example, we have
fixing the x axis, the y axis, and the z axis, respectively. The rotation axis need not be a coordinate axis; if u = (x,y,z)
is a unit vector in the desired direction, then
A direction in (n+1)-dimensional space will be a unit magnitude vector, which we may consider a point on a
generalized sphere, Sn. Thus it is natural to describe the rotation group SO(n+1) as combining SO(n) and Sn. A
suitable formalism is the fiber bundle,
where for every direction in the "base space", Sn, the "fiber" over it in the "total space", SO(n+1), is a copy of the
"fiber space", SO(n), namely the rotations that keep that direction fixed.
Thus we can build an n×n rotation matrix by starting with a 2×2 matrix, aiming its fixed axis on S2 (the ordinary
sphere in three-dimensional space), aiming the resulting rotation on S3, and so on up through Sn−1. A point on Sn can
be selected using n numbers, so we again have n(n−1)/2 numbers to describe any n×n rotation matrix.
In fact, we can view the sequential angle decomposition, discussed previously, as reversing this process. The
composition of n−1 Givens rotations brings the first column (and row) to (1,0,…,0), so that the remainder of the
matrix is a rotation matrix of dimension one less, embedded so as to leave (1,0,…,0) fixed.
When an n×n rotation matrix Q, does not include a −1 eigenvalue, thus none of the planar rotations which it
comprises are 180° rotations, then Q+I is an invertible matrix. Most rotation matrices fit this description, and for them
it can be shown that (Q−I)(Q+I)−1 is a skew-symmetric matrix, A. Thus AT = −A; and since the diagonal is necessarily
zero, and since the upper triangle determines the lower one, A contains n(n−1)/2 independent numbers.
Conveniently, I−A is invertible whenever A is skew-symmetric; thus we can recover the original matrix using the
Cayley transform,
which maps any skew-symmetric matrix A to a rotation matrix. In fact, aside from the noted exceptions, we can
produce any rotation matrix in this way. Although in practical applications we can hardly afford to ignore 180°
rotations, the Cayley transform is still a potentially useful tool, giving a parameterization of most rotation matrices
without trigonometric functions.
If we condense the skew entries into a vector, (x,y,z), then we produce a 90° rotation around the x axis for (1,0,0),
around the y axis for (0,1,0), and around the z axis for (0,0,1). The 180° rotations are just out of reach; for, in the limit
as x goes to infinity, (x,0,0) does approach a 180° rotation around the x axis, and similarly for other directions.
For the 2D case, a rotation matrix can be decomposed into three shear matrices (Paeth 1986):
This is useful, for instance, in computer graphics, since shears can be implemented with fewer multiplication
instructions than rotating a bitmap directly. On modern computers, this may not matter, but it can be relevant for very
old or low-end microprocessors.
A rotation can also be written as two shears and scaling (Daubechies & Sweldens 1998):
Below follow some skeletal facts about rôle of the collection of all rotation matrices of a fixed dimension (here
mostly 3) in mathematics and particularly in physics where rotational symmetry is a requirement of every truly
fundamental law (due to the assumption of isotropy of space), and where the same symmetry, when present, is a
simplifying property of many problems of less fundamental nature. Examples abound in classical mechanics and
quantum mechanics. Knowledge of the part of the solutions pertaining to this symmetry applies (with qualifications)
to all such problems and it can be factored out of a specific problem at hand, thus reducing its complexity. A prime
example – in mathematics and physics – would be the theory of spherical harmonics. Their role in the group theory of
the rotation groups is that of being a representation space for the entire set of finite-dimensional irreducible
representations of the rotation group SO(3). For this topic, see Rotation group SO(3) § Spherical harmonics.
The main articles listed in each subsection are referred to for more detail.
Lie group
The n × n rotation matrices for each n form a group, the special orthogonal group, SO(n). This algebraic structure is
coupled with a topological structure inherited from GLn(ℝ) in such a way that the operations of multiplication and
taking the inverse are analytic functions of the matrix entries. Thus SO(n) is for each n a Lie group. It is compact and
connected, but not simply connected. It is also semi-simple group, in fact simple group with the exception SO(4).[7]
The relevance of this is that all theorems and all machinery from the theory of analytic manifolds (analytic manifolds
are in particular smooth manifolds) apply and the well-developed representation theory of compact semi-simple
groups is ready for use.
Lie algebra
and is the space of skew-symmetric matrices of dimension n, see classical group, where o(n) is the Lie algebra of
O(n), the orthogonal group. For reference, the most common basis for so(3) is
Exponential map
Connecting the Lie algebra to the Lie group is the exponential map, which is defined using the standard matrix
exponential series for eA[8] For any skew-symmetric matrix A, exp(A) is always a rotation matrix.[nb 2]
An important practical example is the 3 × 3 case. In rotation group SO(3), it is shown that one can identify every
A ∈ so(3) with an Euler vector ω = θ u, where u = (x,y,z) is a unit magnitude vector.
By the properties of the identification su(2) ≅ ℝ3, u is in the null space of A. Thus, u is left invariant by exp(A)
and is hence a rotation axis.
Using Rodrigues' rotation formula on matrix form with θ = θ∕2 + θ∕2, together with standard double angle formulae
one obtains,
This is the matrix for a rotation around axis u by the angle θ in half-angle form. For full detail, see exponential map
SO(3).
Baker–Campbell–Hausdorff formula
The BCH formula provides an explicit expression for Z = log(eXeY) in terms of a series expansion of nested
commutators of X and Y.[9] This general expansion unfolds as[nb 3]
for suitable trigonometric function coefficients, detailed in the Baker–Campbell–Hausdorff formula for SO(3).
As a group identity, the above holds for all faithful representations, including the doublet (spinor representation),
which is simpler. The same explicit formula thus follows straightforwardly through Pauli matrices; see the 2×2
derivation for SU(2). For the general n×n case, one might use Ref.[11]
Spin group
The Lie group of n×n rotation matrices, SO(n), is not simply connected, so Lie theory tells us it is a homomorphic
image of a universal covering group. Often the covering group, which in this case is called the spin group denoted by
Spin(n), is simpler and more natural to work with.[12]
In the case of planar rotations, SO(2) is topologically a circle, S1. Its universal covering group, Spin(2), is isomorphic
to the real line, R, under addition. Whenever angles of arbitrary magnitude are used one is taking advantage of the
convenience of the universal cover. Every 2 × 2 rotation matrix is produced by a countable infinity of angles,
separated by integer multiples of 2π. Correspondingly, the fundamental group of SO(2) is isomorphic to the integers,
Z.
In the case of spatial rotations, SO(3) is topologically equivalent to three-dimensional real projective space, RP3. Its
universal covering group, Spin(3), is isomorphic to the 3-sphere, S3. Every 3 × 3 rotation matrix is produced by
two opposite points on the sphere. Correspondingly, the fundamental group of SO(3) is isomorphic to the
two-element group, Z2.
We can also describe Spin(3) as isomorphic to quaternions of unit norm under multiplication, or to certain 4 × 4
real matrices, or to 2 × 2 complex special unitary matrices, namely SU(2). The covering maps for the first and the
last case are given by
and
For a detailed account of the SU(2)-covering and the quaternionic covering, see spin group SO(3).
Many features of these cases are the same for higher dimensions. The coverings are all two-to-one, with SO(n),
n > 2, having fundamental group Z2. The natural setting for these groups is within a Clifford algebra. One type of
action of the rotations is produced by a kind of "sandwich", denoted by qvq∗. More importantly in applications to
physics, the corresponding spin representation of the Lie algebra sits inside the Clifford algebra. It can be
exponentiated in the usual way to give rise to a 2-valued representation, a k a projective representation of the rotation
group. This is the case with SO(3) and SU(2), where the 2-valued representation can be viewed as an "inverse" of
the covering map. By properties of covering maps, the inverse can be chosen ono-to-one as a local section, but not
globally.
Infinitesimal rotations
The matrices in the Lie algebra are not themselves rotations; the skew-symmetric matrices are derivatives,
proportional differences of rotations. An actual "differential rotation", or infinitesimal rotation matrix has the form
The computation rules are as usual except that infinitesimals of second order are routinely dropped. With these rules,
these matrices do not satisfy all the same properties as ordinary finite rotation matrices under the usual treatment of
infinitesimals.[13] It turns out that the order in which infinitesimal rotations are applied is irrelevant. To see this
exemplified, consult infinitesimal rotations SO(3).
We have seen the existence of several decompositions that apply in any dimension, namely independent planes,
sequential angles, and nested dimensions. In all these cases we can either decompose a matrix or construct one. We
have also given special attention to 3×3 rotation matrices, and these warrant further attention, in both directions
(Stuelpnagel 1964).
Quaternion
Given the unit quaternion q = w + xi + yj + zk, the equivalent left-handed (Post-Multiplied) 3×3 rotation matrix is
Now every quaternion component appears multiplied by two in a term of degree two, and if all such terms are zero
what's left is an identity matrix. This leads to an efficient, robust conversion from any quaternion – whether unit or
non-unit – to a 3×3 rotation matrix.
n = w * w + x * x + y * y + z * z
s = if n == 0 then 0 else 2 / n
wx = s * w * x, wy = s * w * y, wz = s * w * z
xx = s * x * x, xy = s * x * y, xz = s * x * z
yy = s * y * y, yz = s * y * z, zz = s * z * z
[ 1 - (yy + zz) xy - wz xz + wy ]
[ xy + wz 1 - (xx + zz) yz - wx ]
[ xz - wy yz + wx 1 - (xx + yy) ]
Freed from the demand for a unit quaternion, we find that nonzero quaternions act as homogeneous coordinates for
3×3 rotation matrices. The Cayley transform, discussed earlier, is obtained by scaling the quaternion so that its w
component is 1. For a 180° rotation around any axis, w will be zero, which explains the Cayley limitation.
The sum of the entries along the main diagonal (the trace), plus one, equals 4−4(x2+y2+z2), which is 4w2. Thus we
can write the trace itself as 2w2+2w2−1; and from the previous version of the matrix we see that the diagonal entries
themselves have the same form: 2x2+2w2−1, 2y2+2w2−1, and 2z2+2w2−1. So we can easily compare the magnitudes
of all four quaternion components using the matrix diagonal. We can, in fact, obtain all four magnitudes using sums
and square roots, and choose consistent signs using the skew-symmetric part of the off-diagonal entries.
t = Qxx+Qyy+Qzz (trace of Q)
r = sqrt(1+t)
w = 0.5*r
x = copysign(0.5*sqrt(1+Qxx-Qyy-Qzz), Qzy-Qyz)
y = copysign(0.5*sqrt(1-Qxx+Qyy-Qzz), Qxz-Qzx)
z = copysign(0.5*sqrt(1-Qxx-Qyy+Qzz), Qyx-Qxy)
t = Qxx+Qyy+Qzz
r = sqrt(1+t)
s = 0.5/r
w = 0.5*r
x = (Qzy-Qyz)*s
y = (Qxz-Qzx)*s
z = (Qyx-Qxy)*s
This is numerically stable so long as the trace, t, is not negative; otherwise, we risk dividing by (nearly) zero. In that
case, suppose Qxx is the largest diagonal entry, so x will have the largest magnitude (the other cases are similar); then
the following is safe.
t = Qxx+Qyy+Qzz
r = sqrt(1+Qxx-Qyy-Qzz)
s = 0.5/r
w = (Qzy-Qyz)*s
x = 0.5*r
y = (Qxy+Qyx)*s
z = (Qzx+Qxz)*s
If the matrix contains significant error, such as accumulated numerical error, we may construct a symmetric 4×4
matrix,
and find the eigenvector, (x,y,z,w), of its largest magnitude eigenvalue. (If Q is truly a rotation matrix, that value will
be 1.) The quaternion so obtained will correspond to the rotation matrix closest to the given matrix(Bar-Itzhack
2000).
Polar decomposition
If the n×n matrix M is non-singular, its columns are linearly independent vectors; thus the Gram–Schmidt process can
adjust them to be an orthonormal basis. Stated in terms of numerical linear algebra, we convert M to an orthogonal
matrix, Q, using QR decomposition. However, we often prefer a Q "closest" to M, which this method does not
accomplish. For that, the tool we want is the polar decomposition (Fan & Hoffman 1955; Higham 1989).
To measure closeness, we may use any matrix norm invariant under orthogonal transformations. A convenient choice
is the Frobenius norm, ||Q−M||F, squared, which is the sum of the squares of the element differences. Writing this in
terms of the trace, Tr, our goal is,
Though written in matrix terms, the objective function is just a quadratic polynomial. We can minimize it in the usual
way, by finding where its derivative is zero. For a 3×3 matrix, the orthogonality constraint implies six scalar
equalities that the entries of Q must satisfy. To incorporate the constraint(s), we may employ a standard technique,
Lagrange multipliers, assembled as a symmetric matrix, Y. Thus our method is:
Differentiate Tr( (Q−M)T(Q−M) + (QTQ−I)Y ) with respect to (the entries of) Q, and equate to zero.
so that
Taking the derivative with respect to Qxx, Qxy, Qyx, Qyy in turn, we assemble a matrix.
where Q is orthogonal and S is symmetric.
To ensure a minimum, the Y matrix (and
hence S) must be positive definite. Linear
algebra calls QS the polar decomposition of
M, with S the positive square root of S2 =
MTM.
When M is non-singular, the Q and S factors of the polar decomposition are uniquely determined. However, the
determinant of S is positive because S is positive definite, so Q inherits the sign of the determinant of M. That is, Q is
only guaranteed to be orthogonal, not a rotation matrix. This is unavoidable; an M with negative determinant has no
uniquely defined closest rotation matrix.
To efficiently construct a rotation matrix Q from an angle θ and a unit axis u, we can take advantage of symmetry and
skew-symmetry within the entries. If x, y, and z are the components of the unit vector representing the axis, and
then
Determining an axis and angle, like determining a quaternion, is only possible up to sign; that is, (u,θ) and (−u,−θ)
correspond to the same rotation matrix, just like q and −q. As well, axis-angle extraction presents additional
difficulties. The angle can be restricted to be from 0° to 180°, but angles are formally ambiguous by multiples of
360°. When the angle is zero, the axis is undefined. When the angle is 180°, the matrix becomes symmetric, which
has implications in extracting the axis. Near multiples of 180°, care is needed to avoid numerical problems: in
extracting the angle, a two-argument arctangent with atan2(sin θ,cos θ) equal to θ avoids the insensitivity of
arccosine; and in computing the axis magnitude in order to force unit magnitude, a brute-force approach can lose
accuracy through underflow (Moler & Morrison 1983).
The x, y, and z components of the axis would then be divided by r. A fully robust approach will use different code
when t, the trace of the matrix Q, is negative, as with quaternion extraction. When r is zero because the angle is zero,
an axis must be provided from some source other than the matrix.
Euler angles
Complexity of conversion escalates with Euler angles (used here in the broad sense). The first difficulty is to establish
which of the twenty-four variations of Cartesian axis order we will use. Suppose the three angles are θ1, θ2, θ3;
physics and chemistry may interpret these as
One systematic approach begins with choosing the right-most axis. Among all permutations of (x,y,z), only two place
that axis first; one is an even permutation and the other odd. Choosing parity thus establishes the middle axis. That
leaves two choices for the left-most axis, either duplicating the first or not. These three choices gives us 3×2×2 = 12
variations; we double that to 24 by choosing static or rotating axes.
This is enough to construct a matrix from angles, but triples differing in many ways can give the same rotation
matrix. For example, suppose we use the zyz convention above; then we have the following equivalent pairs:
Angles for any order can be found using a concise common routine (Herter & Lott 1993; Shoemake 1994).
The problem of singular alignment, the mathematical analog of physical gimbal lock, occurs when the middle rotation
aligns the axes of the first and last rotations. It afflicts every axis order at either even or odd multiples of 90°. These
singularities are not characteristic of the rotation matrix as such, and only occur with the usage of Euler angles.
The singularities are avoided when considering and manipulating the rotation matrix as orthonormal row vectors (in
3D applications often named 'right'-vector, 'up'-vector and 'out'-vector) instead of as angles. The singularities are also
avoided when working with quaternions.
We sometimes need to generate a uniformly distributed random rotation matrix. It seems intuitively clear in two
dimensions that this means the rotation angle is uniformly distributed between 0 and 2π. That intuition is correct, but
does not carry over to higher dimensions. For example, if we decompose 3×3 rotation matrices in axis-angle form, the
angle should not be uniformly distributed; the probability that (the magnitude of) the angle is at most θ should be
1∕ (θ − sin θ), for 0 ≤ θ ≤ π.
π
Since SO(n) is a connected and locally compact Lie group, we have a simple standard criterion for uniformity,
namely that the distribution be unchanged when composed with any arbitrary rotation (a Lie group "translation").
This definition corresponds to what is called Haar measure. León, Massé & Rivest (2006) show how to use the
Cayley transform to generate and test matrices according to this criterion.
We can also generate a uniform distribution in any dimension using the subgroup algorithm of Diaconis &
Shashahani (1987). This recursively exploits the nested dimensions group structure of SO(n), as follows. Generate a
uniform angle and construct a 2×2 rotation matrix. To step from n to n+1, generate a vector v uniformly distributed
on the n-sphere, Sn, embed the n×n matrix in the next larger size with last column (0,…,0,1), and rotate the larger
matrix so the last column becomes v.
As usual, we have special alternatives for the 3×3 case. Each of these methods begins with three independent random
scalars uniformly distributed on the unit interval. Arvo (1992) takes advantage of the odd dimension to change a
Householder reflection to a rotation by negation, and uses that to aim the axis of a uniform planar rotation.
Another method uses unit quaternions. Multiplication of rotation matrices is homomorphic to multiplication of
quaternions, and multiplication by a unit quaternion rotates the unit sphere. Since the homomorphism is a local
isometry, we immediately conclude that to produce a uniform distribution on SO(3) we may use a uniform
distribution on S3. In practice: create a 4 element vector where each element is a sampling of a normal distribution.
Normalize its length and you have a uniformly sampled random unit quaternion which represents a uniformly
sampled random rotation.
Euler angles can also be used, though not with each angle uniformly distributed (Murnaghan 1962; Miles 1965).
For the axis-angle form, the axis is uniformly distributed over the unit sphere of directions, S2, while the angle has the
non-uniform distribution over [0,π] noted previously (Miles 1965).
1. Note that if instead of rotating vectors, it is the reference frame that is being rotated, the signs on the sinθ terms will be
reversed. If reference frame A is rotated anti-clockwise about the origin through an angle θ to create reference frame B,
then R_x (with the signs flipped) will transform a vector described in reference frame A coordinates to reference frame B
coordinates.
2. Note that this exponential map of skew-symmetric matrices to rotation matrices is quite different from the Cayley transform
discussed earlier, differing to 3rd order,
Conversely, a skew-symmetric matrix A specifying a rotation matrix through the Cayley map specifies the same rotation
matrix through the map exp(2arctanhA).
3. For a detailed derivation, see Derivative of the exponential map. Issues of convergence of this series to the right element of
the Lie algebra are here swept under the carpet. Convergence is guaranteed when ||X|| + ||Y|| < log 2 and ||Z|| < log 2. If
these conditions aren't fulfilled, the series may still converge. A solution always exists since exp is onto in the cases under
consideration.
1. Familton, Johannes C. (2015). "Quaternions: a history of complex noncommutative rotation groups in theoretical physics".
Arvo, James (1992), "Fast random rotation matrices", in David Kirk, Graphics Gems III
(http://www.graphicsgems.org/), San Diego: Academic Press Professional, pp. 117–120,
ISBN 978-0-12-409671-4
Baker, Andrew (2003), Matrix Groups: An Introduction to Lie Group Theory, Springer,
ISBN 978-1-85233-470-3
Bar-Itzhack, Itzhack Y. (Nov–Dec 2000), "New method for extracting the quaternion from a rotation matrix",
AIAA Journal of Guidance, Control and Dynamics, 23 (6): 1085–1087, doi:10.2514/2.4654 (https://doi.org
/10.2514%2F2.4654), ISSN 0731-5090 (https://www.worldcat.org/issn/0731-5090)
Björck, Åke; Bowie, Clazett (June 1971), "An iterative algorithm for computing the best estimate of an
orthogonal matrix", SIAM Journal on Numerical Analysis, 8 (2): 358–364, doi:10.1137/0708036
(https://doi.org/10.1137%2F0708036), ISSN 0036-1429 (https://www.worldcat.org/issn/0036-1429)
Cayley, Arthur (1846), "Sur quelques propriétés des déterminants gauches", Journal für die reine und
angewandte Mathematik, 32: 119–123, doi:10.1515/crll.1846.32.119 (https://doi.org
/10.1515%2Fcrll.1846.32.119), ISSN 0075-4102 (https://www.worldcat.org/issn/0075-4102); reprinted as
article 52 in Cayley, Arthur (1889), The collected mathematical papers of Arthur Cayley
(http://www.hti.umich.edu/cgi/t/text/pageviewer-idx?c=umhistmath;cc=umhistmath;rgn=full%20text;
idno=ABS3153.0001.001;didno=ABS3153.0001.001;view=image;seq=00000349), I (1841–1853), Cambridge
University Press, pp. 332–336
Diaconis, Persi; Shahshahani, Mehrdad (1987), "The subgroup algorithm for generating uniform random
variables", Probability in the Engineering and Informational Sciences, 1: 15–32,
doi:10.1017/S0269964800000255 (https://doi.org/10.1017%2FS0269964800000255), ISSN 0269-9648
(https://www.worldcat.org/issn/0269-9648)
Engø, Kenth (June 2001), "On the BCH-formula in so(3)" (http://www.ii.uib.no/publikasjoner/texrap/abstract
/2000-201.html), BIT Numerical Mathematics, 41 (3): 629–632, doi:10.1023/A:1021979515229 (https://doi.org
/10.1023%2FA%3A1021979515229), ISSN 0006-3835 (https://www.worldcat.org/issn/0006-3835)
Fan, Ky; Hoffman, Alan J. (February 1955), "Some metric inequalities in the space of matrices", Proc. AMS,
Proceedings of the American Mathematical Society, Vol. 6, No. 1, 6 (1): 111–116, doi:10.2307/2032662
(https://doi.org/10.2307%2F2032662), ISSN 0002-9939 (https://www.worldcat.org/issn/0002-9939),
JSTOR 2032662 (https://www.jstor.org/stable/2032662)
Fulton, William; Harris, Joe (1991), Representation Theory: A First Course, GTM, 129, New York, Berlin,
Heidelberg: Springer, ISBN 978-0-387-97495-8, MR 1153249 (https://www.ams.org/mathscinet-
getitem?mr=1153249)
Goldstein, Herbert; Poole, Charles P.; Safko, John L. (2002), Classical Mechanics (third ed.), Addison Wesley,
ISBN 978-0-201-65702-9
Hall, Brian C. (2004), Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, Springer,
ISBN 978-0-387-40122-5 (GTM 222)
Herter, Thomas; Lott, Klaus (September–October 1993), "Algorithms for decomposing 3-D orthogonal
matrices into primitive rotations", Computers & Graphics, 17 (5): 517–527,
doi:10.1016/0097-8493(93)90003-R (https://doi.org/10.1016%2F0097-8493%2893%2990003-R),
ISSN 0097-8493 (https://www.worldcat.org/issn/0097-8493)
Higham, Nicholas J. (October 1, 1989), "Matrix nearness problems and applications", in Gover, Michael J. C.;
Barnett, Stephen, Applications of Matrix Theory (http://scholar.google.co.uk/scholar?hl=en&
lr=&q=author%3AHigham+intitle%3AApplications+of+Matrix+Theory&as_publication=&as_ylo=1989&
as_yhi=1989&btnG=Search), Oxford University Press, pp. 1–27, ISBN 978-0-19-853625-3
León, Carlos A.; Massé, Jean-Claude; Rivest, Louis-Paul (February 2006), "A statistical model for random
rotations" (http://www.mat.ulaval.ca/pages/lpr/), Journal of Multivariate Analysis, 97 (2): 412–430,
doi:10.1016/j.jmva.2005.03.009 (https://doi.org/10.1016%2Fj.jmva.2005.03.009), ISSN 0047-259X
(https://www.worldcat.org/issn/0047-259X)
Miles, Roger E. (December 1965), "On random rotations in R3", Biometrika, Biometrika, Vol. 52, No. 3/4, 52
(3/4): 636–639, doi:10.2307/2333716 (https://doi.org/10.2307%2F2333716), ISSN 0006-3444
(https://www.worldcat.org/issn/0006-3444), JSTOR 2333716 (https://www.jstor.org/stable/2333716)
Moler, Cleve; Morrison, Donald (1983), "Replacing square roots by pythagorean sums"
(http://domino.watson.ibm.com/tchjr/journalindex.nsf/0b9bc46ed06cbac1852565e6006fe1a0
/0043d03ee1c1013c85256bfa0067f5a6?OpenDocument), IBM Journal of Research and Development, 27 (6):
577–581, doi:10.1147/rd.276.0577 (https://doi.org/10.1147%2Frd.276.0577), ISSN 0018-8646
(https://www.worldcat.org/issn/0018-8646)
Murnaghan, Francis D. (1950), "The element of volume of the rotation group" (http://www.pnas.org/content
/vol36/issue11/), Proceedings of the National Academy of Sciences, 36 (11): 670–672,
doi:10.1073/pnas.36.11.670 (https://doi.org/10.1073%2Fpnas.36.11.670), ISSN 0027-8424
(https://www.worldcat.org/issn/0027-8424)
Murnaghan, Francis D. (1962), The Unitary and Rotation Groups, Lectures on applied mathematics,
Washington: Spartan Books
Cayley, Arthur (1889), The collected mathematical papers of Arthur Cayley (http://www.hti.umich.edu/cgi/t
/text/pageviewer-idx?c=umhistmath;cc=umhistmath;rgn=full%20text;idno=ABS3153.0001.001;
didno=ABS3153.0001.001;view=image;seq=00000349), I (1841–1853), Cambridge University Press,
pp. 332–336
Paeth, Alan W. (1986), "A Fast Algorithm for General Raster Rotation" (http://www.cipprs.org/papers
/VI/VI1986/pp077-081-Paeth-1986.pdf) (PDF), Proceedings, Graphics Interface '86: 77–81
Daubechies, Ingrid; Sweldens, Wim (1998), "Factoring wavelet transforms into lifting steps" (http://cm-
bell-labs.github.io/who/wim/papers/factor/factor.pdf) (PDF), Journal of Fourier Analysis and Applications:
247–269
Pique, Michael E. (1990), "Rotation Tools", in Andrew S. Glassner, Graphics Gems
(http://www.graphicsgems.org/), San Diego: Academic Press Professional, pp. 465–469,
ISBN 978-0-12-286166-6
Press, William H.; Teukolsky, Saul A.; Vetterling, William T.; Flannery, Brian P. (2007), "Section 21.5.2.
Picking a Random Rotation Matrix" (http://apps.nrbook.com/empanel/index.html#pg=1130), Numerical
Recipes: The Art of Scientific Computing (3rd ed.), New York: Cambridge University Press,
ISBN 978-0-521-88068-8
Shepperd, Stanley W. (May–June 1978), "Quaternion from rotation matrix", AIAA Journal of Guidance,
Control and Dynamics, 1 (3): 223–224, ISSN 0731-5090 (https://www.worldcat.org/issn/0731-5090)
Shoemake, Ken (1994), "Euler angle conversion", in Paul Heckbert, Graphics Gems IV
(http://www.graphicsgems.org/), San Diego: Academic Press Professional, pp. 222–229,
ISBN 978-0-12-336155-4
Stuelpnagel, John (October 1964), "On the parameterization of the three-dimensional rotation group", SIAM
Review, 6 (4): 422–430, doi:10.1137/1006093 (https://doi.org/10.1137%2F1006093), ISSN 0036-1445
(https://www.worldcat.org/issn/0036-1445) (Also NASA-CR-53568 (http://ntrs.nasa.gov/search.jsp).)
Varadarajan, Veeravalli S. (1984), Lie Groups, Lie Algebras, and Their Representation, Springer,
ISBN 978-0-387-90969-1 (GTM 102)
Wedderburn, Joseph H. M. (1934), Lectures on Matrices (http://scholar.google.co.uk/scholar?hl=en&
lr=&q=author%3AWedderburn+intitle%3ALectures+on+Matrices&as_publication=&as_ylo=1934&