Vectors

A vector is an arrow with a length and a direction. Just like positions, vectors exist before we measure or describe them. Unlike positions, vectors can mean many different things, such as position vectors, velocities, etc. Vectors are not anchored to particular positions in space, so we can slide a vector around and locate it at any position.

Change:

Two vectors, which may or may not be the same vector. Moving a vector around does not change it: it is still the same vector.

Notation

Some textbooks differentiate between free vectors, which are free to slide around, and bound vectors, which are anchored in space. We will only use free vectors.

We will use the over-arrow notation $ \vec{a} $ for vector quantities. Other common notations include bold $ \boldsymbol{a} $ and under-bars $ \underline{a} $. For unit (length one) vectors we will use an over-hat $ \hat{a} $.

Cartesian coordinate system

Cartesian coordinates (also known as rectangular coordinates) are the regular $x,y$ or $x,y,z$ coordinates in 2D and 3D, with perpendicular straight-line axes.

coordinate	name	range	definition
$x$	horizontal	$-\infty \lt x \lt \infty$	horizontal distance from the $y$-axis
$y$	vertical	$-\infty \lt y \lt \infty$	vertical distance from the $x$-axis

In 3D the $x$,$y$,$z$ axes are oriented in right-handed order, so that fingers curling from $x$ to $y$ means the thumb points in the $z$ direction.

Cartesian coordinate system, showing the origin $O$ and three positions with their coordinates.

Notation note

When writing coordinates we can either list the coordinate values explicitly, like $x = 2$, $y = 1$, or we can just write the numbers in an ordered list, like $ (2, 1) $.

Writing them as a list assumes that we know which coordinate system we are using, and which order the coordinates are in. We will typically write the coordinate values explicitly to avoid confusion. This also applies to vector components.

Did you know?

While the idea of $x,y$ coordinates specifying positions might seem obvious, the use of algebraic quantities to define geometry in this way was a major discovery of René Descartes and a key event in the scientific revolution. Descartes was also instrumental in establishing the philosophical groundwork of The Enlightenment, and we owe to him the famous phrase “Cogito ergo sum”.

Polar coordinate system

Polar coordinates are an alternative 2D coordinate system that are useful for rotating or circular motion.

coordinate	name	range	definition
$r$	radius	$0 \le r \lt \infty$	distance from the origin $O$
$\theta$	angle	$-\pi \lt \theta \le \pi$	angle from the $x$-axis in the $x$–$y$ plane

The two common extensions of polar coordinates to 3D are cylindrical coordinates and spherical coordinates.

Polar coordinate system, showing the origin $O$ and three positions with their coordinates.

To define polar coordinates, we relate them to another known coordinate system, typically Cartesian coordinates as follows, where we use the atan2 function.

Conversion between polar and Cartesian coordinates #rvp-ep

$$ \begin{aligned}x &= r \cos\theta& r &= \sqrt{x^2 + y^2} \\y &= r \sin\theta& \theta &= \operatorname{atan2}(y, x)\end{aligned} $$

Did you know?

Angles can be measured in many different units. While radians are the most mathematically elegant unit, their use dates only from Roger Cotes in the 18th century. Degrees are a much older unit, dating back to the sexagesimal number system of the ancient Babylonians, and are convenient for celestial calculations because there are approximately 360 days in one year. Other units include grads (100 grads in a right angle) and mils (various definitions, all approximately equal to a milliradian).

Notation note

When solving for $\theta$ in terms of $x$ and $y$ we know that $\tan\theta = y / x$. However, this does not necessary mean that $ \theta = \tan^{-1}(y / x) $, because $\tan$ is not a one-to-one function. For example, $ \tan^{-1}(1/1) $ is the same as $ \tan^{-1}(-1/-1) $. We thus need to take $ \theta = \tan^{-1}(y/x) + \pi $ if the point is in quadrants II or III.

An alternative is to use the atan2 function, which takes $x$ and $y$ as separate arguments and correctly handles the conversion to $\theta$ in any quadrant. This function is available in all common programming languages, including C, MATLAB, Python, and others.

Scientific calculators typically have special commands to convert to/from polar and Cartesian coordinates, which also automatically correctly handle the different quadrants.

Units

When using coordinates to describe the location of physical positions in the real world, we must always have appropriate units of length for the coordinates. For example, the following coordinates all specify the same position:

$$ \begin{aligned} x &= 2{\rm\ m} & x &= 6.56{\rm\ ft} & r &= 6.32{\rm\ m} & r &= 249{\rm\ in} \\ y &= -6{\rm\ m} & y &= -6 \times 10^6{\rm\ \mu m} & \theta &= 1.25{\rm\ rad} & \theta &= 71.6^\circ. \end{aligned} $$

It is important to distinguish between units and dimensions. The base dimensions include mass (M), length (L), and time (T), and can be combined to produce dimensions such as L/T or L T$ ^{-1} $ for velocity. Each physical quantity such as velocity has just one set of dimensions, and there is no choice in this.

Units are particular ways of measuring dimensional quantities, and include the SI units kilogram (kg), meter (m), and second (s), as well as the customary U.S. units pound (lb) and foot (ft). A given physical quantity can be written in terms of many different units, although all of these must conform to the dimensions of the quantity. For example, velocity has dimension L/T and so it can be written in terms of the units m/s, ft/s, km/h, or many other choices.

Angles are an example of a dimensionless quantity (having dimension 1), but they still have units, typically either radians (rad) or degrees (°).

Did you know?

The SI unit system is the definitive system of measurement used in science and engineering. Other systems, such as U.S. customary units are defined in terms of SI units, so that an inch is defined to be exactly 2.54 cm, for example.

The National Institute of Standards and Technology (NIST) maintains an excellent reference guide to SI units, including precise rules and style conventions for writing units in scientific and engineering work.

Unit vectors

A unit vector is any vector with a length of one. We use the special over-hat notation $ \hat{a} $ to indicate when a vector is a unit vector. Any non-zero vector $ \vec{a} $ gives a unit vector $ \hat{a} $ that specifies the direction of $ \vec{a} $.

Normalization to unit vector. #rvv-eu

$$ \begin{aligned} \hat{a} =\frac{\vec{a}}{a}\end{aligned} $$

If we compute the length of $ \hat{a} $ then we find:

$$ \| \hat{a} \| = \left\| \frac{\vec{a}}{a} \right\| = \frac{\|\vec{a}\|}{a} = \frac{a}{a} = 1, $$

so $ \hat{a} $ is really a unit vector, and it is in the same direction as $ \vec{a} $ as they differ only by a scalar factor.

Any vector can be written as the product of its length and direction:

Vector decomposition into length and direction. #rvv-ei

$$ \begin{aligned} \vec{a} = a\hat{a}\end{aligned} $$

This follows from rearranging #rvv-eu.

Three vectors and their decompositions into lengths and directional unit vectors.

Vectors bases

To describe vectors mathematically, we write them as a combination of basis vectors. An orthonormal basis is a set of two (in 2D) or three (in 3D) basis vectors which are orthogonal (have 90° angles between them) and normal (have length equal to one). We will not be using non-orthogonal or non-normal bases.

Any other vector can be written as a linear combination of the basis vectors:

Components of a vector. #rvv-ec

$$ \vec{a} = a_1 \,\hat{\imath}+ a_2 \,\hat{\jmath} + a_3 \,\hat{k} $$

The numbers $a_1, a_2, a_3$ are called the components of $ \vec{a} $ in the $ \,\hat{\imath}, \hat{\jmath}, \hat{k} $ basis. If we are in 2D then we will only have two components for a vector.

Writing a vector as the sum of scaled basis vectors. The scale factors are the components of the vector. Here $ \vec{a} = 3\hat\imath + 2\hat\jmath $, so the components of $ \vec{a} $ are $a_1 = 3$ and $a_2 = 2$.

We draw the symbol $\odot$ (arrow tip) to indicate a vector coming out of the page, and $\otimes$ (arrow fletching) to indicate an arrow going into the page.

Two standard arrangements of the basis vectors when working in 2D. Either $\hat\jmath$ is the vertical and $ \hat{k} $ is out of the page, or $ \hat{k} $ is the vertical and $\hat\jmath$ is into the page. In both cases $\hat\imath$ is horizontal.

Notation note

Just as for position coordinates, we can write the vector components $3\hat\imath + 2\hat\jmath$ as the ordered list $(3, 2)$ if we know which basis we are using. Because we often will be using several bases simultaneously, we will generally write the components explicitly in the $3\hat\imath + 2\hat\jmath$ form.

Did you know?

The use of the letter $i,j,k$ for basis vectors is due to William Hamilton, who was motivated by thinking of basis vectors as extensions of the complex number $i$. This notation was popularized by the book Vector Analysis: A Text Book for the Use of Students of Mathematics and Physics Founded upon the Lectures of J. Willard Gibbs (1901), by E. B. Wilson. This book also introduced the use of bold letters to represent vectors.

Length of vectors

The length of a vector $ \vec{a} $ is written either $ \| \vec{a} \| $ or just plain $a$. The length can be computed using Pythagorus’ theorem:

Pythagorus' length formula. #rvv-ey

$$ a = \|\vec{a}\| = \sqrt{a_1^2 + a_2^2 + a_3^2} $$

First we prove Pythagorus' theorem for right-angle triangles. For side lengths $a$ and $b$ and hypotenuse $c$, the fact that $a^2 + b^2 = c^2$ can be seen graphically below, where the gray area is the same before and after the triangles are rotated in the animation:

Pythagorus' theorem immediately gives us vector lengths in 2D. To find the length of a vector in 3D we can use Pythagorus' theorem twice, as shown below. This gives the two right-triangle calculations:

$$ \begin{aligned} \ell^2 &= a_1^2 + a_2^2 \\ a^2 &= \ell^2 + a_3^2 = a_1^2 + a_2^2 + a_3^2. \end{aligned} $$

Click and drag to rotate.

Warning: Length must be computed in a single basis. #rvv-wl

The Pythagorean length formula can only be used if all the components are written in a single orthonormal basis.

Computing the length of a vector using Pythagorus' theorem.

Some common integer vector lengths are $ \vec{a} = 4\hat\imath + 3\hat\jmath $ (length $a = 5$) and $ \vec{b} = 12\hat\imath + 5\hat\jmath $ (length $b = 13$).

Warning: Adding vectors does not add lengths. #rvv-wa

If $ \vec{c} = \vec{a} + \vec{b} $, then $ \|\vec{c}\| \ne \|\vec{a}\| + \|\vec{b}\| $ unless $ \vec{a} $ and $ \vec{b} $ are parallel and in the same direction.

It will always be true, however, that $ \|\vec{c}\| \le \|\vec{a}\| + \|\vec{b}\| $. This fact is known as the triangle inequality, for reasons that should be obvious.

Did you know?

Sets of three integers $a,b,c$ where $a^2 + b^2 = c^2$ are called Pythagorean triples. A long list of such triples is given on the Plimpton 322 clay tablet written by the ancient Babylonians around 1800 BCE, although it is unclear how they generated these numbers. Pythagorean triples lead to complex mathematics, including the curious patterns shown below and Fermat's Last Theorem.

The values of $a$ and $b$ for all Pythagorean triples $a,b,c$ with $a$ and $b$ up to 2000.

Changing bases

To change the basis that a vector is written in, we need to know how the basis vectors are related. We do this by writing one set of basis vectors in terms of the other basis vectors. If we want to change from $ \hat\imath,\hat\jmath $ to $ \hat{u},\hat{v} $, then we need to write $ \hat\imath,\hat\jmath $ in terms of $ \hat{u},\hat{v} $ and then substitute the expressions.

Example: Example: Basis change. #rvv-xn

For example, if we have $ \vec{a} = 3\,\hat{\imath} + 2\,\hat{\jmath} $ and we want to write this in the $ \,hat{u}, ,hat{v} $ basis, then we need to know $ \,\hat{\imath}, \,\hat{\jmath} $ in terms of $ \,\hat{u}, \,\hat{v} $.

From above we see that:

$$ \begin{aligned} \hat{\imath} &= \cos\theta \, \hat{u} - \sin\theta \, \hat{v} = \frac{1}{\sqrt{2}} \,\hat{u} - \frac{1}{\sqrt{2}} \,\hat{v} \\ \hat{\jmath} &= \sin\theta \, \hat{u} + \cos\theta \, \hat{v} = \frac{1}{\sqrt{2}} \,\hat{u} + \frac{1}{\sqrt{2}} \,\hat{v}.\end{aligned} $$

Then we can substitute and re-arrange:

$$ \begin{aligned} \vec{a} &= 3\,\hat{\imath} + 2\,\hat{\jmath} \\ &= 3\left(\frac{1}{\sqrt{2}} \,\hat{u} - \frac{1}{\sqrt{2}} \,\hat{v}\right) + 2\left(\frac{1}{\sqrt{2}} \,\hat{u} + \frac{1}{\sqrt{2}} \,\hat{v}\right) \\ &= \left(\frac{3}{\sqrt{2}} + \frac{2}{\sqrt{2}} \right) \,\hat{u} + \left(-\frac{3}{\sqrt{2}} + \frac{2}{\sqrt{2}} \right) \,\hat{v} \\ &= \frac{5}{\sqrt{2}} \,\hat{u} - \frac{1}{\sqrt{2}} \,\hat{v}.\end{aligned} $$

If we want to convert back the other way then we would need to know $ \,\hat{u}, \,\hat{v} $ in terms of $ \,\hat{\imath}, \,\hat{\jmath} $. We can find this by solving for $ \,\hat{u}, \,\hat{v} $ above, giving:

$$ \begin{aligned} \hat{u} &= \cos\theta \, \hat\imath + \sin\theta \, \hat\jmath = \frac{1}{\sqrt{2}} \,\hat\imath + \frac{1}{\sqrt{2}} \,\hat\jmath \\ \hat{v} &= -\sin\theta \, \hat\imath + \cos\theta \, \hat\jmath = -\frac{1}{\sqrt{2}} \,\hat\imath + \frac{1}{\sqrt{2}} \,\hat\jmath.\end{aligned} $$

We can also write the general expressions for basis change, as below.

Change of basis formulas. #rvv-eg

$$ \begin{aligned}\vec{a} &=a_i \, \hat\imath + a_j \, \hat\jmath + a_k \, \hat{k}& \vec{a} &=a_u \, \hat{u} + a_v \, \hat{v} + a_w \, \hat{w} \\[1em]a_i &= a_u u_i + a_v v_i + a_w w_i& a_u &= a_i i_u + a_j j_u + a_k k_u \\a_j &= a_u u_j + a_v v_j + a_w w_j& a_v &= a_i i_v + a_j j_v + a_k k_v \\a_k &= a_u u_k + a_v v_k + a_w w_k& a_w &= a_i i_w + a_j j_w + a_k k_w\end{aligned} $$

We will derive the first set of equations (the second set are derived similarly). The vector $ \vec{a} $ can be written in both the $ \hat\imath,\hat\jmath,\hat{k} $ and $ \hat{u},\hat{v},\hat{w} $ bases:

$$ \begin{aligned} \vec{a} &= a_i \hat\imath + a_j \hat\jmath + a_k \hat{k} & \vec{a} &= a_u \hat{u} + a_v \hat{v} + a_w \hat{w}. \end{aligned} $$

We can write each $ \hat{u},\hat{v},\hat{w} $ basis vector in terms of the $ \hat\imath,\hat\jmath,\hat{k} $ basis:

$$ \begin{aligned} \hat{u} &= u_i \hat\imath + u_j \hat\jmath + u_k \hat{k} \\ \hat{v} &= v_i \hat\imath + v_j \hat\jmath + v_k \hat{k} \\ \hat{w} &= w_i \hat\imath + w_j \hat\jmath + w_k \hat{k}. \end{aligned} $$

Substituting these expressions into $ \vec{a} $ gives:

$$ \begin{aligned} \vec{a} &= a_u \hat{u} + a_v \hat{v} + a_w \hat{w} \\ &= a_u (u_i \hat\imath + u_j \hat\jmath + u_k \hat{k}) + a_v (v_i \hat\imath + v_j \hat\jmath + v_k \hat{k}) + a_w (w_i \hat\imath + w_j \hat\jmath + w_k \hat{k}) \\ &= (a_u u_i + a_v v_i + a_w w_i) \hat\imath + (a_u u_j + a_v v_j + a_w w_j) \hat\jmath + (a_u u_k + a_v v_k + a_w w_k) \hat{k} \\ &= a_i \hat\imath + a_j \hat\jmath + a_k \hat{k}. \end{aligned} $$

Comparing the last two lines gives the component formulas.

In 2D the change between two orthonormal bases is a rotation by an angle $ \theta $, resulting in the change of basis expression below.

Change of basis formula in 2D. #rvv-e2

$$ \begin{aligned} \vec{a} &= a_i \, \hat\imath + a_j \, \hat\jmath & \vec{a} &= a_u \, \hat{u} + a_v \, \hat{v} \\[1em] a_i &= \cos\theta \, a_u - \sin\theta \, a_v & a_u &= \cos\theta \, a_i + \sin\theta \, a_j \\ a_j &= \sin\theta \, a_u + \cos\theta \, a_v & a_v &= -\sin\theta \, a_i + \cos\theta \, a_j \end{aligned} $$

Elementary geometry gives the relationships between the basis vectors:

$$ \begin{aligned} \hat\imath &= \cos\theta \, \hat{u} - \sin\theta \, \hat{v} & \hat{u} &= \cos\theta \, \hat\imath + \sin\theta \, \hat\jmath \\ \hat\jmath &= \sin\theta \, \hat{u} + \cos\theta \, \hat{v} & \hat{v} &= -\sin\theta \, \hat\imath + \cos\theta \, \hat\jmath. \end{aligned} $$

Thus we have the components:

$$ \begin{aligned} i_u &= \cos\theta & i_v &= -\sin\theta & u_i &= \cos\theta & u_j &= \sin\theta \\ j_u &= \sin\theta & j_v &= \cos\theta & v_i &= -\sin\theta & v_j &= \cos\theta. \end{aligned} $$

Substituting these into #rvv-eg and ignoring the third components gives the desired expressions.

Vector expressions are true no matter which basis we write the vectors in, even if they are written in different bases.

Example: Vector addition in different bases. #rvv-xa

Adding $ \vec{a} $ and $ \vec{b} $ to get the result $ \vec{c} $ is a well-defined operation even before any basis is used, so it cannot depend on the basis chosen. As we see below, we can do the calculation in either the $ \hat\imath,\hat\jmath $ or $ \hat{u}, \hat{v} $ basis.

Show components: none $ \hat\imath,\hat\jmath $ $ \hat{u}, \hat{v} $ mixed

$$ \begin{aligned} \vec{c} &= \vec{a} + \vec{b} \\ &= (3\hat\imath + 2\hat\jmath) + (3\hat\imath - \hat\jmath) \\ &= 6\hat\imath + \hat\jmath \\ \vec{c} &= \vec{a} + \vec{b} \\ &= (3.5\hat{u} - 0.7\hat{v}) + (1.4\hat{u} - 2.8\hat{v}) \\ &= 4.9\hat{u} - 3.5\hat{v} \\ \vec{c} &= \vec{a} + \vec{b} \\ &= (3\hat\imath + 2\hat\jmath) + (1.4\hat{u} - 2.8\hat{v}) \\ &= 3\hat\imath - 2.8\hat{v} + 2\hat\jmath + 1.4\hat{u}. \end{aligned} $$

The component order in the mixed expression is arbitrary.

Example Problem: Cross product in different bases. #rvv-xx

Consider the two vectors shown below and their components in two bases, together with their lengths:

$$ \begin{aligned} \vec{a} &= 3\hat\imath + 2\hat\jmath = 3.5\hat{u} - 0.7\hat{v} & a &= \sqrt{3^2 + 2^2} = 3.6 \\ \vec{b} &= 3\hat\imath - \hat\jmath = 1.4\hat{u} - 2.8\hat{v} & b &= \sqrt{3^2 + 1^2} = 3.2 \end{aligned} $$

Show components: none $ \hat\imath,\hat\jmath $ $ \hat{u}, \hat{v} $

Compute the cross product $ \vec{a} \times \vec{b} $ using: (1) the angle formula #rvv-el; (2) the component formula #rvv-ex with $ \vec{a}, \vec{b} $ both in the $ \hat\imath,\hat\jmath $ basis, both in the $ \hat{u}, \hat{v} $ basis, and with $ \vec{a} $ in the $ \hat\imath,\hat\jmath $ basis and $ \vec{b} $ in the $ \hat{u}, \hat{v} $ basis.

(1) The dot product is $ \vec{a} \cdot \vec{b} = 7 $ and the vector lengths are $a = 3.6$ and $b = 3.2$, so $\cos\theta = 7 / (ab)$ and $\theta \approx 53^\circ$. Now using #rvv-el gives:

$$ \begin{aligned} \vec{a} \times \vec{b} &= a b \sin\theta ( -\hat{k}) \\ &\approx -9 \hat{k}. \end{aligned} $$

(2) Using the component formula #rvv-ex gives:

$$ \begin{aligned} (3\hat\imath + 2\hat\jmath) \times (3\hat\imath - \hat\jmath) &= -3 \hat\imath \times \hat\jmath + 6 \hat\jmath \times \hat\imath \\ &= -3 \hat{k} - 6 \hat{k} \\ &= -9 \hat{k} \\ (3.5 \hat{u} - 0.7 \hat{v}) \times (1.4 \hat{u} - 2.8 \hat{v}) &= - (3.5 \times 2.8) \hat{u} \times \hat{v} - (0.7 \times 1.4) \hat{v} \times \hat{u} \\ &= -10 \hat{k} + \hat{k} \\ &= -9 \hat{k} \\ (3\hat\imath + 2\hat\jmath) \times (1.4\hat{u} - 2.8\hat{v}) &= (3 \times 1.4) \hat\imath \times \hat{u} - (3 \times 2.8) \hat\imath \times \hat{v} \\ &\quad + (2 \times 1.4) \hat\jmath \times \hat{u} - (2 \times 2.8) \hat\jmath \times \hat{v} \\ &= 4.2 \sin 53^\circ \, \hat{k} - 8.5 \sin 143^\circ \, \hat{k} \\ &\quad - 2.8 \sin 53^\circ \, \hat{k} - 5.7 \sin 53^\circ \, \hat{k} \\ &= - 9 \hat{k}. \end{aligned} $$

Example: Dot product is independent of basis. #rvv-xd

Equation #rvv-ed makes it clear that the dot product does not depend on which basis we use to write $ \vec{a} $ and $ \vec{b} $, so long as we use the same orthonormal basis for both of them. This is because the dot product only depends on the lengths and angle between the vectors, which are real physical quantities that don’t change just because we use a different basis.

However, we can also verify directly that the component equation #rvv-es for the dot product does not depend on which basis we use. To keep the algebra short, we will only do this in 2D.

We compute the dot product using #rvv-es in the $\hat\imath,\hat\jmath$ basis and substitute in the change-of-basis expressions #rvv-eg, giving:

$$ \begin{aligned} \vec{a} \cdot \vec{b} &= a_i b_i + a_j b_j \\ &= (u_i a_u + v_i a_v ) (u_i b_u + v_i b_v) + (u_j a_u + v_j a_v) (u_j b_u + v_j b_v) \\ &= (u_i^2 + u_j^2) a_u b_u + (v_i^2 + v_j^2) a_v b_v + (u_i v_i + u_j b_j) (a_u b_v + a_v b_u) \\ &= \| \hat{u} \|^2 a_u b_u + \| \hat{v} \|^2 a_v b_v + (\hat{u} \cdot \hat{v}) (a_u b_v + a_v b_u) \\ &= a_u b_u + a_v b_v. \end{aligned} $$

To get the last line we used the fact that $ \hat{u} $ and $ \hat{v} $ form an orthornormal basis, so that they each have length 1 (that is, $ \|\hat{u}\| = \|\hat{v}\| = 1 $) and they are orthogonal (that is, $ \hat{u} \cdot \hat{v} = 0 $ ).

This then shows that

$$ \begin{aligned} a_i b_i + a_j b_j &= a_u b_u + a_v b_v \end{aligned} $$

and so it doesn’t matter which basis we use to compute $ \vec{a} \cdot \vec{b} $ , so long as we use an orthonormal basis.

Projection and complementary projection

The projection and complementary projection are:

Projection of $\vec{a}$ onto $\vec{b}$. #rvv-ep

$$ \operatorname{Proj}(\vec{a},\vec{b})= (\vec{a} \cdot \hat{b}) \hat{b}= (a \cos\theta) \, \hat{b} $$

Complementary projection of $\vec{a}$ with respect to $\vec{b}$. #rvv-em

$$ \begin{aligned}\operatorname{Comp}(\vec{a}, \vec{b})&= \vec{a} -\operatorname{Proj}(\vec{a}, \vec{b}) =\vec{a} - (\vec{a} \cdot \hat{b}) \hat{b} \\\left\|\operatorname{Comp}(\vec{a}, \vec{b}) \right\|&= a \sin\theta\end{aligned} $$

Adding the projection and the complementary projection of a vector just give the same vector again, as we can see on the figure below.

Projection of $ \vec{a} $ onto $ \vec{b} $ and the complementary projection.

As we see in the diagram above, the complementary projection is orthogonal to the reference vector:

Complementary projection is orthogonal to the reference. #rvv-er

$$ \operatorname{Comp}(\vec{a}, \vec{b}) \cdot \vec{b} = 0 $$

Using the definitions of the complementary projection rvv-em and projection rvv-ep, we compute:

$$ \begin{aligned} \operatorname{Comp}(\vec{a}, \vec{b}) \cdot \vec{b} &= \Big(\vec{a} - (\vec{a} \cdot \hat{b}) \hat{b}\Big) \cdot \vec{b} \\ &= \vec{a} \cdot \vec{b} - (\vec{a} \cdot \hat{b}) (\hat{b} \cdot \vec{b}) \\ &= a b \cos\theta - (a\cos\theta) b \\ &= 0. \end{aligned} $$

Change in length and direction

Two useful derivatives are the rates of change of a vector's length and direction:

Derivative of vector length. #rvc-el

$$ \dot{a} = \dot{\vec{a}} \cdot \hat{a} $$

We start with the dot product expression #rvv-ed for length and differentiate it:

$$ \begin{aligned} a &= \sqrt{\vec{a} \cdot \vec{a}} \\ \frac{d}{dt} a &= \frac{d}{dt} \big( (\vec{a} \cdot \vec{a})^{1/2} \big) \\ \dot{a} &= \frac{1}{2} (\vec{a} \cdot \vec{a})^{-1/2} (\dot{\vec{a}} \cdot \vec{a} + \vec{a} \cdot \dot{\vec{a}}) \\ &= \frac{1}{2\sqrt{a^2}} (2 \dot{\vec{a}} \cdot \vec{a}) \\ &= \dot{\vec{a}} \cdot \hat{a}.\end{aligned} $$

Derivative of vector direction. #rvc-eu

$$ \dot{\hat{a}} = \frac{1}{a}\operatorname{Comp}(\dot{\vec{a}}, \vec{a}) $$

We take the definition #rvv-eu for the unit vector and differentiate it:

$$ \begin{aligned} \hat{a} &= \frac{\vec{a}}{a} \\ \frac{d}{dt} \hat{a} &= \frac{d}{dt}\left(\frac{\vec{a}}{a}\right) \\ \dot{\hat{a}} &= \frac{\dot{\vec{a}} a - \vec{a} \dot{a}}{a^2} \\ &= \frac{\dot{\vec{a}}}{a} - \frac{\dot{\vec{a}} \cdot \hat{a}}{a^2} \vec{a} \\ &= \frac{1}{a} \big( \dot{\vec{a}} - (\dot{\vec{a}} \cdot \hat{a}) \hat{a} \big)\\ &= \frac{1}{a} \operatorname{Comp}(\dot{\vec{a}}, \vec{a}).\end{aligned} $$

Here we observed at the end that we had the expression #rvv-em for the complementary projection of the derivative $ \dot{\vec{a}} $ with respect to $ \vec{a} $ itself.

An immediate consequence of the derivative of direction formula is that the derivative of a unit vector is always orthogonal to the unit vector:

Derivative of unit vector is orthogonal. #rvc-eu2

$$ \dot{\hat{a}} \cdot \hat{a} = 0 $$

From #rvc-eu we know that $ \dot{\hat{a}} $ is in the direction of $ \operatorname{Comp}(\dot{\vec{a}}, \vec{a}) $ , and from #rvv-er we know that this is orthogonal to $ \vec{a} $ (and also $ \hat{a} $).

Recall that we can always write a vector as the product of its length and direction, so $ \vec{a} = a \hat{a} $. This gives the following decomposition of the derivative of $ \vec{a} $.

Vector derivative decomposition. #rvc-em2

$$ \begin{aligned} \dot{\vec{a}} &=\underbrace{\dot{a}\hat{a}}_{\operatorname{Proj}(\dot{\vec{a}},\vec{a})} + \underbrace{a\dot{\hat{a}}}_{\operatorname{Comp}(\dot{\vec{a}},\vec{a})}\end{aligned} $$

Differentiating $ \vec{a} = a \hat{a} $ and substituting in #rvv-el and #rvv-eu gives

$$ \begin{aligned} \dot{\vec{a}} &= \dot{a} \hat{a} + a \dot{\hat{a}} \\ &= ( \dot{\vec{a}} \cdot \hat{a} ) \hat{a} + a \frac{1}{a} \operatorname{Comp}(\dot{\vec{a}}, \hat{a}) \\ &= \operatorname{Proj}(\dot{\vec{a}}, \vec{a}) + \operatorname{Comp}(\dot{\vec{a}}, \vec{a}). \end{aligned} $$

Show:

Vector derivatives can be decomposed into length changes (projection onto $ \vec{a} $) and direction changes (complementary projection). Compare to Figure #rvv-fu.

Spherical coordinates

The spherical coordinate system extends polar coordinates into 3D by using an angle $\phi$ for the third coordinate. This gives coordinates $(r, \theta, \phi)$ consisting of:

coordinate	name	range	definition
$r$	radius	$0 \le r \lt \infty$	distance from the origin
$\theta$	azimuth	$-\pi \lt \theta \le \pi$	angle from the $x$-axis in the $x$–$y$ plane
$\phi$	inclination	$0 \le \phi \le \pi$	angle from the positive $z$ axis

The diagram below shows the spherical coordinates of a point $P$. By changing the display options, we can see that the basis vectors are tangent to the corresponding coordinate lines. Changing $\theta$ moves $P$ along the $\theta$ coordinate line in the direction $ \hat{e}_\theta $, and similarly for the other coordinates.

Show:

Coordinate lines:

Radius:

$r = \; $ 4

Azimuth:

$\theta = \; $ 45$^\circ$

Inclination:

$\phi = \; $ 45°

Spherical coordinates are defined with respect to a set of Cartesian coordinates, and can be converted to and from these coordinates using the atan2 function as follows.

Conversion between spherical and Cartesian coordinates. #rvs-ec

$$ \begin{aligned}x &= r \cos\theta \sin\phi & r &= \sqrt{x^2 + y^2 + z^2} \\y &= r \sin\theta \sin\phi & \theta &= \operatorname{atan2}(y, x) \\z &= r \cos\phi & \phi &= \operatorname{arccos}(z / r)\end{aligned} $$

To find the conversion to Cartesian coordinates, we consider the projection of $ \vec{r} $ down onto the $x$–$y$ plane, as shown. Then $z = r \cos\phi$ and $ \ell = r \sin\phi $, from which we obtain $ x = \ell \cos\theta $ and $ y = \ell \sin\theta $.

To convert from Cartesian coordinates, we use the same projection and read off the expressions for the spherical coordinates. This uses the atan2 function to handle the different quadrants for $\theta$.

The basis vectors are tangent to the coordinate lines and form an orthonormal basis $ \hat{e}_r, \hat{e}_\theta, \hat{e}_\phi $ that depends on the current position $P$ as follows.

Spherical basis vectors. #rvs-eb

$$ \begin{aligned}\hat{e}_r &= \cos\theta \sin\phi \,\hat{\imath}+ \sin\theta \sin\phi \,\hat{\jmath} + \cos\phi \,\hat{k} \\\hat{e}_{\theta} &= - \sin\theta \,\hat{\imath}+ \cos\theta \,\hat{\jmath} \\\hat{e}_{\phi} &= \cos\theta \cos\phi \,\hat{\imath}+ \sin\theta \cos\phi \,\hat{\jmath} - \sin\phi \,\hat{k} \\[1em]\hat{\imath} &= \cos\theta \sin\phi \, \hat{e}_r- \sin\theta \, \hat{e}_\theta + \cos\theta \cos\phi \, \hat{e}_\phi \\\hat{\jmath} &= \sin\theta \sin\phi \, \hat{e}_r+ \cos\theta \, \hat{e}_\theta + \sin\theta \cos\phi \, \hat{e}_\phi \\\hat{k} &= \cos\phi \, \hat{e}_r - \sin\phi \, \hat{e}_\phi\end{aligned} $$

We write the position vector $ \vec{r} = r \cos\theta \sin\phi \, \hat{\imath} + r \sin\theta \sin\phi \, \hat{\jmath} + r \cos\phi \, \hat{k} $ and then use the definition of coordinate basis vectors to find the non-normalized spherical basis vectors:

$$ \begin{aligned} \vec{e}_r &= \frac{\partial\vec{r}}{\partial r} = \cos\theta \sin\phi \, \hat{\imath} + \sin\theta \sin\phi \, \hat{\jmath} + \cos\phi \, \hat{k} \\ \vec{e}_\theta &= \frac{\partial\vec{r}}{\partial\theta} = -r \sin\theta \sin\phi \, \hat{\imath} + r \cos\theta \sin\phi \, \hat{\jmath} \\ \vec{e}_\phi &= \frac{\partial\vec{r}}{\partial\phi} = r \cos\theta \cos\phi \, \hat{\imath} + r \sin\theta \cos\phi \, \hat{\jmath} - r \sin\phi \, \hat{k} \end{aligned} $$

To normalize these vectors we divide by their lengths, which we can compute to be $ \| \vec{e}_r \| = 1 $, $ \| \vec{e}_\theta \| = r \sin\phi, and \| \vec{e}_\phi \| = r $.

To invert the basis change we first observe that we can take combinations of $ \hat{e}_r $ and $ \hat{e}_{\phi} $ to give:

$$ \begin{aligned} \cos\phi \, \hat{e}_r - \sin\phi \, \hat{e}_\phi &= \hat{k} \\ \sin\phi \, \hat{e}_r + \cos\phi \, \hat{e}_\phi &= \cos\theta \, \hat{\imath} + \sin\theta \, \hat{\jmath} \end{aligned} $$

Then:

$$ \begin{aligned} \cos\theta(\sin\phi \, \hat{e}_r + \cos\phi \, \hat{e}_\phi) - \sin\theta \, \hat{e}_\theta &= \cos^2\theta \, \hat{\imath} + \sin^2\theta \, \hat{\imath} = \hat{\imath} \\ \sin\theta(\sin\phi \, \hat{e}_r + \cos\phi \, \hat{e}_\phi) + \cos\theta \, \hat{e}_\theta &= \sin^2\theta \, \hat{\imath} + \cos^2\theta \, \hat{\jmath} = \hat{\jmath} \end{aligned} $$

Rearranging these gives the Cartesian basis vector expressions above.

Warning: $(\hat{e}_r,\hat{e}_\theta,\hat{e}_\phi)$ is not right-handed #rvs-wr

Although it is common to write the spherical coordinates in the order $(r,\theta,\phi)$, this order gives a left-handed basis $ (\hat{e}_r,\hat{e}_\theta,\hat{e}_\phi) $, which we can see graphically from the fact that $ \hat{e}_r \times \hat{e}_\theta = -\hat{e}_\phi $. We can either work with this as a left-handed basis, or re-order the coordinates to give the right-handed basis $ (\hat{e}_r,\hat{e}_\phi,\hat{e}_\theta) $.

If the spherical coordinates change with time then this causes the spherical basis vectors to rotate with the following angular velocity.

Angular velocity of the spherical basis. #rvs-ew

$$ \begin{aligned}\vec{\omega} &= \dot\phi \, \hat{e}_\theta + \dot\theta \, \hat{k} \\&= \dot\theta \cos\phi \,\hat{e}_r + \dot\phi \, \hat{e}_\theta- \dot\theta \sin\phi \,\hat{e}_{\phi}\end{aligned} $$

Changing $r$ does not cause a rotation of the basis, while changing $\theta$ rotates about the vertical axis $ \hat{k} $ and changing $\phi $ rotates about $ \hat{e}_\theta $. Combining these angular velocities gives the expression for $ \vec{\omega} $.

The rotation of the basis vectors caused by changing coordinates can be directly computed, giving the time derivatives below.

Time derivatives of spherical basis vectors. #rvs-et

$$ \begin{aligned}\dot{\hat{e}}_r &= \dot\theta \sin\phi \,\hat{e}_{\theta}+ \dot\phi \,\hat{e}_{\phi} \\\dot{\hat{e}}_{\theta} &= - \dot\theta \sin\phi \,\hat{e}_r- \dot\theta \cos\phi \,\hat{e}_{\phi} \\\dot{\hat{e}}_{\phi} &= - \dot\phi \,\hat{e}_r+ \dot\theta \cos\phi \,\hat{e}_{\theta}\end{aligned} $$

We can either directly differentiate the basis vector expressions, or we can recall that $ \dot{\hat{e}} = \vec{\omega} \times \hat{e} $ for any basis vector $ \hat{e} $. This gives:

$$ \begin{aligned} \dot{\hat{e}}_r &= \vec{\omega} \times \hat{e}_r = \dot\theta \cos\phi \, \hat{e}_r \times \hat{e}_r + \dot\phi \, \hat{e}_\theta \times \hat{e}_r - \dot\theta \sin\phi \, \hat{e}_{\phi} \times \hat{e}_r \\ \dot{\hat{e}}_{\theta} &= \vec{\omega} \times \hat{e}_{\theta} = \dot\theta \cos\phi \,\hat{e}_r \times \hat{e}_\theta + \dot\phi \, \hat{e}_\theta \times \hat{e}_\theta - \dot\theta \sin\phi \,\hat{e}_{\phi} \times \hat{e}_\theta \\ \dot{\hat{e}}_{\phi} &= \vec{\omega} \times \hat{e}_{\phi} = \dot\theta \cos\phi \,\hat{e}_r \times \hat{e}_\phi + \dot\phi \, \hat{e}_\theta \times \hat{e}_\phi - \dot\theta \sin\phi \,\hat{e}_{\phi} \times \hat{e}_\phi \end{aligned} $$

Now we evaluate the cross products graphically to obtain the final expressions.

A point $P$ at a time-varying position $(r,\theta,\phi)$ has position vector $ \vec{r} $, velocity $ \vec{v} = \dot{\vec{r}} $, and acceleration $ \vec{a} = \ddot{\vec{r}} $ given by the following expressions in spherical components.

Position, velocity, and acceleration in spherical components. #rvs-ep

$$ \begin{aligned}\vec{r} &= r \,\hat{e}_r \\\vec{v} &= \dot{r} \,\hat{e}_r+ r \dot\theta \sin\phi \,\hat{e}_{\theta}+ r \dot\phi \,\hat{e}_{\phi} \\\vec{a} &= (\ddot{r} - r \dot{\theta}^2 \sin^2\phi- r \dot{\phi}^2) \,\hat{e}_r \\&\quad + (r \ddot\theta \sin\phi+ 2 \dot{r} \dot\theta \sin\phi+ 2 r \dot\theta \dot\phi \cos\phi) \,\hat{e}_{\theta} \\&\quad + (r \ddot\phi + 2 \dot{r} \dot\phi- r \dot{\theta}^2 \sin\phi \cos\phi) \,\hat{e}_{\phi}\end{aligned} $$

Because $ \hat{e}_r $ is a unit vector in the direction of the position vector $ \vec{r} $, we know that $ \vec{r} = r \, \hat{e}_r $. Then we can differentiate this expression to obtain:

$$ \begin{aligned} \vec{v} &= \dot{\vec{r}} = \frac{d}{dt}\Big(r \, \hat{e}_r \Big) = \dot{r} \, \hat{e}_r + r \, \dot{\hat{e}}_r \end{aligned} $$

and we substitute in the expression for $ \dot{\hat{e}_r} $from above. Taking another derivative gives:

$$ \begin{aligned} \vec{a} = \dot{\vec{v}} &= \frac{d}{dt}\Big( \dot{r} \,\hat{e}_r + r \dot\theta \sin\phi \,\hat{e}_{\theta} + r \dot\phi \,\hat{e}_{\phi} \Big) \\ &= \ddot{r} \, \hat{e}_r + \dot{r} \, \dot{\hat{e}}_r + (\dot{r} \dot\theta \sin\phi + r \ddot\theta \sin\phi + r \dot\theta \cos\phi \, \dot\phi) \, \hat{e}_\theta \\ &\quad + r \dot\theta \sin\phi \, \dot{\hat{e}}_\theta + (\dot{r} \dot\phi + r \ddot\phi) \, \hat{e}_\phi + r \dot\phi \, \dot{\hat{e}}_\phi \end{aligned} $$

and again we can substitute the basis vector derivatives.

Cylindrical coordinates

The cylindrical coordinate system extends polar coordinates into 3D by using the standard vertical coordinate $z$ for the third. This gives coordinates $(r, \theta, z)$ consisting of:

coordinate	name	range	definition
$r$	radius	$0 \le r \lt \infty$	distance from the origin
$\theta$	azimuth	$-\pi \lt \theta \le \pi$	angle from the $x$-axis in the $x$–$y$ plane
$z$	height	$-\infty \le z \le \infty$	vertical height

The diagram below shows the cylindrical coordinates of a point $P$. By changing the display options, we can see that the basis vectors are tangent to the corresponding coordinate lines. Changing $\theta$ moves $P$ along the $\theta$ coordinate line in the direction $ \hat{e}_\theta $, and similarly for the other coordinates.

Show:

Coordinate lines:

Radius:

$r = \; $ 4

Azimuth:

$\theta = \; $ 45$^\circ$

Elevation:

$z = \; $ 4

Cylindrical coordinates are defined with respect to a set of Cartesian coordinates, and can be converted to and from these coordinates using the atan2 function, as described in the previos section of Spherical coordinates.

The basis vectors are tangent to the coordinate lines and form an orthonormal basis $ \hat{e}_r, \hat{e}_\theta, \hat{e}_z $ that depends on the current position $ \vec{P} $ as follows. We can write either $ \hat{e}_z $ or $ \hat{k}_z $ for the vertical basis vector.

Cylindrical basis vectors. #rvy-eb

$$ \begin{aligned}\hat{e}_r &= \cos\theta \, \hat{\imath} + \sin\theta \,\hat{\jmath} \\\hat{e}_{\theta} &= - \sin\theta \,\hat{\imath}+ \cos\theta \,\hat{\jmath} \\\hat{e}_{z} &= \hat{k} \\[1em]\hat{\imath} &= \cos\theta \, \hat{e}_r- \sin\theta \, \hat{e}_\theta \\\hat{\jmath} &= \sin\theta \, \hat{e}_r+ \cos\theta \, \hat{e}_\theta \\\hat{k} &= \hat{e}_z \end{aligned} $$

We write the position vector $ \vec{\rho} = r \cos\theta \, \hat{\imath} + r \sin\theta \, + z \, \hat{k} $ and then use the definition of coordinate basis vectors to find the non-normalized spherical basis vectors:

$$ \begin{aligned} \vec{e}_r &= \frac{\partial\vec{\rho}}{\partial r} = \cos\theta \, \hat{\imath} + \sin\theta \, \hat{\jmath} \\ \vec{e}_\theta &= \frac{\partial\vec{\rho}}{\partial\theta} = -r \sin\theta \, \hat{\imath} + r \cos\theta \, \hat{\jmath} \\ \vec{e}_z &= \frac{\partial\vec{\rho}}{\partial\phi} = \hat{k} \end{aligned} $$

Both $ \vec{e}_r $ and $ \vec{e}_z $ are already normalized, and the length $ \vec{e}_{\theta} $ is r, so we can divide by this to obtain the final normalized basis vector.

To invert the basis change we can solve for $ \hat{\imath} $ and $ \hat{\jmath} $.

If the cylindrical coordinates change with time then this causes the spherical basis vectors to rotate with the following angular velocity.

Angular velocity of the cylindrical basis. #rvy-ew

$$ \vec{\omega} = \dot\theta \, \hat{e}_z $$

Changing $r$ or $z$ does not cause a rotation of the basis, while changing $\theta$ rotates about the vertical axis $ \hat{e}_z $.

The rotation of the basis vectors caused by changing coordinates gives the time derivatives below.

Time derivatives of cylindrical basis vectors. #rvy-et

$$ \begin{aligned}\dot{\hat{e}}_r &= \dot\theta \,\hat{e}_{\theta} \\\dot{\hat{e}}_{\theta} &= - \dot\theta \,\hat{e}_r\\\dot{\hat{e}}_z &= 0\end{aligned} $$

We can either directly differentiate the basis vector expressions, or we can recall that $ \dot{\hat{e}} = \vec{\omega} \times \hat{e} $ for any basis vector $ \hat{e} $. This gives:

$$ \begin{aligned} \dot{\hat{e}}_r &= \vec{\omega} \times \hat{e}_r = \dot\theta \, \hat{e}_z \times \hat{e}_r = \dot\theta \, \hat{e}_{\theta} \\ \dot{\hat{e}}_{\theta} &= \vec{\omega} \times \hat{e}_{\theta} = \dot\theta \, \hat{e}_z \times \hat{e}_{\theta} = -\dot\theta \, \hat{e}_r \\ \dot{\hat{e}}_{\phi} &= \vec{\omega} \times \hat{e}_{\phi} = \dot\theta \, \hat{e}_z \times \hat{e}_z = 0 \end{aligned} $$

where we used the fact that $ \hat{e}_r , \hat{e}_{\theta} , \hat{e}_z $ form a right-handed orthonormal basis to evaluate the cross products.

A point $P$ at a time-varying position $(r,\theta, z)$ has position vector $ \vec{\rho} $, velocity $ \vec{v} = \dot{\vec{\rho}} $, and acceleration $ \vec{a} = \ddot{\vec{\rho}} $ given by the following expressions in cylindrical components.

Position, velocity, and acceleration in cylindrical components. #rvy-ep

$$ \begin{aligned}\vec{\rho} &= r \,\hat{e}_r +z \, \hat{e}_z \\\vec{v} &= \dot{r} \,\hat{e}_r+ r \dot\theta \,\hat{e}_{\theta}+ \dot{z} \,\hat{e}_{z} \\\vec{a} &= (\ddot{r} - r \dot{\theta}^2) \, \hat{e}_r + (r \ddot{\theta} + 2 \dot{r} \dot{\theta}) \, \hat{e}_{\theta} + \ddot{z} \, \hat{e}_z \end{aligned} $$

From the coordinate expressions we see that the the position vector $ \vec{\rho} $, we know that $ \vec{\rho} = r \, \hat{e}_r + z \, \hat{e}_Z $. Differentiating this then gives:

$$ \begin{aligned} \vec{v} &= \dot{\vec{\rho}} = \frac{d}{dt}\Big(r \, \hat{e}_r + z \, \hat{e}_z\Big) = \dot{r} \, \hat{e}_r + r \, \dot{\hat{e}}_r + \dot{z} \, \hat{e}_z + z \, \dot{\hat{e}}_z \end{aligned} $$

and we substitute in the expression for $ \dot{\hat{e}_r} $ and $ \dot{\hat{e}_z} $ from above. Taking another derivative gives:

$$ \begin{aligned} \vec{a} = \dot{\vec{v}} &= \frac{d}{dt}\Big( \dot{r} \,\hat{e}_r + r \dot\theta \,\hat{e}_{\theta} + \dot{z} \,\hat{e}_{z} \Big) \\ &= \ddot{r} \, \hat{e}_r + \dot{r} \, \dot{\hat{e}}_r + (\dot{r} \dot\theta + r \ddot\theta) \, \hat{e}_\theta + r \dot{\theta} \dot{\hat{e}_{\theta}} + \ddot{z} \hat{e}_z + \dot{z} \dot{\hat{e}_z} \\ \end{aligned} $$

and again we can substitute the basis vector derivatives.

Warning!

We normally write $ \vec{r} $ for the position vector of a point, but if we are using cylindrical coordinates $ r,\theta ,z $ then this is dangerous. This is because $ r $ might mean the magnitude of $ \vec{r} $ or the radial coordinate, which are different. To avoid this confusion we use $ \vec{\rho} $ for the position vector and $ r $ for the radial coordinate.

Applications

Shortest flight paths

If we want to fly from Urbana, USA to Delhi, India, which path should we follow? The locations of the two cities are:

	Latitude	Longitude
Urbana	$\phi = 40^\circ 6' 35'' \rm N$	$\lambda = 88^\circ 12' 15'' \rm W$
Delhi	$\phi = 28^\circ 36' 36'' \rm N$	$\lambda = 77^\circ 13' 48'' \rm E$

Geographic coordinates are typically given in spherical coordinates, but without the radius and with the angles given in degrees, minutes, and seconds, with the latitude first, and the direction given as North/South or East/West rather than positive/negative. Also note that latitude is the elevation angle up from the equator, whereas spherical coordinates often use the inclination angle down from the polar axis.

Plotting the city locations on a map of the Earth gives the figure below. Such a map can give a very misleading idea of the shortest route, however.

Show:

Map of the Earth in an equirectangular projection, showing two possible flight paths from Urbana, USA to Delhi, India.

Reference material

Dot product
Spherical coordinates

Did you know?

To get an idea of what it looks like to fly along a great circle path, we can view the crew movies taken from the ISS. The movies taken at night are especially evocative, such as this montage by Michael König.

Technically the ISS is not following a great circle path around the Earth, because the Earth is rotating underneath it. However, because the speed of the ISS is much higher than the Earth's rotational velocity (90 min versus 24 h period), these paths are close to being great circles.

Shortest paths on the sphere

Finding the shortest distance between two points on the sphere is not a simple calculation given their latitude and longitude. As proved below, the shortest path on the sphere is always a great circle, which is the intersection of the sphere with a plane through the origin. The shortest distance calculation thus reduces to finding the angle between the vectors $ \vec{OA} $ and $ \vec{OB} $, which can be easily done by finding their dot product after changing them to rectangular coordinates.

Click and drag to rotate.
Show:	Earth transparency: 20%

City A:	Latitude:	$\phi_1 = $ 50°
	Longitude:	$\lambda_1 = $ -20°
City B:	Latitude:	$\phi_1 = $ -20°
	Longitude:	$\lambda_1 = $ 65°

Reference material

There are many different map projections that draw the surface of the spherical Earth on a two-dimensional map. Because the sphere is curved, a map projection cannot preserve both areas and angles simultaneously, leading to different trade-offs in choosing an appropriate projection for a given purpose. Some common projections are shown below.

Equirectangular.

$$ x = \lambda, \quad y = \phi $$

Mercator: cylindrical, conformal (angle-preserving).

$$ x = \lambda, \quad y = \log\left(\tan\Big(\frac{\pi}{4} + \frac{\phi}{2}\Big)\right) $$

Hobo-Dyer: cylindrical, area-preserving.

$$ x = \lambda, \quad y = \frac{\sin\phi}{\cos^2(37.5^\circ)} $$

Winkel tripel: non-cylindrical, compromise.

$$ \begin{aligned}\small x &= \frac{1}{2} \left(\lambda \cos\phi_1 + \frac{2\cos\phi\sin\frac{\lambda}{2}}{{\rm sinc}\,\alpha}\right) \\ y &= \frac{1}{2} \left(\phi + \frac{\sin\phi}{{\rm sinc}\,\alpha}\right) \\ \alpha &= \arccos\Big(\cos\phi \cos\frac{\lambda}{2}\Big) \\ \phi_1 &= \arccos\frac{2}{\pi} \end{aligned} $$

You might also be interested in what your choice of map projection says about your personality.

Proof that great circles are shortest paths

Given any two points $ A $ and $ B $ on the surface of the Earth, we choose coordinate axes and use spherical coordinates so that $ A $ is at $ (r = r_0, \theta = 0, \phi = \frac{\pi}{2} - \phi_0) $ and $ B $ is at $ (r = r_0, \theta = 0, \phi = \frac{\pi}{2} + \phi_0) $. Here $ r_0 $ is the radius of the Earth and $ 0 \le \phi_0 \le \frac{\pi}{2} $. The total angle between $ \vec{OA} $ and $ \vec{OB} $ is $ \Delta\phi = 2\phi_0 $, so the great-circle distance between $ A $ and $ B $ is $ r_0 \Delta\phi $. We will now prove that this is the smallest possible distance.

Consider any path from $A$ to $B$ on the Earth's surface. Suppose we travel along this path, starting at $A$ at time $t = 0$ and ending at $B$ at time $t = 1$. Then the position vector at time $t$ is given by $ \vec{r}(t) $ with spherical coordinates $r_0$, $\theta(t)$, $\phi(t)$ and rectangular coordinates $x(t), y(t), z(t)$.

The length $L$ of our path can be computed by integrating the speed from the start time to the end time:

$$ L = \int_0^1 \| \dot{\vec{r}} \| \, dt= \int_0^1 \sqrt{\dot{x}^2 + \dot{y}^2 + \dot{z}^2} \, dt. $$

To switch from rectangular to spherical coordinates we use the conversion:

$$ \begin{aligned} x &= r \cos\theta \sin\phi \\ y &= r \sin\theta \sin\phi \\ z &= r \cos\phi. \end{aligned} $$

Differentiating these expressions and substituting (or using the spherical velocity expression) gives the path length expression:

$$ L = \int_0^1 r_0 \sqrt{\dot{\phi}^2 + \dot{\theta}^2 \sin^2\phi} \, dt\ge \int_0^1 r_0 |\dot{\phi}| \, dt\ge r_0 \Delta \phi. $$

Here we have used the fact that $ \dot{\theta}^2 \sin^2\phi \ge 0 $ to see that $L$ must be at least as big as the great-circle distance $r_0 \Delta \phi$. Thus the great-circle distance is the minimum path length between $A$ and $B$ on the surface of the sphere.

Did you know?

All the calculations on this page have assumed a spherical model of the Earth, given by

$$ x^2 + y^2 + z^2 = R^2, $$

where the radius of the Earth is $R = 6371\rm\ km.$

Because it spins with a period of about 24 hours, the Earth bulges outwards so the center is further from the equator than from the poles. A shape that better approximates the true Earth is thus an ellipsoid, given by:

$$ \left(\frac{x}{R_{\rm xy}}\right)^2+ \left(\frac{y}{R_{\rm xy}}\right)^2+ \left(\frac{z}{R_{\rm z}}\right)^2 = 1, $$

where $ R_{\rm xy} = 6378\rm\ km $ and $ R_{\rm z} = 6357\rm\ km. $

Computing shortest path distances on ellipsoids turns out to be much harder than shortest paths on spheres. One set of equations for doing this are known as Vincenty's formulae.

There is no single best ellipsoid that fits the Earth. Instead there are many different ellipsoids in use, each of which fit the Earth better near different countries (e.g., NAD 83 for North America and WGS 84 for GPS world-wide). When even more accuracy is needed, such as for satellite orbit modeling, then true geoid models are used, with spherical harmonic expansions determined from satellite gravity measurements.

coordinate	name	range	definition
\(x\)	horizontal	\(-\infty \lt x \lt \infty\)	horizontal distance from the \(y\)-axis
\(y\)	vertical	\(-\infty \lt y \lt \infty\)	vertical distance from the \(x\)-axis

coordinate	name	range	definition
\(r\)	radius	\(0 \le r \lt \infty\)	distance from the origin \(O\)
\(\theta\)	angle	\(-\pi \lt \theta \le \pi\)	angle from the \(x\)-axis in the \(x\)–\(y\) plane