Chapter 6: The Metric $g$ and the Hodge Star $\ast$ — Summoning the Inner Product and Reversing Degree

§6.0 The End of Excuses — Releasing the Inner Product

When, in Chapter 2, we restored the scalar area of a parallelogram as the square root of the sum of squares of three projected areas, and when, in Chapter 3, we measured the surface area $4\pi R^2$ of a sphere and the arc length $2\pi R$ of a circle by the same idea—here we have repeatedly hedged, each time we computed a length or an area, with “we have not yet formally introduced the metric (inner product).”

The reader has probably begun to find this tedious by now. To be honest, I have too.

After all, we are computing on real space $(x,y,z)$—orthogonal Cartesian coordinates. In this space the Pythagorean theorem holds. Let us stop being shy and start using the inner product.

What is the inner product? We have already met it many times. In Chapter 2, when we obtained an oriented area, we said that if you take the square root of the sum of squares of the three shadow components $A_{yz}, A_{zx}, A_{xy}$, you recover the elementary-school area—that “sum of squares” is precisely the inner product with itself (or its square root). More generally, for column vectors among themselves, or for $1$-forms written as row vectors, the scalar quantity obtained by multiplying corresponding components and adding is what we call the inner product here.

...That is all. Most readers probably learned the inner product in high-school mathematics and are thinking, “Why state the obvious?” Yet in this book we have, uncomfortably, avoided defining the inner product head-on until now. There is a reason.

The reason is that if we bring out the inner product too soon, the distinctions we have cared about until now will collapse.

From a more advanced standpoint, “vector addition and subtraction” and the “inner product” are not inseparable; they can be thought of separately. On that view one distinguishes “spaces with an inner product” from “spaces without one,” and calls the matrix that defines the inner product the “metric tensor.” I respect that distinction too. That is precisely why we have repeatedly hedged that we have not yet formally introduced the metric or the inner product.

What we really want to distinguish here is the operation “row vector × column vector gives a scalar” from the operation “take the inner product between vectors of the same kind.”

Note (Why separate row and column so insistently) There is a standpoint that says one can describe physics quite powerfully even without putting the metric or inner product front and center—for example, elementary integration of $n$-forms and much undergraduate-level physics calculation work well on that view. I respect that standpoint too. On the other hand, the metric viewpoint that uses length, angle, inner product, and the Hodge star $\ast$ is of course also powerful. Personally, I think the ideal is to move back and forth between these two views as the problem demands. And to move back and forth, we must first not confuse “the operation measured by row vector × column vector” with “the operation that takes the inner product between vectors of the same kind.” That is why this book has been so insistent about separating row vectors and column vectors.

Since Chapter 1 we have done mountains of calculations that obtain a scalar from row vector × column vector. So how is that different from the inner product?

The conclusion: they are completely different things. Row vector times column vector is a calculation between vectors of different kinds—row and column. The inner product, by contrast, is a calculation between vectors of the same kind—“row with row” or “column with column.”

Yes—the matter is deeper than the reader may have thought. On the special stage of real space, these two often happen to return the same number, which has encouraged their confusion—and on top of that confusion a vast edifice called vector analysis has been built. The goal of this chapter is to make this distinction clear, then introduce the metric $g$ as the natural generalization of the inner product, and from there expose what the Hodge star $\ast$ really is.


§6.1 The Inner Product on Parameter Space — The True Nature of the Metric $g$

6.1.1 Inner Product in Real Space

Write two column vectors in real space $(x,y,z)$ in components:

$$\mathbf{v}_1 = \begin{pmatrix} x_1 \\ y_1 \\ z_1 \end{pmatrix},\qquad \mathbf{v}_2 = \begin{pmatrix} x_2 \\ y_2 \\ z_2 \end{pmatrix}$$

In this book, the inner product (dot product), denoted by $\cdot$, is the operation of multiplying components with the same index and adding the results.

$$\mathbf{v}_1 \cdot \mathbf{v}_2 = x_1 x_2 + y_1 y_2 + z_1 z_2$$

In $xyz$ coordinates, the inner product is computed simply by multiplying corresponding components and adding.

6.1.2 Rereading the Inner Product as a Measuring Device

Here let us reread the same calculation in the language of “measuring devices.” At first glance the inner product looks like an operation of a different lineage from the measuring devices we have handled until now. In fact, however, the inner product can also be read as an operation that “turns something into a measuring device, acts on something else, and yields a scalar.”

The entry point is the transpose. Transposing $\mathbf{v}_1$ turns a column vector into a row vector:

$$\mathbf{v}_1^T = \begin{pmatrix} x_1 & y_1 & z_1 \end{pmatrix}$$

This is a measuring device that eats the paired column vector $\mathbf{v}_2$ and measures “how large is the inner product with $\mathbf{v}_1$?”

$$\mathbf{v}_1^T\mathbf{v}_2 = \begin{pmatrix} x_1 & y_1 & z_1 \end{pmatrix} \begin{pmatrix} x_2 \\ y_2 \\ z_2 \end{pmatrix} = x_1x_2 + y_1y_2 + z_1z_2$$

That is, in orthogonal Cartesian coordinates on real space, transposing a column vector $\mathbf{v}_1$ alone yields the row vector—a measuring device—for taking the inner product with that vector.

Note (What raising and lowering indices really are) In tensor analysis there is a convention of distinguishing vector components from measuring-device components by upper and lower indices. One is then told that the metric $g$ is used to raise and lower indices. In the language of this book, the true nature of that is the operation “convert a column vector into a row vector that measures the inner product.” In orthogonal Cartesian coordinates $g=I$, so transpose alone suffices; but in parameter space, as we shall see next, we must insert $\mathbf{g}=J^T J$ along the way.

6.1.3 Taking the Inner Product in Parameter Space

What happens if we carry out the same rereading in parameter space $(r,\theta,z)$?

In orthogonal Cartesian coordinates on real space, $\mathbf{v}_1^T$ was directly the “measuring device for the inner product with $\mathbf{v}_1$.” So in parameter space too we are tempted to use $\mathbf{v}_1^T$ as the measuring device as-is. But that only computes $\Delta r_1 \Delta r_2 + \Delta\theta_1 \Delta\theta_2 + \Delta z_1 \Delta z_2$, which is not the correct inner product in real space. The correspondence between increments in parameter space and displacements in real space varies from place to place.

What, then, must we insert to turn a column vector in parameter space into a row vector that correctly measures the inner product in real space?

In Chapter 4 we saw how the Jacobian matrix of a coordinate change transforms components. Here we use that same matrix $J$ to read a displacement in parameter space as a displacement in real space.

$$J = \begin{pmatrix} \frac{\partial x}{\partial r} & \frac{\partial x}{\partial \theta} & \frac{\partial x}{\partial z} \\ \frac{\partial y}{\partial r} & \frac{\partial y}{\partial \theta} & \frac{\partial y}{\partial z} \\ \frac{\partial z}{\partial r} & \frac{\partial z}{\partial \theta} & \frac{\partial z}{\partial z} \end{pmatrix}$$

From the cylindrical-coordinate formulas $x = r\cos\theta,\; y = r\sin\theta,\; z = z$, computing partial derivatives gives:

$$J = \begin{pmatrix} \cos\theta & -r\sin\theta & 0 \\ \sin\theta & r\cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

Write the vectors $\mathbf{v}_1, \mathbf{v}_2$ in parameter space in components:

$$\mathbf{v}_1 = \begin{pmatrix} \Delta r_1 \\ \Delta\theta_1 \\ \Delta z_1 \end{pmatrix},\qquad \mathbf{v}_2 = \begin{pmatrix} \Delta r_2 \\ \Delta\theta_2 \\ \Delta z_2 \end{pmatrix}$$

Read in real space, these become $J\mathbf{v}_1$ and $J\mathbf{v}_2$ respectively. By the reading of the previous subsection, the measuring device for “the inner product with $J\mathbf{v}_1$” in real space is $(J\mathbf{v}_1)^T$. Therefore the inner product we want can be written as

$$\text{inner product} = (J\mathbf{v}_1)^T (J\mathbf{v}_2)$$

Using the rule for transposes, $(J\mathbf{v}_1)^T = \mathbf{v}_1^T J^T$, so

$$= \mathbf{v}_1^T (J^T J) \mathbf{v}_2$$

Let us actually compute $J^T J$. $J^T$ is $J$ with rows and columns swapped:

$$J^T = \begin{pmatrix} \cos\theta & \sin\theta & 0 \\ -r\sin\theta & r\cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

Therefore,

$$J^T J = \begin{pmatrix} \cos\theta & \sin\theta & 0 \\ -r\sin\theta & r\cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} \cos\theta & -r\sin\theta & 0 \\ \sin\theta & r\cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & r^2 & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

The algebraic identity $\cos^2\theta + \sin^2\theta = 1$ does all the geometry’s work, leaving $1, r^2, 1$ on the diagonal. We never once thought about arc length in the $\theta$ direction—it is a purely algebraic manipulation using only matrix product and transpose properties.

Thus a new measuring device appears. To build from a column vector $\mathbf{v}_1$ in parameter space a row vector that correctly measures the inner product in real space, we must multiply on the right by $J^T J$, not merely transpose:

$$\omega_{\mathbf{v}_1} = \mathbf{v}_1^T (J^T J)$$

Acting this measuring device on $\mathbf{v}_2$ yields the same value as the inner product in real space.

We call this $J^T J$ the metric matrix $\mathbf{g}$:

$$\mathbf{g} = J^T J$$

In $xyz$ coordinates we computed the inner product from the sum of products of components alone, never going through pullback. In parameter space we first read displacements in real space via $J$, then take the sum of products of components. Folding these two stages into matrix form puts $\mathbf{g}=J^T J$ in the middle—and its contents are found mechanically from partial derivatives and matrix products alone, as long as we know the Jacobian matrix $J$. No room remains anywhere for new axioms of geometry.

For safety, let us check with concrete numbers. Consider the point $r=2,\; \theta=\pi/3$. Then $\cos\theta = 1/2,\; \sin\theta = \sqrt{3}/2$, so

$$J = \begin{pmatrix} 1/2 & -\sqrt{3} & 0 \\ \sqrt{3}/2 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

In parameter space consider the unit displacement $\mathbf{v} = \begin{pmatrix}0 \\ 1 \\ 0\end{pmatrix}$ in the $\theta$ direction. We have $\mathbf{g} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & r^2 & 0 \\ 0 & 0 & 1 \end{pmatrix}$, and since $r=2$ now, $\mathbf{g} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 4 & 0 \\ 0 & 0 & 1 \end{pmatrix}$. Computing the inner product (square of length) in parameter space:

$$\mathbf{v}^T \mathbf{g} \,\mathbf{v} = \begin{pmatrix} 0 & 1 & 0 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ 0 & 4 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} = 4$$

On the other hand, reading the same $\mathbf{v}$ as a displacement in real space:

$$J\mathbf{v} = \begin{pmatrix} 1/2 & -\sqrt{3} & 0 \\ \sqrt{3}/2 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} = \begin{pmatrix} -\sqrt{3} \\ 1 \\ 0 \end{pmatrix}$$

The inner product (square of length) in real space is $(-\sqrt{3})^2 + 1^2 + 0^2 = 4$. They indeed agree. For $\mathbf{v} = \begin{pmatrix}1 \\ 0 \\ 0\end{pmatrix}$ (unit displacement in the $r$ direction) likewise, $\mathbf{v}^T \mathbf{g} \,\mathbf{v} = 1$ and $(J\mathbf{v})^T (J\mathbf{v}) = \cos^2\theta + \sin^2\theta = 1$ agree. In this way $\mathbf{g} = J^T J$ ties the inner products in parameter space and real space without contradiction.

We computed here in cylindrical coordinates $(r,\theta,z)$, but for spherical coordinates $(r,\theta,\phi)$ likewise $\mathbf{g}$ is obtained from $J$, with $\theta$ taken as the polar angle:

$$\mathbf{g}_{\text{spherical}} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & r^2 & 0 \\ 0 & 0 & r^2\sin^2\theta \end{pmatrix}$$

The factors $r^2$ and $r^2\sin^2\theta$ on the diagonal are likewise conversion factors derived mechanically from $J$, which lines up the linear coefficients of the coordinate change.

With $g$ as mediator, we can compute inner products (lengths and angles) consistently in any coordinate system. Moreover, the action of the Hodge star $\ast$, treated later, can be read naturally from these rules of inner product.


§6.2 Converting Between Column Vectors and Row Vectors Using $g$

6.2.1 $\mathbf{v}^T \mathbf{g}$ Is a Row Vector

Look once more at the inner-product formula from §6.1, $\mathbf{v}_1^T \mathbf{g} \,\mathbf{v}_2$. Matrix products can be executed from the left in order, so this expression can be read as “first compute $\mathbf{v}_1^T \mathbf{g}$, then multiply the result by $\mathbf{v}_2$.”

$\mathbf{v}_1$ is a column vector, but transposed $\mathbf{v}_1^T$ is a row vector. Multiplying a row vector by a square matrix again yields a row vector. So if we extract only the part $\mathbf{v}_1^T \mathbf{g}$,

$$\omega = \mathbf{v}^T \mathbf{g}$$

this is a row vector of size $1 \times 3$. The metric $g$ is, arising naturally from the matrix form of the inner product, a device that converts column vectors into row vectors.

Let us also write the reverse direction. Suppose a $1$-form given as a row vector

$$ \omega = \begin{pmatrix} \omega_1 & \omega_2 & \omega_3 \end{pmatrix} $$

is given. The corresponding column vector $\mathbf{v}_\omega$ is

$$ \mathbf{v}_\omega = \mathbf{g}^{-1}\omega^T $$

Note (Inverse matrix) For a square matrix $A$, the matrix $A^{-1}$ satisfying

$$ A^{-1}A = AA^{-1} = I $$

is called the inverse of $A$. Intuitively, it is the matrix that undoes the transformation by $A$. Here we use $\mathbf{g}^{-1}$ to return a row vector built with $\mathbf{g}$ to the original column vector.

In fact, if $\omega = \mathbf{v}^T\mathbf{g}$, transposing gives

$$ \omega^T = \mathbf{g}^T\mathbf{v} $$

Since $\mathbf{g}$ is symmetric, $\mathbf{g}^T = \mathbf{g}$. Therefore

$$ \mathbf{v} = \mathbf{g}^{-1}\omega^T $$

returns us to the start.

In orthogonal Cartesian coordinates on real space, $\mathbf{g}=I$, so this reverse conversion too looks like mere transpose. In a general coordinate system, however, $\mathbf{g}^{-1}$ is needed to return a row vector to a column vector.

Note (What we smuggled in Chapter 2) When we sought a matrix representation of a $2$-form in Chapter 2, we already used the form $\mathbf{v}_1^T M \mathbf{v}_2$. At that time we only said that “laying a column vector on its side is mathematically ill-behaved” and pushed on without going deeper. The $\mathbf{v}^T\mathbf{g}$ we are looking at now is the true nature of the operation we smuggled in then. In real space $\mathbf{g}=I$, so transpose alone makes a column vector look like a row vector. In parameter space, however, to convert a column vector into a row measuring device that reproduces the inner product in real space, we must pass explicitly through the metric $\mathbf{g}$.

Consider the meaning of this conversion. Since Chapter 1 we have distinguished “the figure being measured (column vector $\mathbf{v}$)” from “the measuring device (row vector $\omega$).” They were inhabitants of different worlds. But via $g$, a column vector that lived on the figure side changes form to the scale side—the row vector. Even for the same arrow, the “sensitivity as a scale” changes from place to place because $\mathbf{g}$ absorbs all the differences in calibration at each location.

In real space ($\mathbf{g}=I$) we have $\omega = \mathbf{v}^T$, and the components of the column vector become the components of the row vector as-is. In parameter space $\mathbf{g} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & r^2 & 0 \\ 0 & 0 & 1 \end{pmatrix}$, so $\omega = \begin{pmatrix} \Delta r & r^2\Delta\theta & \Delta z \end{pmatrix}$. Only the $\theta$ component is multiplied by $r^2$, reflecting that the $\theta$ direction’s scale is stretched by a factor of $r$.

Without $g$, we cannot convert between “the figure being measured” and “the measuring device” without strain. $g$ is the standard go-between that ties the two together.

6.2.2 What the Metric Gives — Geometric Size of Parallel $k$-Parallelepipeds

The most basic role of the metric $g$ is to define the actual geometric size (length, area, volume) of the parallel $k$-parallelepiped spanned by $k$ vectors.

In the chapters until now we performed “oriented counting” by feeding $k$ vectors to a $k$-form. $dx(\mathbf{v})$ is the number of the $x$ component of $\mathbf{v}$; $dx \wedge dy(\mathbf{v}_1, \mathbf{v}_2)$ is the oriented area of the parallelogram spanned by $\mathbf{v}_1, \mathbf{v}_2$. But these are measurements of shadows after all; how many meters of segment the vectors span, how many square meters of parallelogram they span under that metric, is determined only once the metric $g$ is given.

For $k=1$ (the segment spanned by one vector $\mathbf{v}$), the square of its length is defined by the inner product with itself, as we already derived in §6.1:

$$|\mathbf{v}|^2 = \mathbf{v}^T \mathbf{g} \,\mathbf{v}$$

In real space ($\mathbf{g}=I$), $|\mathbf{v}|^2 = \Delta x^2 + \Delta y^2 + \Delta z^2$. In the parameter space of cylindrical coordinates, $|\mathbf{v}|^2 = \Delta r^2 + r^2\Delta\theta^2 + \Delta z^2$.

For $k=0$, no displacement vector is fed in; the corresponding size is simply the absolute value of the scalar coefficient $|f|$.

The same holds for $k=2$ (area of the parallelogram spanned by two vectors) and $k=3$ (volume of the parallelepiped spanned by three vectors). As we saw in Chapter 2 §2.4.7, the square root of the sum of squares of the coefficients $A_{xy}, A_{yz}, A_{zx}$ of the three basis $2$-forms $dx \wedge dy, dy \wedge dz, dz \wedge dx$, namely $\sqrt{A_{yz}^2 + A_{zx}^2 + A_{xy}^2}$, gives the unsigned area of the parallelogram. In $xyz$ coordinates this calculation needs only the coefficients of the basis $2$-forms; no other information is required.

Note (Why we do not write $k=2$ in vector form) For $k=1$ we wrote length as the inner product $\mathbf{v}^T\mathbf{g}\,\mathbf{v}$ of the column vector $\mathbf{v}$. The analogous inner product for $k=2$—inner product between matrices (the Frobenius product in Appendix D)—we have not yet defined. Moreover, to display the coefficients of a $2$-form as a $3$-component vector requires the Hodge star $\ast$, which changes degree. Here we still treat a face as a face—that is, as coefficients of a $2$-form.

The role of the metric $g$ is to generalize this “computation in $xyz$” to coordinate systems with distorted scales, such as parameter space. In every case, once $g$ is fixed, a consistent geometric size is defined for parallel $k$-parallelepipeds of every degree—that is the most fundamental role of the metric.

6.2.3 Properties of the Inner Product

Just as in Chapter 5 we summarized the properties of the exterior derivative $d$ (linearity, raising degree, $d^2=0$, Leibniz rule), let us organize here the properties of the inner product that emerge from the calculations so far. These do not depend on a particular coordinate system.

  1. Symmetry: $\mathbf{v}_1 \cdot \mathbf{v}_2 = \mathbf{v}_2 \cdot \mathbf{v}_1$. Apply this to the matrix form derived in §6.1, $\mathbf{v}_1^T \mathbf{g} \,\mathbf{v}_2$. We have $\mathbf{v}_2 \cdot \mathbf{v}_1 = \mathbf{v}_2^T \mathbf{g} \,\mathbf{v}_1$, but a scalar is unchanged under transpose, so $\mathbf{v}_2^T \mathbf{g} \,\mathbf{v}_1 = (\mathbf{v}_2^T \mathbf{g} \,\mathbf{v}_1)^T = \mathbf{v}_1^T \mathbf{g}^T \mathbf{v}_2$. For this to equal $\mathbf{v}_1^T \mathbf{g} \,\mathbf{v}_2$ we must have $\mathbf{g} = \mathbf{g}^T$. That is, from symmetry of the inner product it follows that $\mathbf{g}$ must be a symmetric matrix.
  1. Linearity (bilinearity): $(a\mathbf{v}_1 + b\mathbf{v}_2) \cdot \mathbf{v}_3 = a(\mathbf{v}_1 \cdot \mathbf{v}_3) + b(\mathbf{v}_2 \cdot \mathbf{v}_3)$. Linear in each argument. This is nothing but the distributive law for matrix products, $(a\mathbf{v}_1 + b\mathbf{v}_2)^T \mathbf{g} \,\mathbf{v}_3 = a\mathbf{v}_1^T \mathbf{g} \,\mathbf{v}_3 + b\mathbf{v}_2^T \mathbf{g} \,\mathbf{v}_3$.
  1. Positive definiteness: $\mathbf{v} \cdot \mathbf{v} > 0$ ($\mathbf{v} \neq \mathbf{0}$). This guarantees that “square of length” is positive. In special relativity, where signs are mixed, this need not hold; but on the three-dimensional real space that is the stage of this book it always does.

Note (Symmetric vs antisymmetric matrices)

The fact that the area-measuring device in Chapter 2 required an antisymmetric matrix ($M = -M^T$) and that the inner product here requires a symmetric matrix ($\mathbf{g} = \mathbf{g}^T$) are exactly paired. Antisymmetry governs oriented counting; the symmetric metric governs length and angle, and from there the unsigned area and volume induced.

6.2.4 Letting the Geometry of the Metric Matrix Live in Its Components

The diagonal entries of $\mathbf{g}$ express “how much a step along that axis contributes to length”; the off-diagonal entries express “how much steps along different axes interfere with each other.” That $\mathbf{g}$ in cylindrical coordinates was diagonal is because the axes $r,\theta,z$ are mutually orthogonal. In any case, the single matrix $\mathbf{g}$ carries in its components all the “ruler properties” at that location.

Note (Axiomatic inner product—reversed order of axioms and matrix) In a standard linear-algebra textbook one first defines the three properties above as axioms, then derives the theorem that any operation satisfying them can be written, in some basis, in the form $\mathbf{v}^T \mathbf{g} \mathbf{w}$ with a symmetric matrix $\mathbf{g}$. The logical order is the reverse of ours. On the axiomatic side the matrix representation is only one representation; on the measuring-device reading of this book, this matrix $\mathbf{g}$ is the scale of space itself, the starting point of all measurement. Neither is “correct” in preference to the other. One enters from abstract properties; one enters from concrete symbolic operations of measurement. Only the direction that suits you differs.

Checkpoint so far — §6.0–§6.2

- The inner product is the operation of taking the sum of products of components between vectors of the same kind (row × row, or column × column). It is a different thing from row × column.

- The inner product in parameter space is computed by pulling back to real space first. In that process $J^T J$ appears naturally. That is the true nature of the metric $g$.

- $g$ converts column vectors to row vectors. Concretely, from a column vector $\mathbf{v}$ the corresponding $1$-form is built by $\omega_{\mathbf{v}} = \mathbf{v}^T\mathbf{g}$.

- Conversely, from a $1$-form $\omega$ given as a row vector, the corresponding column vector is $\mathbf{v}_\omega = \mathbf{g}^{-1}\omega^T$. In Cartesian coordinates $\mathbf{g}=I$, so both look like almost nothing but transpose.

- With $g$ as mediator, geometric size is defined for parallel $k$-parallelepipeds of every degree.

- From the properties of the inner product (symmetry, linearity, positive definiteness) the symmetry of $g$ follows. This pairs with the antisymmetry of the area-measuring device $\wedge$.


§6.3 The Hodge Star $\ast$ — The Correspondence Connecting Two Routes

6.3.1 Two Ways to Obtain a Scalar

By now the true nature of $g = J^T J$ has been revealed, and we can compute inner products in any coordinate system. Let us step back and look at the larger picture.

When we survey the formulas of physics, we notice that there is more than one way to produce a scalar quantity. When we integrate a field along a line or over a surface, a natural notation appears: something that measures acts on something that is measured and returns a number. On the other hand, quantities such as power and energy density are often written as inner products of the same kind of arrow, as in $\mathbf{E}\cdot\mathbf{J}$ or $\mathbf{A}\cdot(\nabla\times\mathbf{A})$.

In other words, at least on the surface, two conventions for obtaining scalars coexist in the notation of physics. In the language of this book, they are the following two:

Note (What to call the two methods—the author's dilemma) The author long wondered what to call these two conventions in a textbook. Calling Route ① the "differential forms method" is natural, but in this book we also write "d method" alongside it. For readers who do not yet know differential forms, the symbol $d$ is more familiar. The author personally also calls it the "topology method." For readers raised in topology or geometry, that name may feel more natural. Likewise, calling Route ② the "vector calculus method" is natural, but in this book we also write "$\nabla$ method" alongside it. The symbol $\nabla$ (nabla) is the most familiar symbol for readers who have studied physics. The author personally also calls it the "inner product method," using that name when emphasizing that the structure of the inner product sits at the core. All of these names refer to the same two conventions.

Note (Genealogy of the two notations) Route ① lies close to Grassmann's exterior algebra and Élie Cartan's line of development; Route ② lies close to the vector calculus systematized by Gibbs and Heaviside. This is not a matter of one being correct and the other wrong.

In fact, whichever method we adopt, the physical result we obtain in the end is the same. This is merely a difference of expression.

Yet a practical problem arises here. In the framework of Route ②, quantities that ought to be measured through a "surface" ($2$-forms) and quantities measured along a "line" ($1$-forms) are all treated as the same kind of "three-component arrow." To handle the notions of area and volume from Route ① within the framework of Route ②'s "inner product of arrows," we need a correspondence that links forms of different degrees.

The number of independent components of a $k$-form had a characteristic symmetry. As we saw in Chapter 2 §2.5.9, in three-dimensional space a $0$-form has $1$ component, a $1$-form has $3$, a $2$-form also has $3$, and a $3$-form has $1$—folded at the middle, the pattern $1, 3, 3, 1$ is symmetric left and right. Between a $1$-form and a $2$-form with the same number of independent components, and between a $0$-form and a $3$-form, some correspondence ought to exist. The device that supplies that correspondence is the Hodge star $\ast$.

$\ast$ uses the information of the inner product (that is, $g$) to assign to each form the partner that combines with it to make a volume. In the main text we first build this correspondence in the basis of physical space; verification in components is deferred to Appendix D.

Note (Between the two methods) We need not use only one of Route ① and Route ②. The author's ideal is to move freely between both. This book starts from Route ① (differential forms) in order to build a solid foundation that does not depend on distortion of space. On top of that, we want to be able to read the same quantities in the language of Route ② when needed—and $\ast$ is what bridges the two.

Note (For readers who know vector calculus) Many operations that in ordinary vector calculus are treated as face orientations or cross products can be read anew as the correspondence given by this $\ast$. In this chapter, however, we do not use those as known formulas; we rebuild them from $g$ and the basis $2$-forms of Chapter 2.

6.3.2 The $\ast$ Dictionary in Physical Space—Choosing the Partner That Makes a Volume

So how is $\ast$ determined?

Here we first consider orthogonal Cartesian coordinates in physical space $(x,y,z)$. In this space, $dx, dy, dz$ are mutually orthogonal reference measuring devices of length $1$. We also fix the orientation of space so that

$$ dx \wedge dy \wedge dz $$

is the positive volume form.

We may think of $\ast$ as the operation that, given a form, returns the partner which combines with it to yield a volume form.

For example, what $2$-form combines with $dx$ to produce the positive volume form? It is $dy \wedge dz$. Indeed,

$$ dx \wedge dy \wedge dz $$

is obtained. Therefore it is natural to set

$$ \ast dx = dy \wedge dz. $$

Likewise,

$$ dy \wedge dz \wedge dx = dx \wedge dy \wedge dz, $$

so

$$ \ast dy = dz \wedge dx, $$

and

$$ dz \wedge dx \wedge dy = dx \wedge dy \wedge dz, $$

so

$$ \ast dz = dx \wedge dy. $$

In other words, $\ast$ assigns to each line measuring device the surface measuring device that combines with it to give the positive volume form. By the same idea, the scalar $1$ corresponds to the volume form $dx \wedge dy \wedge dz$, and the volume form corresponds to the scalar $1$.

Therefore the $\ast$ dictionary in physical space is as follows.

$$\begin{aligned} \ast(1) &= dx \wedge dy \wedge dz, \qquad &\ast(dx \wedge dy \wedge dz) &= 1 \\ \ast(dx) &= dy \wedge dz, \qquad &\ast(dy \wedge dz) &= dx \\ \ast(dy) &= dz \wedge dx, \qquad &\ast(dz \wedge dx) &= dy \\ \ast(dz) &= dx \wedge dy, \qquad &\ast(dx \wedge dy) &= dz \end{aligned}$$

At this stage it is enough to read $\ast$ as a "dictionary." However, this correspondence is not a mere memorization table. Once a basis is chosen, $\ast$ can be written as an array representation of a linear transformation that rearranges coefficients. The array representation over all degrees is confirmed in Appendix D.

The meaning of this dictionary is simple. What corresponds to a line segment in the $x$ direction ($dx$) is the face perpendicular to the $x$ axis ($dy \wedge dz$). Likewise $y \leftrightarrow dz \wedge dx$, $z \leftrightarrow dx \wedge dy$. And the scalar $1$ ($0$-form) and the volume $dx \wedge dy \wedge dz$ ($3$-form) correspond to each other. This is precisely the manifestation of the symmetry $1, 3, 3, 1$ of independent component counts that we saw in Chapter 2.

Note (Why this dictionary is the right one) That $\ast dx = dy \wedge dz$ holds is because $dx, dy, dz$ form an orthonormal basis (mutually orthogonal, length $1$) and because we have taken the orientation of space so that $dx \wedge dy \wedge dz$ is positive in a right-handed system. Under these conditions, the dictionary above is determined uniquely. Once parameter space is taken into account, the $\ast$ dictionary depends on the metric $g$ and on how orientation is chosen—in other words, if $g$ is not $I$, the coefficients in the dictionary become more complicated.

6.3.3 $\ast\ast = \mathrm{id}$

As the dictionary above shows immediately, applying $\ast$ twice returns the original:

$$\ast(\ast(dx)) = \ast(dy \wedge dz) = dx.$$

In general, in three-dimensional Cartesian physical space, $\ast\ast = \mathrm{id}$ (the identity operator) holds. One application of $\ast$ reverses the degree of a form; two applications return it to the original.

Checkpoint so far — §6.3

- There are two ways to obtain a scalar. Route ① obtains a scalar by row $\times$ column action. Route ② reads it as an inner product of like vectors, and the metric $g$ appears there.

- The symmetry $1, 3, 3, 1$ seen in Chapter 2 suggests that a correspondence can be set up between $1$-forms and $2$-forms with the same number of independent components.

- The Hodge star $\ast$ returns, for a given form, the partner which combines with it to give the positive volume form. The dictionary in physical space includes $\ast dx = dy \wedge dz$, and so on.

- This dictionary is not a mere memorization table; once a basis is chosen, it can be written as an array representation of a linear transformation. Details of the array representation are confirmed in Appendix D.

- $\ast\ast = \mathrm{id}$ follows immediately from this dictionary.

6.3.4 Properties of $\ast$

Following the pattern of §6.2.3, where we summarized the properties of the inner product, and of Chapter 5, where we summarized the properties of the exterior derivative $d$, let us organize the properties of $\ast$ that emerge from the construction so far.

  1. Reversal of degree: In three-dimensional space, $\ast$ pairs $0$-forms with $3$-forms and $1$-forms with $2$-forms. In general it sends a $k$-form to a $(3-k)$-form.
  1. Linearity: $\ast(\omega_1 + \omega_2) = \ast\omega_1 + \ast\omega_2$. $\ast$ is the operation that extends the dictionary on the basis linearly with coefficients; linearity is immediate. Details of the array representation are confirmed in Appendix D.
  1. Correspondence with the inner product: For forms $\alpha, \beta$ of the same degree, combining $\alpha$ and $\ast\beta$ by the wedge product yields a $3$-form whose coefficient is the inner product of $\alpha$ and $\beta$. In Cartesian coordinates,
$$ \alpha \wedge \ast\beta = (\alpha\cdot\beta)\,dx\wedge dy\wedge dz $$

Here $\alpha\cdot\beta$ denotes the metric-induced inner product of forms of the same degree. For two $1$-forms it is the sum of products of components; for two $2$-forms it is the Frobenius product seen in Appendix D. This formula expresses that $\ast$ links the "scalar obtained by the inner product" with the "$3$-form obtained by the wedge product."

Note (Relation to more advanced definitions) In more advanced textbooks, $\alpha\wedge\ast\beta=(\alpha\cdot\beta)\,dx\wedge dy\wedge dz$ is often adopted as the definition of $\ast$. In this book we first built the dictionary on each basis element and its array representation; that is a concrete realization of this definition in Cartesian coordinates.

  1. $\ast\ast = \mathrm{id}$: Applying it twice returns the original (§6.3.3). This holds under the conditions of an orthonormal basis and a right-handed system; in Appendix D it can also be verified algebraically through antisymmetric matrices and the Frobenius product.
  1. Dependence on the metric $g$: The $\ast$ dictionary is determined by the metric $g$ of space. In Cartesian coordinates ($g=I$), the concise dictionary of §6.3.2 suffices; in a general coordinate system, the coefficients of $\ast$ become functions of position. We treat the general case in detail in Chapter 9.

These properties show that $\ast$ is not a mere "dictionary" but the very degree-reversal structure of a space equipped with a metric and an orientation.


§6.4 Examples of the Correspondence — Differential Forms and Vector Analysis

6.4.1 Joule Heating $P=VI$ — Two Routes

Let us verify in actual computation how $\ast$ concretely links differential forms with dot-product representations of like-kind vectors. The purpose of this section is not to learn formulas from electromagnetism or vector analysis. It is to survey, in two examples, how quantities that arise naturally in the language of differential forms can be read in the notation of scalar products and vector analysis.

Note (you need not know electromagnetism or vector analysis) From here on, for the sake of explanation we will use a few symbols from electromagnetism and vector analysis a little ahead of their formal introduction. You need not understand right now what $E$ or $J$ represent, or what $\mathbf{A}\cdot(\nabla\times\mathbf{A})$ means. What we want to see here is a single point: a $3$-form that appears naturally in the language of differential forms can be read, through the Hodge star $\ast$, as a scalar or an inner product. The relation to vector-analytic notation will peek through a little later, but a full systematic treatment is deferred to later chapters.

Revisit the high-school physics formula "electric power = voltage × current ($P=VI$)" as a phenomenon that occurs at each point in space.

First, let us compute by Route ① (differential forms). Take the wedge product of the $1$-form $E$ and the $2$-form $J$:

$$E \wedge J$$

The wedge product of a $1$-form and a $2$-form is a $3$-form (a volume density). This means "power per unit volume (W/m³)." Integrating this over a spatial region ($\int_V E \wedge J$) yields the total power consumed in the region (watts). This route uses only the operation of wedging a $1$-form and a $2$-form into a $3$-form.

Next, see how the same calculation is carried out by Route ② (dot product of like-kind vectors). To take an inner product treating both $E$ and $J$ as if they were $1$-forms, convert $J$, which measures surfaces, into a form that measures lines via $\ast$:

$$\text{Route ② display of }J \longleftrightarrow \ast J$$

Now we have two $1$-forms, $E$ and $\ast J$. Take their inner product (dot product). Here $E\cdot J$ is the Route ② display after $J$ has been read through $\ast$ as a like-kind object. To write the inner product of $1$-forms in the language of differential forms, apply $\ast$ to one side to obtain a $2$-form, wedge to a $3$-form, then apply $\ast$ overall to drop to a $0$-form:

$$\text{Route ② display }E\cdot J \longleftrightarrow \ast(E \wedge \ast(\ast J))$$

Here $\ast\ast = \mathrm{id}$, so the inner $\ast(\ast J) = J$:

$$\ast(E \wedge J)$$

The bracketed quantity $E \wedge J$ is exactly the same $3$-form obtained in Route ①. The outer $\ast$ strips the volume $dx \wedge dy \wedge dz$ from the $3$-form and extracts a scalar value. The way of writing that multiplies the scalar $E \cdot J$ by a small volume $dV$ and integrates again can be read as restoring afterward the volume that this $3$-form already carried.

In short, $\ast$ links the $3$-form of Route ① and the scalar display of Route ② without contradiction. Because $\ast\ast = \mathrm{id}$, either route leads to the same result.

6.4.2 Helicity $A \cdot (\nabla \times A)$ — The Round Trip of $\ast$

In one more example, let us see further how $\ast$ works. Consider the following $3$-form built from a $1$-form $\alpha$ and its exterior derivative $d\alpha$:

$$\alpha \wedge d\alpha$$

In the language of Route ②, this quantity appears as the inner product of a vector field $\mathbf{A}$ with its curl:

$$\mathbf{A} \cdot (\nabla \times \mathbf{A})$$

Write the $1$-form corresponding to $\mathbf{A}$ as $\alpha$. The $d\alpha$ is first obtained as a $2$-form; in the language of Route ② it corresponds to the $1$-form $\ast d\alpha$.

First compute by Route ①. The wedge product of the $1$-form $\alpha$ and the $2$-form $d\alpha$:

$$\alpha \wedge d\alpha$$

The wedge of a $1$-form and a $2$-form is a $3$-form. On this route, $\alpha$ and $d\alpha$ are combined directly by the wedge product.

Next look at the internal structure of the same calculation on Route ②. To read it as a dot product of like-kind vectors, we cannot leave the $2$-form $d\alpha$ as it is; we assign the corresponding $1$-form $\ast d\alpha$:

$$\nabla \times \mathbf{A} \longleftrightarrow \ast d\alpha$$

Now we have two $1$-forms, $\alpha$ and $\ast d\alpha$. Take their inner product:

$$\mathbf{A} \cdot (\nabla \times \mathbf{A}) \longleftrightarrow \ast(\alpha \wedge \ast(\ast d\alpha))$$

By $\ast\ast = \mathrm{id}$, the inner $\ast(\ast d\alpha) = d\alpha$. Therefore,

$$\ast(\alpha \wedge d\alpha)$$

The bracketed quantity is the same as the $3$-form $\alpha \wedge d\alpha$ from Route ①. The outer $\ast$ strips the volume and extracts a scalar value.

What deserves attention here is that $\ast$ appears twice and cancels via $\ast\ast = \mathrm{id}$. Inside Route ② we use $\ast d\alpha$ as the $1$-form corresponding to $d\alpha$ (a $2$-form), and call on $\ast$ again to take the inner product. This round trip is canceled by $\ast\ast = \mathrm{id}$.

Note (why the round trip of $\ast$ occurs) Vector analysis has no concept of the degree of a form and treats everything uniformly as "a three-component arrow." To read a $2$-form as an arrow, one must build the corresponding $1$-form with $\ast$; to take an inner product one must again use $\ast$ to return to the original degree. The Hodge star $\ast$ is the single correspondence that links these two representations.

Checkpoint so far — §6.4

- In the Joule-heating calculation, Route ① ($E \wedge J$) and Route ② ($E \cdot J = \ast(E \wedge J)$) agree via $\ast$.

- In the helicity calculation, $\ast$ is used twice and cancels by $\ast\ast = \mathrm{id}$. Route ① ($\alpha \wedge d\alpha$) and Route ② ($\mathbf{A} \cdot (\nabla \times \mathbf{A})$) give the same result.

- $\ast$ is the correspondence linking differential forms and vector analysis; $\ast\ast = \mathrm{id}$ guarantees consistency of the round trip.


§6.5 The Types of the Three Operations — grad, curl, div

By now we have the two operators $d$ (Chapter 5) and $\ast$ (this chapter). Combining them yields operations that move the degree of a form in various ways.

Among these, three types are especially named in three-dimensional vector analysis:

$$ 0 \to 1,\qquad 1 \to 2 \to 1,\qquad 1 \to 2 \to 3 \to 0 $$

In this book we call them, respectively,

$$ \mathrm{grad},\qquad \mathrm{curl},\qquad \mathrm{div} $$

This chapter stops at naming these three types. The correspondence with the usual $\nabla f$, $\nabla\times\mathbf F$, and $\nabla\cdot\mathbf F$ is treated again in later chapters.

Note (for readers who know vector analysis) By this point you may already see the usual $\nabla f$, $\nabla\times\mathbf F$, and $\nabla\cdot\mathbf F$ in the background. This chapter, however, does not formally develop the usual vector-analytic notation. Here we confirm the path: "from combinations of $d$ and $\ast$, extract the three types corresponding to grad, curl, and div." The usual nabla notation, the correspondence with Stokes' theorem, and why formulas grow long in curvilinear coordinates are taken up again in later chapters.

Below, we work first in Cartesian coordinates. Use the $1$-form

$$ \omega = P\,dx + Q\,dy + R\,dz $$

Readers who know vector analysis may read this as the field with components $(P,Q,R)$. A full systematic treatment of this correspondence is deferred to later chapters.

6.5.1 Gradient $\mathrm{grad}\,f = df$

The exterior derivative $df$ of a $0$-form (scalar field) $f$ is a $1$-form:

$$ df = \frac{\partial f}{\partial x}\,dx {}+ \frac{\partial f}{\partial y}\,dy {}+ \frac{\partial f}{\partial z}\,dz $$

In this book we call this the $1$-form representation of the gradient and write

$$ \mathrm{grad}\,f = df $$

The operator $d$ raises the degree from $0 \to 1$. That is the form of the gradient needed in this chapter. The relation to the column-vector display $\nabla f$ of usual vector analysis is organized again in later chapters.

6.5.2 Curl $\mathrm{curl}\,\omega = \ast\,d\,\omega$

For the $1$-form

$$ \omega = P\,dx + Q\,dy + R\,dz $$

applying $d$ yields the $2$-form $d\omega$:

$$d\omega = (\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z})\,dy \wedge dz + (\frac{\partial P}{\partial z} - \frac{\partial R}{\partial x})\,dz \wedge dx + (\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y})\,dx \wedge dy$$

Applying $\ast$ returns the $2$-form to a $1$-form:

$$\ast(d\omega) = (\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z})\,dx + (\frac{\partial P}{\partial z} - \frac{\partial R}{\partial x})\,dy + (\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y})\,dz$$

In this book we call this $1$-form the $1$-form representation of the curl and write

$$\mathrm{curl}\,\omega = \ast\,d\,\omega$$

The composition $\ast\,d$ has the degree transition

$$ 1 \to 2 \to 1 $$

This structure of "rising once to a surface form, then returning to a line form" corresponds to the curl. The relation to what is written $\nabla\times\mathbf F$ in usual vector analysis is treated again in later chapters.

6.5.3 Divergence $\mathrm{div}\,\omega = \ast\,d\,\ast\,\omega$

Apply $\ast$ first to the $1$-form $\omega$ to obtain a $2$-form. Apply $d$ to obtain a $3$-form. Apply $\ast$ once more to return to a $0$-form, namely a scalar field:

$$ \ast d\ast\omega $$

Computing in Cartesian coordinates,

$$\ast(d(\ast\omega)) = \frac{\partial P}{\partial x} + \frac{\partial Q}{\partial y} + \frac{\partial R}{\partial z}$$

In this book we call this $0$-form the divergence and write

$$\mathrm{div}\,\omega = \ast\,d\,\ast\,\omega$$

The composition $\ast\,d\,\ast$ has the degree transition

$$ 1 \to 2 \to 3 \to 0 $$

The relation to what is written $\nabla\cdot\mathbf F$ in usual vector analysis is treated again in later chapters.

6.5.4 The Shadow of $d^2 = 0$

Recall $d^2 = 0$ from Chapter 5. From this single fact, relations arise among the three operations.

First,

$$ \mathrm{curl}(\mathrm{grad}\,f) = \ast d(df) = \ast(d^2 f) = 0 $$

Also,

$$ \mathrm{div}(\mathrm{curl}\,\omega) = \ast d\ast(\ast d\omega) = \ast d(d\omega) = \ast(d^2\omega) = 0 $$

Here we stop at viewing these as identities on the side of forms. Readers who know vector analysis will see the correspondence with the usual identities. In this book, however, the meaning in the usual nabla notation is deferred to later chapters.


§6.6 Toward Vector Analysis

In Chapter 5 we acquired the exterior derivative $d$; in this chapter, the metric $g$ and the Hodge star $\ast$. With this, the main tools needed to move from the world of forms toward vector analysis are in place.

What we confirmed in this chapter is that from combinations of $d$ and $\ast$, one can extract the three types called grad, curl, and div in three-dimensional vector analysis.

Let us tabulate the correspondences obtained here.

Operation Decomposition in this book Degree transition Number of uses of $\ast$
$\mathrm{grad}$ $d$ $0 \to 1$ 0
$\mathrm{curl}$ $\ast\,d$ $1 \to 2 \to 1$ 1
$\mathrm{div}$ $\ast\,d\,\ast$ $1 \to 2 \to 3 \to 0$ 2

By now, a substantial part of the structure behind nabla notation has come into view. However, this chapter has not yet formally developed the usual vector analysis.

Readers who know vector analysis may already see the usual $\nabla$ notation rising from here. But this book will not rush. The systematic organization in the usual nabla notation is deferred to later chapters.

In later chapters we will use the $d$, $g$, and $\ast$ obtained here to return to the notation of usual vector analysis. Stokes' theorem, Gauss' theorem, and why formulas grow long in curvilinear coordinates will all be reread from this toolkit.

Note (by flipping just one sign in the metric) This chapter assumed a positive definite metric ($\mathbf{v}^T \mathbf{g} \,\mathbf{v} > 0$ for $\mathbf{v} \neq 0$). But on the four-dimensional spacetime stage of special relativity, an indefinite metric appears, such as $\mathbf{g} = \begin{pmatrix} -1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}$, with only the time component's sign flipped. This book is fixed on three-dimensional Cartesian physical space, so we do not go further here; but the idea learned in this chapter of "inserting the matrix $\mathbf{g}$" extends directly to four-dimensional spacetime. When that becomes necessary, I hope the reader will return to this chapter.


Checkpoint so far — Chapter 6 as a whole

- The inner product is an operation taking the product-sum of components between vectors of the same kind (row×row, or column×column). It is a different operation from row×column computation.

- In physical space the inner-product matrix is the identity $I$. In parameter space it is determined by the linear coefficients of coordinate transformation as $\mathbf{g}=J^T J$. That is, the metric $g$ is nothing other than the matrix that appears in the middle when one pulls back to physical space and takes the inner product.

- $\mathbf{v}^T \mathbf{g}$ converts a column vector into a row vector; $\mathbf{g}^{-1}\omega^T$ returns a row vector to column-vector display. Without $g$, objects and measuring devices cannot convert into each other.

- There are two ways to obtain a scalar: Route ① (row×column) and Route ② (inner product of like-kind vectors). The Hodge star $\ast$ is the correspondence linking the two.

- $\ast$ returns, for a given form, the partner that combines with it to yield a positive volume form. The dictionary in physical space is $\ast dx = dy \wedge dz$, etc., and it satisfies $\ast\ast = \mathrm{id}$.

- The Joule-heating and helicity examples show that Route ① and Route ② give the same result through $\ast$ and $\ast\ast = \mathrm{id}$.

- In Chapter 6 we confirmed that the three types $\mathrm{grad}=d$, $\mathrm{curl}=\ast\,d$, $\mathrm{div}=\ast\,d\,\ast$ appear. The systematic organization in the usual nabla notation is deferred to later chapters.

- From $d^2=0$, on the side of forms we get $\mathrm{curl}(\mathrm{grad}\,f)=0$ and $\mathrm{div}(\mathrm{curl}\,\omega)=0$. The meaning in usual vector analysis is treated again in later chapters.


Appendix D: Array Representation of the Hodge Star

§6.3.2 gave the dictionary for $\ast$ in real space. This appendix checks how that dictionary looks as array operations. The Hodge star is not an abstract symbol; once a basis is chosen, it is a linear transformation that can be displayed as a concrete array. We first view the $1$-form $\leftrightarrow$ $2$-form correspondence through antisymmetric matrices, and finally view the $0$-form $\leftrightarrow$ $3$-form correspondence through a third-order array.

D.1 $\ast_{1\to2}$ — placing three coefficients into an antisymmetric matrix

For the $1$-form

$$ \omega = \begin{pmatrix}P & Q & R\end{pmatrix} = P\,dx + Q\,dy + R\,dz $$

apply the dictionary from the main text,

$$ \ast dx = dy \wedge dz,\qquad \ast dy = dz \wedge dx,\qquad \ast dz = dx \wedge dy $$

to obtain

$$ \ast\omega = P\,dy\wedge dz + Q\,dz\wedge dx + R\,dx\wedge dy $$

In the antisymmetric matrix representation of Chapter 2, let the matrix representing $dy\wedge dz$ be

$$ E_1 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & -1 & 0 \end{pmatrix} $$

Similarly, let the matrix representing $dz\wedge dx$ be

$$ E_2 = \begin{pmatrix} 0 & 0 & -1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix} $$

and let the matrix representing $dx\wedge dy$ be

$$ E_3 = \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} $$

Therefore,

$$ \ast_{1\to2} = \begin{pmatrix} E_1\\ E_2\\ E_3 \end{pmatrix} $$

can be viewed as a "vertical vector of matrices." Multiplying the $1$-form $\omega=\begin{pmatrix}P&Q&R\end{pmatrix}$ on the left gives

$$ \omega\,\ast_{1\to2} = \begin{pmatrix}P&Q&R\end{pmatrix} \begin{pmatrix} E_1\\ E_2\\ E_3 \end{pmatrix} = P E_1+Q E_2+R E_3 $$

Hence

$$ \ast_{1\to2}(\omega) = \begin{pmatrix} 0 & R & -Q \\ -R & 0 & P \\ Q & -P & 0 \end{pmatrix} $$

The three coefficients $P,Q,R$ of the $1$-form have been placed into the three independent components of the antisymmetric matrix of the $2$-form. This is the most visible form of $\ast_{1\to2}$.

Note (relation to $\widehat{\epsilon}$ in Chapter 2) These $E_1,E_2,E_3$ are the same matrices written in Appendix A as $\varepsilon_{1,\cdot,\cdot},\varepsilon_{2,\cdot,\cdot},\varepsilon_{3,\cdot,\cdot}$. The essence of $\ast_{1\to2}$ is to place the first index of the Levi-Civita symbol $\varepsilon_{ijk}$ in the direction of the components of the $1$-form, and the remaining two indices in the directions of the $3\times3$ matrix. The same triple introduced in Chapter 2 as the volume-measuring device $\widehat{\epsilon}$ reappears here as the Hodge star.

D.2 $\ast_{2\to1}$ — extracting coefficients by the Frobenius product

The reverse map $\ast_{2\to1}$ is the operation that extracts three independent components from an antisymmetric matrix. Looking at matrix entries alone, it suffices to read $A=M_{23}$, $B=M_{31}$, $C=M_{12}$. That is, to return from the antisymmetric matrix

$$ M= \begin{pmatrix} 0 & C & -B \\ -C & 0 & A \\ B & -A & 0 \end{pmatrix} $$

representing the $2$-form

$$ \eta = A\,dy\wedge dz + B\,dz\wedge dx + C\,dx\wedge dy $$

to $A\,dx+B\,dy+C\,dz$, one need only read these three components. If we stop at this mere "reading," however, the transpose relation with $\ast_{1\to2}$ is hard to see. So we rewrite coefficient extraction itself as an inner product between matrices.

Define the inner product of $3\times3$ matrices $A,B$ by

$$ A\cdot B = \frac{1}{2}\operatorname{tr}(A^T B) = \frac{1}{2} \sum_{i=1}^{3} \sum_{j=1}^{3} A_{ij}B_{ij} $$

Note (trace) $\operatorname{tr}$ is the trace of a matrix (the sum of diagonal entries). It appeared when we treated the matrix representation of the exterior derivative in Appendix B. $\operatorname{tr}(A^T B)$ is the three-step operation "transpose $A$, multiply by $B$, and add the diagonal entries." Since the product $A\cdot B$ is equivalently $\frac{1}{2}\sum A_{ij}B_{ij}$, use the latter expression when you want to think in components.

This is the same notation as the inner product $\mathbf{v}_1 \cdot \mathbf{v}_2$ between column vectors in §6.2.3. Matrices, too, are "things of the same kind," and we multiply corresponding components and take the sum. Because we contract successively over the two indices $i$ and $j$, this is called the Frobenius product or consecutive contraction.

Note (the factor $1/2$) Some conventions adopt $\operatorname{tr}(A^T B)$ (without $1/2$) as the Frobenius product. For antisymmetric matrices, however, that picks up both members of a pair such as $M_{23}$ and $M_{32}=-M_{23}$, and the inner product is doubled. In this book we build $1/2$ into the definition. Then the inner product of the $2$-form $M$ with itself, $M\cdot M=M_{23}^2+M_{31}^2+M_{12}^2$, agrees directly with the "squared unsigned area of the parallelogram" in §6.2.2.

Note (the pull of abstraction) §6.2.3 touched on an axiomatic inner product, but once you actually work by hand like this, you can see why that brevity is attractive. For the inner product of column vectors we use $\mathbf{v}_1^T \mathbf{v}_2$; for the inner product of matrices we use $\frac{1}{2}\operatorname{tr}(A^T B)$—redefining the inner product from scratch every time the representation changes is, when you think about it, quite a burden. One notices the urge to lump these together and settle everything with one phrase: "an inner product is an operation satisfying certain axioms." Even so, this book does not abandon its policy of making representations explicit to the end. By this point, that is stubbornness.

The $E_1,E_2,E_3$ of D.1 are orthonormal with respect to this inner product. Indeed, each $E_k$ has only two nonzero components; for the same $E_k$ we get $\frac{1}{2}(1^2+(-1)^2)=1$, and for different $E_i,E_j$ the positions of the nonzero components do not overlap. Therefore,

$$ E_i\cdot E_j=\delta_{ij} $$

Using this orthonormality, the reverse transformation $\ast_{2\to1}$ can be written as inner products with $E_1,E_2,E_3$. Let $M$ be an arbitrary $3\times3$ matrix; when it comes from a $2$-form, $M$ is an antisymmetric matrix.

$$ \ast_{2\to1}(M) = \begin{pmatrix} E_1\cdot M & E_2\cdot M & E_3\cdot M \end{pmatrix} $$

Here $E_k\cdot M=\frac{1}{2}\operatorname{tr}(E_k^T M)$ is the Frobenius product defined above. In components,

$$ \begin{aligned} E_1 \cdot M &= \frac{1}{2}(M_{23}-M_{32}) \\ E_2 \cdot M &= \frac{1}{2}(M_{31}-M_{13}) \\ E_3 \cdot M &= \frac{1}{2}(M_{12}-M_{21}) \end{aligned} $$

If $M$ is an antisymmetric matrix, then $M_{32}=-M_{23}$, $M_{13}=-M_{31}$, $M_{21}=-M_{12}$, so

$$ \ast_{2\to1}(M) = \begin{pmatrix} M_{23} & M_{31} & M_{12} \end{pmatrix} $$

Therefore, $\ast_{2\to1}$ is at once the operation that extracts independent components from an antisymmetric matrix and coefficient extraction by the Frobenius product with $E_k$.

Writing out all components, $\ast_{2\to1}$ is the following "horizontal vector of matrices":

$$ \ast_{2\to1} = \begin{pmatrix} \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & -1 & 0 \end{pmatrix} & \begin{pmatrix} 0 & 0 & -1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix} & \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \end{pmatrix} $$

In $\ast_{1\to2}$ we multiply by coefficients and add; in $\ast_{2\to1}$ we extract coefficients by inner product. Vertical and horizontal, weighted sum and inner-product extraction, correspond to each other.

D.3 $\ast\ast=\mathrm{id}$ in the $1$-form and $2$-form case

Combining D.1 and D.2, $\ast\ast=\mathrm{id}$ for $1$-forms and $2$-forms appears as orthonormality of arrays.

For

$$ \omega=P\,dx+Q\,dy+R\,dz $$

D.1 gives

$$ \ast_{1\to2}(\omega)=P E_1+Q E_2+R E_3 $$

Applying $\ast_{2\to1}$ to this,

$$ \begin{aligned} \ast_{2\to1}(\ast_{1\to2}(\omega)) &= \begin{pmatrix} E_1\cdot (P E_1+Q E_2+R E_3) & E_2\cdot (P E_1+Q E_2+R E_3) & E_3\cdot (P E_1+Q E_2+R E_3) \end{pmatrix} \\ &= \begin{pmatrix} P & Q & R \end{pmatrix} = \omega \end{aligned} $$

In the last equality we used $E_i\cdot E_j=\delta_{ij}$. That is, the three coefficients placed into $E_1,E_2,E_3$ by $\ast_{1\to2}$ are recovered unchanged by $\ast_{2\to1}$ through inner products with the same $E_1,E_2,E_3$.

The reverse direction is the same. For the antisymmetric matrix

$$ M=P E_1+Q E_2+R E_3 $$

D.2 gives

$$ \ast_{2\to1}(M) = \begin{pmatrix} E_1\cdot M & E_2\cdot M & E_3\cdot M \end{pmatrix} = \begin{pmatrix} P & Q & R \end{pmatrix} $$

and applying D.1's $\ast_{1\to2}$,

$$ \ast_{1\to2}(\ast_{2\to1}(M)) = P E_1+Q E_2+R E_3 = M $$

Therefore, in the $1$-form and $2$-form case,

$$ \ast_{2\to1}(\ast_{1\to2}(\omega))=\omega, \qquad \ast_{1\to2}(\ast_{2\to1}(M))=M $$

hold. $\ast_{1\to2}$ places three coefficients into $E_1,E_2,E_3$; $\ast_{2\to1}$ extracts coefficients by inner product with $E_1,E_2,E_3$. Using the normalized Frobenius product, the transpose relation between the two appears as this correspondence between placement and coefficient extraction.

D.4 Array representation of $\ast_{0\to3}$ and $\ast_{3\to0}$

So far we have displayed $\ast_{1\to2}$ and $\ast_{2\to1}$ as transformations that move coefficient arrays. What remains is $\ast_{0\to3}$ and $\ast_{3\to0}$.

A $0$-form has a single coefficient. A $3$-form, viewed purely as an array, is a completely antisymmetric third-order array with three indices. Therefore, $\ast_{0\to3}$ is the transformation that spreads one component into $3\times3\times3$ components, and $\ast_{3\to0}$ is the transformation that extracts one component from $3\times3\times3$ components.

Here we bring $\varepsilon_{ijk}$, which also appeared in Chapter 2, to the fore and look at its form.

Note (what $\varepsilon_{ijk}$ really is) $\varepsilon_{ijk}$ is the completely antisymmetric symbol that appeared in Chapter 2. If the indices $(i,j,k)$ are an even permutation of $(1,2,3)$, it is $+1$; if an odd permutation, $-1$; if the same index appears twice, $0$. In an orthonormal Cartesian basis, it can be read as the component of a completely antisymmetric tensor. In this book we use those components as the representation of the Hodge star.

First, apply $\ast_{0\to3}$ to the $0$-form $f$. The output is a $3$-form, so it can be written in components with three indices. Here we take the components of $\ast_{0\to3}$ itself to be

$$ (\ast_{0\to3})_{ijk} = \varepsilon_{ijk} $$

Display this as three $3\times3$ matrices with $i$ fixed—that is, for each of $i=1,2,3$, arrange the $(j,k)$ components as a matrix:

$$ (\ast_{0\to3})_{1jk} = \begin{pmatrix} 0&0&0\\ 0&0&1\\ 0&-1&0 \end{pmatrix}, $$ $$ (\ast_{0\to3})_{2jk} = \begin{pmatrix} 0&0&-1\\ 0&0&0\\ 1&0&0 \end{pmatrix}, $$ $$ (\ast_{0\to3})_{3jk} = \begin{pmatrix} 0&1&0\\ -1&0&0\\ 0&0&0 \end{pmatrix}. $$

As you can see, these are exactly the antisymmetric matrices $E_1,E_2,E_3$ used in D.1. That is,

$$ (\ast_{0\to3})_{1jk}= (E_1)_{jk}, \qquad (\ast_{0\to3})_{2jk}= (E_2)_{jk}, \qquad (\ast_{0\to3})_{3jk}= (E_3)_{jk} $$

Almost nothing new happens here. The antisymmetric matrices $E_1,E_2,E_3$ seen in D.1 are now simply arranged as three slices with $i=1,2,3$. In the $1$-form and $2$-form case we read three coefficients as the coefficients of three antisymmetric matrices. In the $0$-form and $3$-form case we build, from one coefficient, the completely antisymmetric third-order array obtained by stacking all three slices at once.

Indeed, acting on the $0$-form $f$,

$$ (\ast_{0\to3}f)_{ijk} = (\ast_{0\to3})_{ijk}f = \varepsilon_{ijk}f $$

This is the operation of spreading the coefficient $f$ into a completely antisymmetric third-order array.

Next, consider the reverse map $\ast_{3\to0}$. This is the transformation that takes a third-order array and returns a single coefficient. Its components are

$$ (\ast_{3\to0})_{ijk} = \frac{1}{3!}\varepsilon_{ijk} $$

Therefore, displayed again as three $3\times3$ matrices with $i$ fixed,

$$ (\ast_{3\to0})_{1jk} = \frac{1}{3!} \begin{pmatrix} 0&0&0\\ 0&0&1\\ 0&-1&0 \end{pmatrix}, $$ $$ (\ast_{3\to0})_{2jk} = \frac{1}{3!} \begin{pmatrix} 0&0&-1\\ 0&0&0\\ 1&0&0 \end{pmatrix}, $$ $$ (\ast_{3\to0})_{3jk} = \frac{1}{3!} \begin{pmatrix} 0&1&0\\ -1&0&0\\ 0&0&0 \end{pmatrix}. $$

That is, $\ast_{3\to0}$ uses the same three antisymmetric matrices. The only difference is that the whole thing is multiplied by $1/3!$.

If we think of the indices $(i,j,k)$ lined up in a row as a single number, then $\ast_{0\to3}$ is a column vector that spreads one component into $27$ components, and $\ast_{3\to0}$ is a row vector that extracts one component from $27$ components. Apart from the normalization factor $1/3!$, the two are in a transpose relation.

Let a $3$-form be represented by its completely antisymmetric components $\eta_{ijk}$. Acting with $\ast_{3\to0}$ sums over all three indices:

$$ \ast_{3\to0}\eta = \sum_{i=1}^{3} \sum_{j=1}^{3} \sum_{k=1}^{3} (\ast_{3\to0})_{ijk}\eta_{ijk} $$

Therefore,

$$ \ast_{3\to0}\eta = \frac{1}{3!} \sum_{i=1}^{3} \sum_{j=1}^{3} \sum_{k=1}^{3} \varepsilon_{ijk}\eta_{ijk} $$

This is a triple contraction that extracts one coefficient from a completely antisymmetric third-order array.

The factor $1/3!$ plays the same role as the $1/2$ that appears in the Frobenius product of D.2. In an antisymmetric matrix, each independent component appears twice, so we multiply by $1/2$ to correct for the duplication. Here, each independent component of a completely antisymmetric third-order array appears $3!=6$ times, so we multiply by $1/3!$ to correct for the duplication. Alternatively, if we define the inner product of two $3$-forms by $\langle\alpha,\beta\rangle = \frac{1}{3!}\alpha_{ijk}\beta_{ijk}$, then after absorbing the normalization into the inner product we can say that $\ast_{0\to3}$ and $\ast_{3\to0}$ are in a transpose relation.

Let us verify that applying $\ast$ twice returns the original. First, start from the $0$-form $f$:

$$ \ast_{3\to0}(\ast_{0\to3}f) = \frac{1}{3!} \sum_{i=1}^{3} \sum_{j=1}^{3} \sum_{k=1}^{3} \varepsilon_{ijk}(\ast_{0\to3}f)_{ijk} $$

Since $(\ast_{0\to3}f)_{ijk}=\varepsilon_{ijk}f$,

$$ \ast_{3\to0}(\ast_{0\to3}f) = \frac{1}{3!} \sum_{i=1}^{3} \sum_{j=1}^{3} \sum_{k=1}^{3} \varepsilon_{ijk}\varepsilon_{ijk}f $$

Only the $3!=6$ cases in which $(i,j,k)$ is a permutation of $(1,2,3)$ are nonzero; then $\varepsilon_{ijk}\varepsilon_{ijk}=1$, so

$$ \ast_{3\to0}(\ast_{0\to3}f) = \frac{1}{3!}(3!)f = f $$

Let us also check the reverse direction. An arbitrary $3$-form has only one independent component in three dimensions. Therefore the completely antisymmetric components $\eta_{ijk}$ can be written using some coefficient $h$ as

$$ \eta_{ijk} = h\,\varepsilon_{ijk} $$

Then

$$ \ast_{3\to0}\eta = \frac{1}{3!} \sum_{i=1}^{3} \sum_{j=1}^{3} \sum_{k=1}^{3} \varepsilon_{ijk}(h\varepsilon_{ijk}) = h $$

Therefore,

$$ (\ast_{0\to3}(\ast_{3\to0}\eta))_{ijk} = (\ast_{0\to3}h)_{ijk} = h\varepsilon_{ijk} = \eta_{ijk} $$

Thus we have confirmed $\ast\ast=\mathrm{id}$ in the $0$-form and $3$-form case as well, in the array representation.

In the end, what we saw here is the same as in D.1–D.3. In $\ast_{1\to2}$ we distributed three coefficients among three antisymmetric matrices. In $\ast_{0\to3}$ we distributed one coefficient simultaneously among all three antisymmetric matrices. In $\ast_{2\to1}$ we extracted coefficients from an antisymmetric matrix; in $\ast_{3\to0}$ we extracted coefficients by triple contraction with the three antisymmetric matrices. The Hodge star, at every degree, is a linear transformation that can be displayed as a concrete array once a basis is chosen.

D.5 Summary

Checkpoint so far — Appendix D

- $\ast_{1\to2}$ is the linear transformation that places three coefficients into the antisymmetric matrices $E_1,E_2,E_3$.

- $\ast_{2\to1}$ is the linear transformation that extracts coefficients from an antisymmetric matrix, and can be written by the Frobenius product with $E_k$.

- Using the Frobenius product $A\cdot B=\frac{1}{2}\operatorname{tr}(A^TB)$, we have $E_i\cdot E_j=\delta_{ij}$.

- Therefore, in the $1$-form and $2$-form case, $\ast_{2\to1}(\ast_{1\to2}(\omega))=\omega$ and $\ast_{1\to2}(\ast_{2\to1}(M))=M$ hold.

- $\ast_{0\to3}$ can be displayed as a third-order array obtained by stacking the three antisymmetric matrices $E_1,E_2,E_3$. In components, $(\ast_{0\to3})_{ijk}=\varepsilon_{ijk}$.

- $\ast_{3\to0}$ can be displayed as the same array multiplied by the normalization factor $1/3!$. In components, $(\ast_{3\to0})_{ijk}=\frac{1}{3!}\varepsilon_{ijk}$.

- The factors $1/2$ and $1/3!$ correct for duplication of antisymmetric components.

- In the $0$-form/$3$-form case as well, $\ast_{3\to0}(\ast_{0\to3}f)=f$ and $\ast_{0\to3}(\ast_{3\to0}\eta)=\eta$ hold.

- Therefore, the Hodge star $\ast$, for both $0\leftrightarrow3$ and $1\leftrightarrow2$, is a linear transformation that can be displayed as a concrete array once a basis is chosen.