Einstein notation
From Wikipedia, the free encyclopedia
- For other topics related to Einstein, see Einstein (disambiguation).
In mathematics, especially in applications of linear algebra to physics, the Einstein notation or Einstein summation convention is a notational convention useful when dealing with coordinate formulae. It was introduced by Albert Einstein in 1916 [1].
According to this convention, when an index variable appears twice in a single term, once in an upper (superscript) and once in a lower (subscript) position, it implies that we are summing over all of its possible values. In typical applications, the indices are 1,2,3 (representing the three dimensions of physical Euclidean space), or 0,1,2,3 or 1,2,3,4 (representing the four dimensions of space-time, or Minkowski space), but they can have any range, even (in some applications) an infinite set. Abstract index notation is an improvement of Einstein notation.
In general relativity, the Greek alphabet and the Roman alphabet are used to distinguish whether summing over 1,2,3 or 0,1,2,3 (usually Roman, i, j, ... for 1,2,3 and Greek, μ, ν, ... for 0,1,2,3). As in sign conventions, the convention used in practice varies: Roman and Greek may be reversed.
Sometimes (as in general relativity), the index is required to appear once as a superscript and once as a subscript; in other applications, all indices are subscripts. See Dual vector space and Tensor product.
It is important to keep in mind that no new physical laws or ideas result from using Einstein notation; rather, it merely helps in identifying relationships and symmetries often 'hidden' by more conventional notation.
In some fields, Einstein notation is referred to simply as index notation, or indicial notation. Additionally, the use of the implied summation of repeated indices is referred to as the Einstein Sum Convention.
Contents |
[edit] Introduction
The basic idea of Einstein notation is very simple. It allows one to replace something bulky, such as:
- y = c1x1 + c2x2 + c3x3 + ... + cnxn
typically written as:
with something even simpler, in Einstein notation:
In Einstein notation, indices such as i in the equation above can appear as either subscripts or superscripts. The position of the index has a specific meaning. It is important, of course, not to interpret an index appearing in the superscript position as if it were an exponent, which is the convention in standard algebra. Here, the superscripted i above the symbol x represents an integer-valued index running from 1 to n.
The virtue of Einstein notation is that an index appearing two or more times in a single term implies summation across that index, so that the summation symbol is unnecessary. Since the summation in effect "eliminates" the index over which the sum is taken, the summation index does not appear on the opposite side of the equals sign.
[edit] Vector representations
First, we can use Einstein notation in linear algebra to distinguish easily between row vectors and column vectors. We could, for example, use superscripted indices to represent the elements of column vectors, and subscripted indices to represent the elements of row vectors. Following this convention, then,
represents an M × 1 column vector, and
represents a 1 × N row vector.
In mathematics and theoretical physics, and in particular general relativity, column vectors represent contravariant vectors whereas row vectors represent covariant vectors.
[edit] Matrix representation
Using standard notation, we can generate M × N matrix A by multiplying column vector u by row vector v:
In Einstein notation, we have:
Since i and j represent two different indices, and in this case over two different ranges M and N respectively, the indices are not eliminated by the multiplication. Both indices survive the multiplication to become the two indices of the newly-created matrix A.
[edit] Matrix multiplication
We can represent matrix multiplication as:
This expression is equivalent to the more conventional (and less compact) notation:
[edit] Vector dot product
In mechanics and engineering, vectors in 3D space are often described in relation to orthogonal unit vectors i, j and k.
If the basis vectors i, j, and k are instead expressed as e1, e2, and e3, a vector can be expressed in terms of a summation:
In Einstein notation, the summation symbol is omitted since the index i is repeated and we simply write
Using e1, e2, and e3 instead of i, j, and k, together with Einstein notation, we obtain a concise algebraic presentation of vector and tensor equations. For example,
or equivalently:
where
and is the Kronecker delta, which is equal to 1 when i = j, and 0 otherwise. It logically follows that this allows one j in the equation to be converted to an i, or one i to be converted to a j. Then,
[edit] Vector cross product
For the cross product,
where and
is the Levi-Civita symbol defined by:
which recovers
from
.
Additionally, if , then
and
. This also highlights that when an index appears once on both sides of the equation, this implies a system of equations instead of a summation:
Alternatively, this could be expressed as
but, this isn't the notation Einstein used.
[edit] Abstract definitions
In the traditional usage, one has in mind a vector space V with finite dimension n, and a specific basis of V. We can write the basis vectors as e1, e2, ..., en. Then if v is a vector in V, it has coordinates v1, ..., vn relative to this basis.
The basic rule is:
- v = vi ei.
In this expression, it was assumed that the term on the right side was to be summed as i goes from 1 to n, because the index i does not appear on both sides of the expression. (Or, using Einstein's convention, because the index i appeared twice.)
The i is known as a dummy index since the result is not dependent on it; thus we could also write, for example:
- v = vj ej.
An index that is not summed over is a free index and should be found in each term of the equation or formula.
In contexts where the index must appear once as a subscript and once as a superscript, the basis vectors ei retain subscripts but the coordinates become vi with superscripts. Then the basic rule is:
- v = vi ei.
The value of the Einstein convention is that it applies to other vector spaces built from V using the tensor product and duality. For example, , the tensor product of V with itself, has a basis consisting of tensors of the form
. Any tensor T in
can be written as:
.
V*, the dual of V, has a basis e1, e2, ..., en which obeys the rule
.
Here δ is the Kronecker delta, so is 1 if i =j and 0 otherwise.
[edit] Examples
Einstein summation is clarified with the help of a few simple examples. Consider four-dimensional spacetime, where indices run from 0 to 3:
The above example is one of contraction, a common tensor operation. The tensor becomes a new tensor by summing over the first upper index and the lower index. Typically the resulting tensor is renamed with the contracted indices removed:
For a familiar example, consider the dot product of two vectors a and b. The dot product is defined simply as summation over the indices of a and b:
which is our familiar formula for the vector dot product. Remember it is sometimes necessary to change the components of a in order to lower its index; however, this is not necessary in Euclidean space, or any space with a metric equal to its inverse metric (e.g., flat spacetime).
[edit] See also
[edit] References
- ^ Einstein, Albert (1916). "The Foundation of the General Theory of Relativity" (PDF). Annalen der Physik. Retrieved on 2006-09-03.