Aller au contenu principal

Differential (mathematics)


Differential (mathematics)


In mathematics, differential refers to several related notions derived from the early days of calculus, put on a rigorous footing, such as infinitesimal differences and the derivatives of functions.

The term is used in various branches of mathematics such as calculus, differential geometry, algebraic geometry and algebraic topology.

Introduction

The term differential is used nonrigorously in calculus to refer to an infinitesimal ("infinitely small") change in some varying quantity. For example, if x is a variable, then a change in the value of x is often denoted Δx (pronounced delta x). The differential dx represents an infinitely small change in the variable x. The idea of an infinitely small or infinitely slow change is, intuitively, extremely useful, and there are a number of ways to make the notion mathematically precise.

Using calculus, it is possible to relate the infinitely small changes of various variables to each other mathematically using derivatives. If y is a function of x, then the differential dy of y is related to dx by the formula d y = d y d x d x , {\displaystyle dy={\frac {dy}{dx}}\,dx,} where d y d x {\displaystyle {\frac {dy}{dx}}\,} denotes the derivative of y with respect to x. This formula summarizes the intuitive idea that the derivative of y with respect to x is the limit of the ratio of differences Δyx as Δx becomes infinitesimal.

Basic notions

  • In calculus, the differential represents a change in the linearization of a function.
    • The total differential is its generalization for functions of multiple variables.
  • In traditional approaches to calculus, the differentials (e.g. dx, dy, dt, etc.) are interpreted as infinitesimals. There are several methods of defining infinitesimals rigorously, but it is sufficient to say that an infinitesimal number is smaller in absolute value than any positive real number, just as an infinitely large number is larger than any real number.
  • The differential is another name for the Jacobian matrix of partial derivatives of a function from Rn to Rm (especially when this matrix is viewed as a linear map).
  • More generally, the differential or pushforward refers to the derivative of a map between smooth manifolds and the pushforward operations it defines. The differential is also used to define the dual concept of pullback.
  • Stochastic calculus provides a notion of stochastic differential and an associated calculus for stochastic processes.
  • The integrator in a Stieltjes integral is represented as the differential of a function. Formally, the differential appearing under the integral behaves exactly as a differential: thus, the integration by substitution and integration by parts formulae for Stieltjes integral correspond, respectively, to the chain rule and product rule for the differential.

History and usage

Infinitesimal quantities played a significant role in the development of calculus. Archimedes used them, even though he did not believe that arguments involving infinitesimals were rigorous. Isaac Newton referred to them as fluxions. However, it was Gottfried Leibniz who coined the term differentials for infinitesimal quantities and introduced the notation for them which is still used today.

In Leibniz's notation, if x is a variable quantity, then dx denotes an infinitesimal change in the variable x. Thus, if y is a function of x, then the derivative of y with respect to x is often denoted dy/dx, which would otherwise be denoted (in the notation of Newton or Lagrange) or y. The use of differentials in this form attracted much criticism, for instance in the famous pamphlet The Analyst by Bishop Berkeley. Nevertheless, the notation has remained popular because it suggests strongly the idea that the derivative of y at x is its instantaneous rate of change (the slope of the graph's tangent line), which may be obtained by taking the limit of the ratio Δyx as Δx becomes arbitrarily small. Differentials are also compatible with dimensional analysis, where a differential such as dx has the same dimensions as the variable x.

Calculus evolved into a distinct branch of mathematics during the 17th century CE, although there were antecedents going back to antiquity. The presentations of, e.g., Newton, Leibniz, were marked by non-rigorous definitions of terms like differential, fluent and "infinitely small". While many of the arguments in Bishop Berkeley's 1734 The Analyst are theological in nature, modern mathematicians acknowledge the validity of his argument against "the Ghosts of departed Quantities"; however, the modern approaches do not have the same technical issues. Despite the lack of rigor, immense progress was made in the 17th and 18th centuries. In the 19th century, Cauchy and others gradually developed the Epsilon, delta approach to continuity, limits and derivatives, giving a solid conceptual foundation for calculus.

In the 20th century, several new concepts in, e.g., multivariable calculus, differential geometry, seemed to encapsulate the intent of the old terms, especially differential; both differential and infinitesimal are used with new, more rigorous, meanings.

Differentials are also used in the notation for integrals because an integral can be regarded as an infinite sum of infinitesimal quantities: the area under a graph is obtained by subdividing the graph into infinitely thin strips and summing their areas. In an expression such as f ( x ) d x , {\displaystyle \int f(x)\,dx,} the integral sign (which is a modified long s) denotes the infinite sum, f(x) denotes the "height" of a thin strip, and the differential dx denotes its infinitely thin width.

Approaches

There are several approaches for making the notion of differentials mathematically precise.

  1. Differentials as linear maps. This approach underlies the definition of the derivative and the exterior derivative in differential geometry.
  2. Differentials as nilpotent elements of commutative rings. This approach is popular in algebraic geometry.
  3. Differentials in smooth models of set theory. This approach is known as synthetic differential geometry or smooth infinitesimal analysis and is closely related to the algebraic geometric approach, except that ideas from topos theory are used to hide the mechanisms by which nilpotent infinitesimals are introduced.
  4. Differentials as infinitesimals in hyperreal number systems, which are extensions of the real numbers that contain invertible infinitesimals and infinitely large numbers. This is the approach of nonstandard analysis pioneered by Abraham Robinson.

These approaches are very different from each other, but they have in common the idea of being quantitative, i.e., saying not just that a differential is infinitely small, but how small it is.

Differentials as linear maps

There is a simple way to make precise sense of differentials, first used on the Real line by regarding them as linear maps. It can be used on R {\displaystyle \mathbb {R} } , R n {\displaystyle \mathbb {R} ^{n}} , a Hilbert space, a Banach space, or more generally, a topological vector space. The case of the Real line is the easiest to explain. This type of differential is also known as a covariant vector or cotangent vector, depending on context.

Differentials as linear maps on R

Suppose f ( x ) {\displaystyle f(x)} is a real-valued function on R {\displaystyle \mathbb {R} } . We can reinterpret the variable x {\displaystyle x} in f ( x ) {\displaystyle f(x)} as being a function rather than a number, namely the identity map on the real line, which takes a real number p {\displaystyle p} to itself: x ( p ) = p {\displaystyle x(p)=p} . Then f ( x ) {\displaystyle f(x)} is the composite of f {\displaystyle f} with x {\displaystyle x} , whose value at p {\displaystyle p} is f ( x ( p ) ) = f ( p ) {\displaystyle f(x(p))=f(p)} . The differential d f {\displaystyle \operatorname {d} f} (which of course depends on f {\displaystyle f} ) is then a function whose value at p {\displaystyle p} (usually denoted d f p {\displaystyle df_{p}} ) is not a number, but a linear map from R {\displaystyle \mathbb {R} } to R {\displaystyle \mathbb {R} } . Since a linear map from R {\displaystyle \mathbb {R} } to R {\displaystyle \mathbb {R} } is given by a 1 × 1 {\displaystyle 1\times 1} matrix, it is essentially the same thing as a number, but the change in the point of view allows us to think of d f p {\displaystyle df_{p}} as an infinitesimal and compare it with the standard infinitesimal d x p {\displaystyle dx_{p}} , which is again just the identity map from R {\displaystyle \mathbb {R} } to R {\displaystyle \mathbb {R} } (a 1 × 1 {\displaystyle 1\times 1} matrix with entry 1 {\displaystyle 1} ). The identity map has the property that if ε {\displaystyle \varepsilon } is very small, then d x p ( ε ) {\displaystyle dx_{p}(\varepsilon )} is very small, which enables us to regard it as infinitesimal. The differential d f p {\displaystyle df_{p}} has the same property, because it is just a multiple of d x p {\displaystyle dx_{p}} , and this multiple is the derivative f ( p ) {\displaystyle f'(p)} by definition. We therefore obtain that d f p = f ( p ) d x p {\displaystyle df_{p}=f'(p)\,dx_{p}} , and hence d f = f d x {\displaystyle df=f'\,dx} . Thus we recover the idea that f {\displaystyle f'} is the ratio of the differentials d f {\displaystyle df} and d x {\displaystyle dx} .

This would just be a trick were it not for the fact that:

  1. it captures the idea of the derivative of f {\displaystyle f} at p {\displaystyle p} as the best linear approximation to f {\displaystyle f} at p {\displaystyle p} ;
  2. it has many generalizations.

Differentials as linear maps on Rn

If f {\displaystyle f} is a function from R n {\displaystyle \mathbb {R} ^{n}} to R {\displaystyle \mathbb {R} } , then we say that f {\displaystyle f} is differentiable at p R n {\displaystyle p\in \mathbb {R} ^{n}} if there is a linear map d f p {\displaystyle df_{p}} from R n {\displaystyle \mathbb {R} ^{n}} to R {\displaystyle \mathbb {R} } such that for any ε > 0 {\displaystyle \varepsilon >0} , there is a neighbourhood N {\displaystyle N} of p {\displaystyle p} such that for x N {\displaystyle x\in N} , | f ( x ) f ( p ) d f p ( x p ) | < ε | x p | . {\displaystyle \left|f(x)-f(p)-df_{p}(x-p)\right|<\varepsilon \left|x-p\right|.}

We can now use the same trick as in the one-dimensional case and think of the expression f ( x 1 , x 2 , , x n ) {\displaystyle f(x_{1},x_{2},\ldots ,x_{n})} as the composite of f {\displaystyle f} with the standard coordinates x 1 , x 2 , , x n {\displaystyle x_{1},x_{2},\ldots ,x_{n}} on R n {\displaystyle \mathbb {R} ^{n}} (so that x j ( p ) {\displaystyle x_{j}(p)} is the j {\displaystyle j} -th component of p R n {\displaystyle p\in \mathbb {R} ^{n}} ). Then the differentials ( d x 1 ) p , ( d x 2 ) p , , ( d x n ) p {\displaystyle \left(dx_{1}\right)_{p},\left(dx_{2}\right)_{p},\ldots ,\left(dx_{n}\right)_{p}} at a point p {\displaystyle p} form a basis for the vector space of linear maps from R n {\displaystyle \mathbb {R} ^{n}} to R {\displaystyle \mathbb {R} } and therefore, if f {\displaystyle f} is differentiable at p {\displaystyle p} , we can write d f p {\displaystyle \operatorname {d} f_{p}} as a linear combination of these basis elements: d f p = j = 1 n D j f ( p ) ( d x j ) p . {\displaystyle df_{p}=\sum _{j=1}^{n}D_{j}f(p)\,(dx_{j})_{p}.}

The coefficients D j f ( p ) {\displaystyle D_{j}f(p)} are (by definition) the partial derivatives of f {\displaystyle f} at p {\displaystyle p} with respect to x 1 , x 2 , , x n {\displaystyle x_{1},x_{2},\ldots ,x_{n}} . Hence, if f {\displaystyle f} is differentiable on all of R n {\displaystyle \mathbb {R} ^{n}} , we can write, more concisely: d f = f x 1 d x 1 + f x 2 d x 2 + + f x n d x n . {\displaystyle \operatorname {d} f={\frac {\partial f}{\partial x_{1}}}\,dx_{1}+{\frac {\partial f}{\partial x_{2}}}\,dx_{2}+\cdots +{\frac {\partial f}{\partial x_{n}}}\,dx_{n}.}

In the one-dimensional case this becomes d f = d f d x d x {\displaystyle df={\frac {df}{dx}}dx} as before.

This idea generalizes straightforwardly to functions from R n {\displaystyle \mathbb {R} ^{n}} to R m {\displaystyle \mathbb {R} ^{m}} . Furthermore, it has the decisive advantage over other definitions of the derivative that it is invariant under changes of coordinates. This means that the same idea can be used to define the differential of smooth maps between smooth manifolds.

Aside: Note that the existence of all the partial derivatives of f ( x ) {\displaystyle f(x)} at x {\displaystyle x} is a necessary condition for the existence of a differential at x {\displaystyle x} . However it is not a sufficient condition. For counterexamples, see Gateaux derivative.

Differentials as linear maps on a vector space

The same procedure works on a vector space with a enough additional structure to reasonably talk about continuity. The most concrete case is a Hilbert space, also known as a complete inner product space, where the inner product and its associated norm define a suitable concept of distance. The same procedure works for a Banach space, also known as a complete Normed vector space. However, for a more general topological vector space, some of the details are more abstract because there is no concept of distance.

For the important case of a finite dimension, any inner product space is a Hilbert space, any normed vector space is a Banach space and any topological vector space is complete. As a result, you can define a coordinate system from an arbitrary basis and use the same technique as for R n {\displaystyle \mathbb {R} ^{n}} .

Differentials as germs of functions

This approach works on any differentiable manifold. If

  1. U and V are open sets containing p
  2. f : U R {\displaystyle f\colon U\to \mathbb {R} } is continuous
  3. g : V R {\displaystyle g\colon V\to \mathbb {R} } is continuous

then f is equivalent to g at p, denoted f p g {\displaystyle f\sim _{p}g} , if and only if there is an open W U V {\displaystyle W\subseteq U\cap V} containing p such that f ( x ) = g ( x ) {\displaystyle f(x)=g(x)} for every x in W. The germ of f at p, denoted [ f ] p {\displaystyle [f]_{p}} , is the set of all real continuous functions equivalent to f at p; if f is smooth at p then [ f ] p {\displaystyle [f]_{p}} is a smooth germ. If

  1. U 1 {\displaystyle U_{1}} , U 2 {\displaystyle U_{2}} V 1 {\displaystyle V_{1}} and V 2 {\displaystyle V_{2}} are open sets containing p
  2. f 1 : U 1 R {\displaystyle f_{1}\colon U_{1}\to \mathbb {R} } , f 2 : U 2 R {\displaystyle f_{2}\colon U_{2}\to \mathbb {R} } , g 1 : V 1 R {\displaystyle g_{1}\colon V_{1}\to \mathbb {R} } and g 2 : V 2 R {\displaystyle g_{2}\colon V_{2}\to \mathbb {R} } are smooth functions
  3. f 1 p g 1 {\displaystyle f_{1}\sim _{p}g_{1}}
  4. f 2 p g 2 {\displaystyle f_{2}\sim _{p}g_{2}}
  5. r is a real number

then

  1. r f 1 p r g 1 {\displaystyle r*f_{1}\sim _{p}r*g_{1}}
  2. f 1 + f 2 : U 1 U 2 R p g 1 + g 2 : V 1 V 2 R {\displaystyle f_{1}+f_{2}\colon U_{1}\cap U_{2}\to \mathbb {R} \sim _{p}g_{1}+g_{2}\colon V_{1}\cap V_{2}\to \mathbb {R} }
  3. f 1 f 2 : U 1 U 2 R p g 1 g 2 : V 1 V 2 R {\displaystyle f_{1}*f_{2}\colon U_{1}\cap U_{2}\to \mathbb {R} \sim _{p}g_{1}*g_{2}\colon V_{1}\cap V_{2}\to \mathbb {R} }

This shows that the germs at p form an algebra.

Define I p {\displaystyle {\mathcal {I}}_{p}} to be the set of all smooth germs vanishing at p and I p 2 {\displaystyle {\mathcal {I}}_{p}^{2}} to be the product of ideals I p I p {\displaystyle {\mathcal {I}}_{p}{\mathcal {I}}_{p}} . Then a differential at p (cotangent vector at p) is an element of I p / I p 2 {\displaystyle {\mathcal {I}}_{p}/{\mathcal {I}}_{p}^{2}} . The differential of a smooth function f at p, denoted d f p {\displaystyle \mathrm {d} f_{p}} , is [ f f ( p ) ] p / I p 2 {\displaystyle [f-f(p)]_{p}/{\mathcal {I}}_{p}^{2}} .

A similar approach is to define differential equivalence of first order in terms of derivatives in an arbitrary coordinate patch. Then the differential of f at p is the set of all functions differentially equivalent to f f ( p ) {\displaystyle f-f(p)} at p.

Algebraic geometry

In algebraic geometry, differentials and other infinitesimal notions are handled in a very explicit way by accepting that the coordinate ring or structure sheaf of a space may contain nilpotent elements. The simplest example is the ring of dual numbers R[ε], where ε2 = 0.

This can be motivated by the algebro-geometric point of view on the derivative of a function f from R to R at a point p. For this, note first that f − f(p) belongs to the ideal Ip of functions on R which vanish at p. If the derivative f vanishes at p, then f − f(p) belongs to the square Ip2 of this ideal. Hence the derivative of f at p may be captured by the equivalence class [f − f(p)] in the quotient space Ip/Ip2, and the 1-jet of f (which encodes its value and its first derivative) is the equivalence class of f in the space of all functions modulo Ip2. Algebraic geometers regard this equivalence class as the restriction of f to a thickened version of the point p whose coordinate ring is not R (which is the quotient space of functions on R modulo Ip) but R[ε] which is the quotient space of functions on R modulo Ip2. Such a thickened point is a simple example of a scheme.

Algebraic geometry notions

Differentials are also important in algebraic geometry, and there are several important notions.

  • Abelian differentials usually mean differential one-forms on an algebraic curve or Riemann surface.
  • Quadratic differentials (which behave like "squares" of abelian differentials) are also important in the theory of Riemann surfaces.
  • Kähler differentials provide a general notion of differential in algebraic geometry.

Synthetic differential geometry

A fifth approach to infinitesimals is the method of synthetic differential geometry or smooth infinitesimal analysis. This is closely related to the algebraic-geometric approach, except that the infinitesimals are more implicit and intuitive. The main idea of this approach is to replace the category of sets with another category of smoothly varying sets which is a topos. In this category, one can define the real numbers, smooth functions, and so on, but the real numbers automatically contain nilpotent infinitesimals, so these do not need to be introduced by hand as in the algebraic geometric approach. However the logic in this new category is not identical to the familiar logic of the category of sets: in particular, the law of the excluded middle does not hold. This means that set-theoretic mathematical arguments only extend to smooth infinitesimal analysis if they are constructive (e.g., do not use proof by contradiction). Constuctivists regard this disadvantage as a positive thing, since it forces one to find constructive arguments wherever they are available.

Nonstandard analysis

The final approach to infinitesimals again involves extending the real numbers, but in a less drastic way. In the nonstandard analysis approach there are no nilpotent infinitesimals, only invertible ones, which may be viewed as the reciprocals of infinitely large numbers. Such extensions of the real numbers may be constructed explicitly using equivalence classes of sequences of real numbers, so that, for example, the sequence (1, 1/2, 1/3, ..., 1/n, ...) represents an infinitesimal. The first-order logic of this new set of hyperreal numbers is the same as the logic for the usual real numbers, but the completeness axiom (which involves second-order logic) does not hold. Nevertheless, this suffices to develop an elementary and quite intuitive approach to calculus using infinitesimals, see transfer principle.

Differential geometry

The notion of a differential motivates several concepts in differential geometry (and differential topology).

  • The differential (Pushforward) of a map between manifolds.
  • Differential forms provide a framework which accommodates multiplication and differentiation of differentials.
  • The exterior derivative is a notion of differentiation of differential forms which generalizes the differential of a function (which is a differential 1-form).
  • Pullback is, in particular, a geometric name for the chain rule for composing a map between manifolds with a differential form on the target manifold.
  • Covariant derivatives or differentials provide a general notion for differentiating of vector fields and tensor fields on a manifold, or, more generally, sections of a vector bundle: see Connection (vector bundle). This ultimately leads to the general concept of a connection.

Other meanings

The term differential has also been adopted in homological algebra and algebraic topology, because of the role the exterior derivative plays in de Rham cohomology: in a cochain complex ( C , d ) , {\displaystyle (C_{\bullet },d_{\bullet }),} the maps (or coboundary operators) di are often called differentials. Dually, the boundary operators in a chain complex are sometimes called codifferentials.

The properties of the differential also motivate the algebraic notions of a derivation and a differential algebra.

See also

  • Differential equation
  • Differential form
  • Differential of a function

Notes

Citations

References

  • Apostol, Tom M. (1967), Calculus (2nd ed.), Wiley, ISBN 978-0-471-00005-1.
  • Bell, John L. (1998), Invitation to Smooth Infinitesimal Analysis (PDF).
  • Boyer, Carl B. (1991), "Archimedes of Syracuse", A History of Mathematics (2nd ed.), John Wiley & Sons, Inc., ISBN 978-0-471-54397-8.
  • Darling, R. W. R. (1994), Differential forms and connections, Cambridge, UK: Cambridge University Press, ISBN 978-0-521-46800-8.
  • Eisenbud, David; Harris, Joe (1998), The Geometry of Schemes, Springer-Verlag, ISBN 978-0-387-98637-1
  • Keisler, H. Jerome (1986), Elementary Calculus: An Infinitesimal Approach (2nd ed.).
  • Kock, Anders (2006), Synthetic Differential Geometry (PDF) (2nd ed.), Cambridge University Press.
  • Lawvere, F.W. (1968), Outline of synthetic differential geometry (PDF) (published 1998).
  • Moerdijk, I.; Reyes, Gonzalo E. (1991), Models for Smooth Infinitesimal Analysis, Springer-Verlag, ISBN 978-1-441-93095-8.
  • Robinson, Abraham (1996), Non-standard analysis, Princeton University Press, ISBN 978-0-691-04490-3.
  • Weisstein, Eric W. "Differentials". MathWorld.

Text submitted to CC-BY-SA license. Source: Differential (mathematics) by Wikipedia (Historical)


ghbass