Misplaced Pages

Integral: Difference between revisions

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 09:04, 13 July 2007 editKSmrq (talk | contribs)5,323 editsm concatenate adjacent Harv refs as requested on talk← Previous edit Revision as of 20:42, 14 July 2007 edit undoDchristle (talk | contribs)261 edits IntroductionNext edit →
Line 50: Line 50:


== Introduction == == Introduction ==
Integrals thrust themselves upon us in many practical situations. Consider a swimming pool. If it is rectangular, then from its length, width, and depth we can easily determine the volume of water it can contain (to fill it), the area of its surface (to cover it), and the length of its edge (to rope it). But if it is oval with a rounded bottom, all of these quantities call for integrals. Practical approximations may suffice at first, but eventually we demand exact and rigorous answers to such problems. Integrals appear in many practical situations. Consider a swimming pool. If it is rectangular, then from its length, width, and depth we can easily determine the volume of water it can contain (to fill it), the area of its surface (to cover it), and the length of its edge (to rope it). But if it is oval with a rounded bottom, all of these quantities call for integrals. Practical approximations may suffice at first, but eventually we demand exact and rigorous answers to such problems.


Measuring depends on counting, with the unit of measurement as a key link. For example, recall the famous ] which says that if ''a'', ''b'', and ''c'' are the side lengths of a ], with ''c'' the longest, then ''a''<sup>2</sup>+''b''<sup>2</sup>&nbsp;= ''c''<sup>2</sup>. If we take the side of a square as one unit, its diagonal is &radic;2 units long, a measurement which is greater than one and less than two. If we take the side as five units, the diagonal is at least seven units long, but still not perfectly measured. A side of 12 units will have a diagonal of about 17 units, but slightly less. In fact, ] that exactly counts off the side length can ever give an exact count for the diagonal length. Thus the ]system is born, permitting an exact answer with no trace of approximating counts. It is as if the units are infinitely fine. Measuring depends on counting, with the unit of measurement as a key link. For example, recall the famous ] which says that if ''a'', ''b'', and ''c'' are the side lengths of a ], with ''c'' the longest, then ''a''<sup>2</sup>+''b''<sup>2</sup>&nbsp;= ''c''<sup>2</sup>. If we take the side of a square as one unit, its diagonal is &radic;2 units long, a measurement which is greater than one and less than two. If we take the side as five units, the diagonal is at least seven units long, but still not perfectly measured. A side of 12 units will have a diagonal of about 17 units, but slightly less. In fact, ] that exactly counts off the side length can ever give an exact count for the diagonal length. Thus the ]system is born, permitting an exact answer with no trace of approximating counts. It is as if the units are infinitely fine.
Line 63: Line 63:
was conceived as a weighted sum (denoted by the elongated "S"), with function values (such as the heights, ''y''&nbsp;= ''f''(''x'')) multiplied by infinitesimal step widths (denoted by ''dx'').<!-- Note: Today's limits of integration were not part of the notation until much later, due to Fourier. --> After the failure of early efforts to rigorously define infinitesimals, Riemann formally defined integrals as a ] of ordinary weighted sums, so that the ''dx'' suggested the limit of a difference (namely, the interval width). Shortcomings of Riemann's dependence on intervals and continuity motivated newer definitions, especially the Lebesgue integral, which is founded on an ability to extend the idea of "measure" in much more flexible ways. Thus the notation was conceived as a weighted sum (denoted by the elongated "S"), with function values (such as the heights, ''y''&nbsp;= ''f''(''x'')) multiplied by infinitesimal step widths (denoted by ''dx'').<!-- Note: Today's limits of integration were not part of the notation until much later, due to Fourier. --> After the failure of early efforts to rigorously define infinitesimals, Riemann formally defined integrals as a ] of ordinary weighted sums, so that the ''dx'' suggested the limit of a difference (namely, the interval width). Shortcomings of Riemann's dependence on intervals and continuity motivated newer definitions, especially the Lebesgue integral, which is founded on an ability to extend the idea of "measure" in much more flexible ways. Thus the notation
:<math> \int_A f(x) \, d\mu \,\!</math> :<math> \int_A f(x) \, d\mu \,\!</math>
refers to a weighted sum in which the function values are partitioned, with &mu; measuring the weight to be assigned to each value. (Here ''A'' denotes the region of integration.) ], with its "calculus on ]s", gives the familiar notation yet another interpretation. Now ''f''(''x'') and ''dx'' meld together to become something new, a ], &omega;&nbsp;= ''f''(''x'')''dx''; a ] '''d''' appears; and the fundamental theorem evolves into ], refers to a weighted sum in which the function values are partitioned, with &mu; measuring the weight to be assigned to each value. (Here ''A'' denotes the region of integration.) ], with its "calculus on ]s", gives the familiar notation yet another interpretation. Now ''f''(''x'') and ''dx'' become a ], &omega;&nbsp;= ''f''(''x'')''dx'', a new ] '''d''', known as the ] appears, and the fundamental theorem becomes the more general ],
:<math> \int_{A} \bold{d} \omega = \int_{\part A} \omega . \,\!</math> :<math> \int_{A} \bold{d} \omega = \int_{\part A} \omega , \,\!</math>
from which ], the ], and the ] follow.

More recently, infinitesimals have reappeared with rigor, through modern innovations such as ]. Not only do these methods vindicate the intuitions of the pioneers, they also lead to new mathematics. More recently, infinitesimals have reappeared with rigor, through modern innovations such as ]. Not only do these methods vindicate the intuitions of the pioneers, they also lead to new mathematics.



Revision as of 20:42, 14 July 2007

This article is about the concept of integrals in calculus. For other meanings, see integration and integral (disambiguation).
The integral of f(x) from a to b is the area above the x-axis and below the curve y = f(x), minus the area below the x-axis and above the curve, for x in the interval .

Integral is a core concept of advanced mathematics, specifically, in the fields of calculus and mathematical analysis. Given a function f(x) of a real variable x and an interval of the real line, the integral

a b f ( x ) d x {\displaystyle \int _{a}^{b}f(x)dx}

represents the area of a region in the xy-plane bounded by the graph of f, the x-axis, and the vertical lines x=a and x=b.

The idea of integration was formulated in the late seventeenth century by Isaac Newton and Gottfried Wilhelm Leibniz. Together with the concept of a derivative, integral became a basic tool of calculus, with numerous applications in science and engineering. Integration and differentiation are related by the Fundamental Theorem of Calculus.

A rigorous mathematical definition of the integral was given by Bernhard Riemann. It is based on a certain limiting procedure which imitates approximating the area of a curvilinear region by breaking the region into thin vertical slabs. Beginning in the nineteenth century, more sophisticated notions of integral began to appear, where the type of the function as well as the domain over which the integration is performed has been generalised. A line integral is defined for functions of two or three variables, and the interval of integration is replaced by a certain curve connecting two points on the plane or in the space. In a surface integral, the curve is replaced by a piece of a surface in the three-dimensional space. Integrals of differential forms play a fundamental role in modern differential geometry. These generalizations of integral first arose from the needs of physics, and they play an important role in the formulation of many physical laws, notably, in electrodynamics. An abstract mathematical theory known as Lebesgue integration was developed by Henri Lebesgue.

Part of a series of articles about
Calculus
a b f ( t ) d t = f ( b ) f ( a ) {\displaystyle \int _{a}^{b}f'(t)\,dt=f(b)-f(a)}
Differential
Definitions
Concepts
Rules and identities
Integral
Definitions
Integration by
Series
Convergence tests
Vector
Theorems
Multivariable
Formalisms
Definitions
Advanced
Specialized
Miscellanea

The term "integral" may also refer to the notion of antiderivative, a function F whose derivative is the given function f. In this case it is called an indefinite integral, while the integrals discussed in this article are termed definite integrals. The fundamental theorem of calculus asserts that an antiderivative can be used to compute the integral over an interval. Some authors, for example, Tom Apostol, maintain a distinction between antiderivatives and indefinite integrals.

History

Main article: History of calculus

Pre-calculus integration

Integration can be traced as far back as ancient Egypt, circa 1800 BC, with the Moscow Mathematical Papyrus demonstrating knowledge of a formula for the volume of a pyramidal frustrum. The first documented systematic technique capable of determining integrals is the method of exhaustion of Eudoxus (circa 370 BC), which sought to find areas and volumes by breaking them up into an infinite number of shapes for which the area or volume was known. This method was further developed and employed by Archimedes and used to calculate areas for parabolas and an approximation to the area of a circle. Similar methods were independently developed in China around the 3rd Century AD by Liu Hui, who used it to find the area of the circle. This method was later used by Zu Chongzhi to find the volume of a sphere.

Significant advances on techniques such as the method of exhaustion did not begin to appear until the 16th Century AD. At this time the work of Cavalieri with his method of indivisibles, and work by Fermat, began to lay the foundations of modern calculus. Further steps were made in the early 17th Century by Barrow and Torricelli, who provided the first hints of a connection between integration and differentiation.

Newton and Leibniz

The major advance in integration came in the 17th Century with the independent discovery of the fundamental theorem of calculus by Newton and Leibniz. The theorem demonstrates a connection between integration and differentiation. This connection, combined with the comparative ease of differentiation, can be exploited to calculate integrals. In particular, the fundamental theorem of calculus allows one to solve a much broader class of problems. Equal in importance is the comprehensive mathematical framework that both Newton and Leibniz developed. Given the name infinitesimal calculus, it allowed for precise analysis of functions within continuous domains. This framework eventually became modern Calculus, whose notation for integrals is drawn directly from the work of Leibniz.

Formalising integrals

While Newton and Leibniz provided systematic approach to integration, their work lacked a degree of rigour. Bishop Berkeley memorably attacked infinitesimals as "the ghosts of departed quantity". Calculus acquired a firmer footing with the development of limits and was given a suitable foundation by Cauchy in the first half of the 19th century. Integration was first rigorously formalised, using limits, by Riemann. Although all piecewise continuous and bounded functions are Riemann integrable on a bounded interval, subsequently more general functions were considered, to which Riemann's definition does not apply, and Lebesgue formulated a different definition of integral, founded in measure theory. Other definitions of integral, extending Riemann's and Lebesgue's approaches, were proposed.

Notation

Isaac Newton used a small vertical bar above a variable to indicate integration, or placed the variable inside a box. The vertical bar was easily confused with x ˙ {\displaystyle {\dot {x}}} or x {\displaystyle x'\,\!} , which Newton used to indicate differentiation, and the box notation was difficult for printers to reproduce, so these notations were not widely adopted.

The modern notation for the indefinite integral was introduced by Gottfried Leibniz in 1675 (Burton 1988, p. 359 harvnb error: no target: CITEREFBurton1988 (help); Leibniz 1899, p. 154). He derived the integral symbol, "∫", from an elongated letter S, standing for summa (Latin for "sum" or "total"). The modern notation for the definite integral, with limits above and below the integral sign, was first used by Joseph Fourier in Mémoires of the French Academy around 1819–20, reprinted in his book of 1822 (Cajori 1929, pp. 249–250; Fourier 1822, §231). In Arabic which is written from right to left, an inverted integral symbol File:ArabicIntegralSign.png is used (W3C 2006).

Terminology and notation

If a function has an integral, it is said to be integrable. The function for which the integral is calculated is called the integrand. The region over which a function is being integrated is called the domain of integration. In general the integrand may be a function of more than one variable, and the domain of integration may be an area, volume, a higher dimensional region, or even an abstract space that does not have a geometric structure in any usual sense.

The simplest case, the integral of a real-valued function f of one real variable x on the interval , is denoted by

a b f ( x ) d x . {\displaystyle \int _{a}^{b}f(x)\,dx.}

The ∫ sign, an elongated "S", represents integration; a and b are the lower limit and upper limit of integration, defining the domain of integration; f is the integrand, to be evaluated as x varies over the interval ; and dx can have different interpretations depending on the theory being used. For example, it can be seen as merely a notation indicating that x is the 'dummy variable' of integration, as a reflection of the weights in the Riemann sum, a measure (in Lebesgue integration and its extensions), an infinitesimal (in non-standard analysis) or as an independent mathematical quantity: a differential form. More complicated cases may vary the notation slightly.

Introduction

Integrals appear in many practical situations. Consider a swimming pool. If it is rectangular, then from its length, width, and depth we can easily determine the volume of water it can contain (to fill it), the area of its surface (to cover it), and the length of its edge (to rope it). But if it is oval with a rounded bottom, all of these quantities call for integrals. Practical approximations may suffice at first, but eventually we demand exact and rigorous answers to such problems.

Measuring depends on counting, with the unit of measurement as a key link. For example, recall the famous theorem in geometry which says that if a, b, and c are the side lengths of a right triangle, with c the longest, then a+b = c. If we take the side of a square as one unit, its diagonal is √2 units long, a measurement which is greater than one and less than two. If we take the side as five units, the diagonal is at least seven units long, but still not perfectly measured. A side of 12 units will have a diagonal of about 17 units, but slightly less. In fact, no choice of unit that exactly counts off the side length can ever give an exact count for the diagonal length. Thus the real number system is born, permitting an exact answer with no trace of approximating counts. It is as if the units are infinitely fine.

Approximations to integral of √x from 0 to 1, with ■ 5 right samples (above) and ■ 12 left samples (below)

So it is with integrals. For example, consider the curve y = f(x) between x = 0 and x = 1, with f(x) = √x. In the simpler case where y = 1, the region under the "curve" would be a unit square, so its area would be exactly 1. As it is, the area must be somewhat less. A smaller "unit of measure" should do better; so cross the interval in five steps, from 0 to ⁄5, from ⁄5 to ⁄5, and so on to 1. Fit a box for each step using the right end height of each curve piece, thus √⁄5, √⁄5, and so on to √1 = 1. The total area of these boxes is about 0.7497; but we can easily see that this is still too large. Using 12 steps in the same way, but with the left end height of each piece, yields about 0.6203, which is too small. As before, using more steps produces a closer approximation, but will never be exact. Thus the integral is born, permitting an exact answer of ⁄3, with no trace of approximating sums. It is as if the steps are infinitely fine (infinitesimal).

The most powerful new insight of Newton and Leibniz was that, under suitable conditions, the value of an integral over a region can be determined by looking at the region's boundary alone. For example, the surface area of the pool can be determined by an integral along its edge. This is the essence of the fundamental theorem of calculus. Applied to the square root curve, it says to look at the related function F(x) = ⁄3x, and simply take F(1)−F(0), where 0 and 1 are the boundaries of the interval . (This is an example of a general rule, that for f(x) = x, with q ≠ −1, the related function is F(x) = (x)/(q+1).)

In Leibniz's own notation — so well chosen it is still common today — the meaning of

f ( x ) d x {\displaystyle \int f(x)\,dx\,\!}

was conceived as a weighted sum (denoted by the elongated "S"), with function values (such as the heights, y = f(x)) multiplied by infinitesimal step widths (denoted by dx). After the failure of early efforts to rigorously define infinitesimals, Riemann formally defined integrals as a limit of ordinary weighted sums, so that the dx suggested the limit of a difference (namely, the interval width). Shortcomings of Riemann's dependence on intervals and continuity motivated newer definitions, especially the Lebesgue integral, which is founded on an ability to extend the idea of "measure" in much more flexible ways. Thus the notation

A f ( x ) d μ {\displaystyle \int _{A}f(x)\,d\mu \,\!}

refers to a weighted sum in which the function values are partitioned, with μ measuring the weight to be assigned to each value. (Here A denotes the region of integration.) Differential geometry, with its "calculus on manifolds", gives the familiar notation yet another interpretation. Now f(x) and dx become a differential form, ω = f(x)dx, a new differential operator d, known as the exterior derivative appears, and the fundamental theorem becomes the more general Stokes' theorem,

A d ω = A ω , {\displaystyle \int _{A}{\mathbf {d}}\omega =\int _{\partial A}\omega ,\,\!}

from which Green's Theorem, the divergence theorem, and the fundamental theorem of calculus follow.

More recently, infinitesimals have reappeared with rigor, through modern innovations such as non-standard analysis. Not only do these methods vindicate the intuitions of the pioneers, they also lead to new mathematics.

Although there are differences between these conceptions of integral, there is considerable overlap. Thus the area of the surface of the oval swimming pool can be handled as a geometric ellipse, as a sum of infinitesimals, as a Riemann integral, as a Lebesgue integral, or as a manifold with a differential form. The calculated result will be the same for all.

Formal definitions

There are many ways of formally defining an integral, not all of which are equivalent. The differences exist mostly to deal with differing special cases which may not be integrable under other definitions, but also occasionally for pedagogical reasons. The most commonly used definitions of integral are Riemann integrals and Lebesgue integrals.

Riemann integral

Main article: Riemann integral
Integral approached as Riemann sum based on tagged partition, with irregular sampling positions and widths (max in red). True value is 3.76; estimate is 3.648.

The Riemann integral is defined in terms of Riemann sums of functions with respect to tagged partitions of an interval. Let be a closed interval of the real line; then a tagged partition of is a finite sequence

a = x 0 t 1 x 1 t 2 x 2 x n 1 t n x n = b . {\displaystyle a=x_{0}\leq t_{1}\leq x_{1}\leq t_{2}\leq x_{2}\leq \cdots \leq x_{n-1}\leq t_{n}\leq x_{n}=b.\,\!}
Riemann sums converging as intervals halve, whether sampled at ■ right, ■ minimum, ■ maximum, or ■ left.

This partitions the interval into i sub-intervals , each of which is "tagged" with a distinguished point ti ∈ . Let Δi = xixi−1 be the width of sub-interval i; then the mesh of such a tagged partition is the width of the largest sub-interval formed by the partition, maxi=1…n Δi. A Riemann sum of a function f with respect to such a tagged partition is defined as

i = 1 n f ( t i ) Δ i ; {\displaystyle \sum _{i=1}^{n}f(t_{i})\Delta _{i};}

thus each term of the sum is the area of a rectangle with height equal to the function value at the distinguished point of the given sub-interval, and width the same as the sub-interval width. The Riemann integral of a function f over the interval is equal to S if:

For all ε > 0 there exists δ > 0 such that, for any tagged partition with mesh less than δ, we have
| S i = 1 n f ( t i ) Δ i | < ϵ . {\displaystyle \left|S-\sum _{i=1}^{n}f(t_{i})\Delta _{i}\right|<\epsilon .}

When the chosen tags give the maximum (respectively, minimum) value of each interval, the Riemann sum becomes an upper (respectively, lower) Darboux sum, suggesting the close connection between the Riemann integral and the Darboux integral.

Lebesgue integral

Main article: Lebesgue integral

The Riemann integral is not defined for a wide range of functions and situations of importance in applications (and of interest in theory). For example, the Riemann integral can easily integrate density to find the mass of a steel beam, but cannot accomodate a steel ball resting on it. This motivates other definitions, under which a broader assortment of functions is integrable (Rudin 1987). The Lebesgue integral, in particular, achieves great flexibility by directing attention to the weights in the weighted sum.

The definition of the Lebesgue integral thus begins with a measure, μ. In the simplest case, the Lebesgue measure μ(A) of an interval A = is its width, ba, so that the Lebesgue integral agrees with the (proper) Riemann integral when both exist. In more complicated cases, the sets being measured can be highly fragmented, with no continuity and no resemblence to intervals.

To exploit this flexibility, Lebesgue integrals reverse the approach to the weighted sum. As Folland (1984, p. 56) puts it, "To compute the Riemann integral of f, one partitions the domain into subintervals", while in the Lebesgue integral, "one is in effect partitioning the range of f".

One common approach first defines the integral of the indicator function of a measurable set A by:

1 A d μ = μ ( A ) {\displaystyle \int 1_{A}d\mu =\mu (A)} .

This extends by linearity to a measurable simple function s, which attains only a finite number, n, of distinct non-negative values:

s d μ = ( i = 1 n a i 1 A i ) d μ = i = 1 n a i 1 A i d μ = i = 1 n a i μ ( A i ) {\displaystyle {\begin{aligned}\int s\,d\mu &{}=\int \left(\sum _{i=1}^{n}a_{i}1_{A_{i}}\right)d\mu \\&{}=\sum _{i=1}^{n}a_{i}\int 1_{A_{i}}\,d\mu \\&{}=\sum _{i=1}^{n}a_{i}\,\mu (A_{i})\end{aligned}}}

(where the image of Ai under the simple function s is the constant value ai). Thus if E is a measurable set one defines

E s d μ = i = 1 n a i μ ( A i E ) . {\displaystyle \int _{E}s\,d\mu =\sum _{i=1}^{n}a_{i}\,\mu (A_{i}\cap E).}

Then for any non-negative measurable function f one defines

E f d μ = sup { E s d μ : 0 s f  and  s  is a simple function } ; {\displaystyle \int _{E}f\,d\mu =\sup \left\{\int _{E}s\,d\mu \,\colon 0\leq s\leq f{\text{ and }}s{\text{ is a simple function}}\right\};}

that is, the integral of f is set to be the supremum of all the integrals of simple functions that are less than or equal to f. A general measurable function f, is split into its positive and negative values by defining

f + ( x ) = { f ( x ) , if  f ( x ) > 0 0 , otherwise f ( x ) = { f ( x ) , if  f ( x ) < 0 0 , otherwise {\displaystyle {\begin{aligned}f^{+}(x)&{}={\begin{cases}f(x),&{\text{if }}f(x)>0\\0,&{\text{otherwise}}\end{cases}}\\f^{-}(x)&{}={\begin{cases}-f(x),&{\text{if }}f(x)<0\\0,&{\text{otherwise}}\end{cases}}\end{aligned}}}

Finally, f is Lebesgue integrable if

E | f | d μ < , {\displaystyle \int _{E}|f|\,d\mu <\infty ,\,\!}

and then the integral is defined by

E f d μ = E f + d μ E f d μ . {\displaystyle \int _{E}f\,d\mu =\int _{E}f^{+}\,d\mu -\int _{E}f^{-}\,d\mu .\,\!}

When the measure space on which the functions are defined is also a locally compact topological space (as is the case with the real numbers R), measures compatible with the topology in a suitable sense (Radon measures, of which the Lebesgue measure is an example) and integral with respect to them can be defined differently, starting from the integrals of continuous functions with compact support. More precisely, the compactly supported functions form a vector space that carries a natural topology, and a (Radon) measure can be defined as any continuous linear functional on this space; the value of a measure at a compactly supported function is then also by definition the integral of the function. One then proceeds to expand the measure (the integral) to more general functions by continuity, and defines the measure of a set as the integral of its indicator function. This is the approach taken by Bourbaki (2004) and a certain number of other authors. For details see Radon measures.

Other integrals

Although the Riemann and Lebesgue integrals are the most important definitions of the integral, a number of others exist, including:

General properties

Some properties are common to all definitions of the integral. Integration is essentially a linear functional, a mapping from a space of integrable functions into the real or complex numbers. It can also be formulated as an inner product defined on a vector space of integrable functions. In a general setting, with the domain of the functions being integrated being a measure space, the term "integrable" is replaced with "measurable".

The following linearity property necessarily holds: Let E be a measure space with measure μ, let F be either the real numbers R or the complex numbers C, and let L be the set of all bounded measurable functions f : EF. If

I : L F {\displaystyle I:L\rightarrow F\,}

is given by

I ( f ) = E f d μ {\displaystyle I(f)=\int _{E}f\,d\mu }

(which exists for each function f in L), then I is linear. That is, if f and g are measurable functions (elements of L), and if a and b are elements of F, then af + bg is also measurable, and therefore an element of L, and

I ( a f + b g ) = a I ( f ) + b I ( g ) . {\displaystyle I(af+bg)=aI(f)+bI(g).\,}

In the case of E being a closed interval , μ being the standard measure μ() = yx for each sub-interval of , and L being the set of Riemann-integrable functions on , then the operation becomes

I ( f ) = a b f d x {\displaystyle I(f)=\int _{a}^{b}f\,dx}

for all f in L.

Basic inequalities

  • Upper and lower bounds. It is necessary, for the integral of a function f to be defined on a closed and bounded interval , that f be bounded on that interval. Thus there are real numbers m and M so that mf (x) ≤ M for all x in . Since the lower and upper sums of f over are therefore bounded by, respectively, m(ba) and M(ba), it follows that
m ( b a ) a b f ( x ) d x M ( b a ) . {\displaystyle m(b-a)\leq \int _{a}^{b}f(x)\,dx\leq M(b-a).}
  • Inequalities between functions. If f(x) ≤ g(x) for each x in then each of the upper and lower sums of f is bounded above by the upper and lower sums, respectively, of g. Thus
a b f ( x ) d x a b g ( x ) d x . {\displaystyle \int _{a}^{b}f(x)\,dx\leq \int _{a}^{b}g(x)\,dx.}
  • Products and absolute values of functions. The functions fg, f , g , and | f |, defined by, respectively,
( f g ) ( x ) = f ( x ) g ( x ) , f 2 ( x ) = ( f ( x ) ) 2 , g 2 ( x ) = ( g ( x ) ) 2 , | f | ( x ) = | f ( x ) | , {\displaystyle {\begin{aligned}(fg)(x)&{}=f(x)g(x),\\f^{2}(x)&{}=(f(x))^{2},\\g^{2}(x)&{}=(g(x))^{2},\\|f|(x)&{}=|f(x)|,\end{aligned}}}
for each x in are integrable, with
| a b f ( x ) d x | a b | f ( x ) | d x {\displaystyle \left|\int _{a}^{b}f(x)\,dx\right|\leq \int _{a}^{b}|f(x)|\,dx}
and
( a b ( f g ) ( x ) d x ) 2 ( a b f ( x ) 2 d x ) ( a b g ( x ) 2 d x ) . {\displaystyle \left(\int _{a}^{b}(fg)(x)\,dx\right)^{2}\leq \left(\int _{a}^{b}f(x)^{2}\,dx\right)\left(\int _{a}^{b}g(x)^{2}\,dx\right).}

Conventions

The integral

a b f ( x ) d x {\displaystyle \int _{a}^{b}f(x)\,dx}

is defined formally if a < b. We take this as meaning that the upper and lower sums of the function f are evaluated on a partition a = x0x1 ≤ . . . ≤ xn = b whose values xi are increasing. Geometrically, this means that integration takes place "left to right", evaluating f within intervals where an interval with a higher index lies to the right of one with a lower index. If, on the other hand, a > b, then define

a b f ( x ) d x = b a f ( x ) d x , {\displaystyle \int _{a}^{b}f(x)\,dx=-\int _{b}^{a}f(x)\,dx,}

which implies, if a = b, that

a a f ( x ) d x = 0. {\displaystyle \int _{a}^{a}f(x)\,dx=0.}

The first convention is necessary in consideration of subintervals of ; the second says that an integral taken over a degenerate interval, or a point, should be zero. One reason for the first convention is that the integrability of f on an interval implies that f is integrable on any subinterval , but in particular, if c is any element of , then

a b f ( x ) d x = a c f ( x ) d x + c b f ( x ) d x . {\displaystyle \int _{a}^{b}f(x)\,dx=\int _{a}^{c}f(x)\,dx+\int _{c}^{b}f(x)\,dx.}

With the first convention the resulting relation

a c f ( x ) d x = a b f ( x ) d x c b f ( x ) d x = a b f ( x ) d x + b c f ( x ) d x {\displaystyle {\begin{aligned}\int _{a}^{c}f(x)\,dx&{}=\int _{a}^{b}f(x)\,dx-\int _{c}^{b}f(x)\,dx\\&{}=\int _{a}^{b}f(x)\,dx+\int _{b}^{c}f(x)\,dx\end{aligned}}}

is then well-defined for any cyclic permutation of a, b, and c.

Extensions

Improper integrals

Main article: Improper integral
The improper integral
0 d x ( x + 1 ) x = π {\displaystyle \int _{0}^{\infty }{\frac {dx}{(x+1){\sqrt {x}}}}=\pi }
has unbounded intervals for both domain and range.

A "proper" Riemann integral assumes the integrand is defined and finite on a closed and bounded interval, bracketed by the limits of integration. An improper integral occurs when one or more of these conditions is not satisfied. In some cases such integrals may be defined by considering the limit of a sequence of proper Riemann integrals on progressively larger intervals.

If the interval is unbounded, for instance at its upper end, then the improper integral is the limit as that endpoint goes to infinity.

a f ( x ) d x = lim b a b f ( x ) d x {\displaystyle \int _{a}^{\infty }f(x)dx=\lim _{b\to \infty }\int _{a}^{b}f(x)dx}

If the integrand is only defined or finite on a half-open interval, for instance (a,b], then again a limit may provide a finite result.

a b f ( x ) d x = lim ϵ 0 a + ϵ b f ( x ) d x {\displaystyle \int _{a}^{b}f(x)dx=\lim _{\epsilon \to 0}\int _{a+\epsilon }^{b}f(x)dx}

That is, the improper integral is the limit of proper integrals as one endpoint of the interval of integration approaches either a specified real number, or ∞, or −∞. In more complicated cases, limits are required at both endpoints, or at interior points.

Consider, for example, the function 1/((x+1)√x) integrated from 0 to ∞ (shown right). At the lower bound, as x goes to 0 the function goes to ∞; and the upper bound is itself ∞, though the function goes to 0. Thus this is a doubly improper integral. Integrated, say, from 1 to 3, an ordinary Riemann sum suffices to produce a result of π/6. To integrate from 1 to ∞, a Riemann sum is not possible. However, any finite upper bound, say t (with t > 1), gives a well-defined result, π/2 − 2 arctan(1/√t). This has a finite limit as t goes to infinity, namely π/2. Similarly, the integral from ⁄3 to 1 allows a Riemann sum as well, coincidentally again producing π/6. Replacing ⁄3 by an arbitrary positive value s (with s < 1) is equally safe, giving −π/2 + 2 arctan(1/√s). This, too, has a finite limit as s goes to zero, namely π/2. Combining the limits of the two fragments, the result of this improper integral is

0 d x ( x + 1 ) x = lim s 0 s 1 d x ( x + 1 ) x + lim t 1 t d x ( x + 1 ) x = lim s 0 ( π 2 + 2 arctan ( 1 / s ) ) + lim t ( π 2 2 arctan ( 1 / t ) ) = π 2 + π 2 = π . {\displaystyle {\begin{aligned}\int _{0}^{\infty }{\frac {dx}{(x+1){\sqrt {x}}}}&{}=\lim _{s\to 0}\int _{s}^{1}{\frac {dx}{(x+1){\sqrt {x}}}}+\lim _{t\to \infty }\int _{1}^{t}{\frac {dx}{(x+1){\sqrt {x}}}}\\&{}=\lim _{s\to 0}\left(-{\frac {\pi }{2}}+2\arctan(1/{\sqrt {s}})\right)+\lim _{t\to \infty }\left({\frac {\pi }{2}}-2\arctan(1/{\sqrt {t}})\right)\\&{}={\frac {\pi }{2}}+{\frac {\pi }{2}}\\&{}=\pi .\end{aligned}}}

This process is not guaranteed success; a limit may fail to exist, or may be unbounded. For example, over the bounded interval 0 to 1 the integral of 1/x does not converge; and over the unbounded interval 1 to ∞ the integral of 1/√x does not converge.

A depiction of the improper integral described to the left.

It may also happen that an integrand is unbounded at an interior point, in which case the integral must be split at that point, and the limit integrals on both sides must exist and must be bounded. Thus

1 1 d x x 2 3 = lim s 0 1 s d x x 2 3 + lim t 0 t 1 d x x 2 3 = lim s 0 3 ( 1 s 3 ) + lim t 0 3 ( 1 t 3 ) = 3 + 3 = 6. {\displaystyle {\begin{aligned}\int _{-1}^{1}{\frac {dx}{\sqrt{x^{2}}}}&{}=\lim _{s\to 0}\int _{-1}^{-s}{\frac {dx}{\sqrt{x^{2}}}}+\lim _{t\to 0}\int _{t}^{1}{\frac {dx}{\sqrt{x^{2}}}}\\&{}=\lim _{s\to 0}3(1-{\sqrt{s}})+\lim _{t\to 0}3(1-{\sqrt{t}})\\&{}=3+3\\&{}=6.\end{aligned}}}

But the similar integral

1 1 d x x {\displaystyle \int _{-1}^{1}{\frac {dx}{x}}\,\!}

cannot be assigned a value in this way, as the integrals above and below zero do not independently converge. (However, see Cauchy principal value.)

Multiple integration

Main article: Multiple integral

Integrals can be taken over regions other than intervals. In general, an integral over a set E of a function f is written:

E f ( x ) d x {\displaystyle \int _{E}f(x)\,dx}

Here x need not be a real number, but can be other suitable algebraic quantities. For instance, a vector in R. Fubini's theorem shows that such integrals can be rewritten as an iterated integral. In other words, the integral can be calculated by integrating one coordinate at a time.

Just as the definite integral of a positive function of one variable represents the area of the region between the graph of the function and the x-axis, the double integral of a positive function of two variables represents the volume of the region between the surface defined by the function and the plane which contains its domain. (Note that the same volume can be obtained via the triple integral — the integral of a function in three variables — of the constant function f(x, y, z) = 1 over the above-mentioned region between the surface and the plane.) If the number of variables is higher, one will calculate "hypervolumes" (volumes of solid of more than three dimensions) that cannot be graphed.

Multiple integral as volume under a surface.

For example, the volume of the parallelepiped of sides 4×6×5 may be obtained in two ways:

  • By the double integral
D 5   d x d y {\displaystyle \iint _{D}5\ dx\,dy}
of the function f(x, y) = 5 calculated in the region D in the xy-plane which is the base of the parallelepiped.
  • By the triple integral
p a r a l l e l e p i p e d 1 d x d y d z {\displaystyle \iiint _{\mathrm {parallelepiped} }1\,dx\,dy\,dz}
of the constant function 1 calculated on the parallelepiped itself.

Because it is impossible to calculate the antiderivative of a function of more than one variable, indefinite multiple integrals do not exist so they are all definite integrals.

Line integrals

Main article: Line integral
A line integral sums together elements along a curve.

The concept of an integral can be extended to more general domains of integration, such as curved lines and surfaces. Such integrals are known as line integrals and surface integrals respectively. These have important applications in physics, as when dealing with vector fields.

A line integral (sometimes called a path integral) is an integral where the function to be integrated is evaluated along a curve. Various different line integrals are in use. In the case of a closed curve it is also called a contour integral.

The function to be integrated may be a scalar field or a vector field. The value of the line integral is the sum of values of the field at all points on the curve, weighted by some scalar function on the curve (commonly arc length or, for a vector field, the scalar product of the vector field with a differential vector in the curve). This weighting distinguishes the line integral from simpler integrals defined on intervals. Many simple formulas in physics have natural continuous analogs in terms of line integrals; for example, the fact that work is equal to force multiplied by distance may be expressed (in terms of vector quantities) as:

W = F d {\displaystyle W={\vec {F}}\cdot {\vec {d}}} ;

which is paralleled by the line integral:

W = C F d s {\displaystyle W=\int _{C}{\vec {F}}\cdot d{\vec {s}}} ;

which sums up vector components along a continuous path, and thus finds the work done on an object moving through a field, such as an electric or gravitational field

Surface integrals

Main article: Surface integral
The definition of surface integral relies on splitting the surface into small surface elements.

A surface integral is a definite integral taken over a surface (which may be a curved set in space); it can be thought of as the double integral analog of the line integral. The function to be integrated may be a scalar field or a vector field. The value of the surface integral is the sum of the field at all points on the surface. This can be achieved by splitting the surface into surface elements, which provide the partitioning for Riemann sums.

For an example of applications of surface integrals, consider a vector field v on a surface S; that is, for each point x in S, v(x) is a vector. Imagine that we have a fluid flowing through S, such that v(x) determines the velocity of the fluid at x. The flux is defined as the quantity of fluid flowing through S in unit amount of time. To find the flux, we need to take the dot product of v with the unit surface normal to S at each point, which will give us a scalar field, which we integrate over the surface:

S v d S {\displaystyle \int _{S}{\mathbf {v} }\cdot \,d{\mathbf {S} }} .

The fluid flux in this example may be from a physical fluid such as water or air, or from electrical or magnetic flux. Thus surface integrals have applications in physics, particularly with the classical theory of electromagnetism.

Integrals of differential forms

A differential form is a mathematical concept in the fields of multivariate calculus, differential topology and tensors. The modern notation for the differential form, as well as the idea of the differential forms as being the wedge products of exterior derivatives forming an exterior algebra, was introduced by Élie Cartan.

We initially work in an open set in R. A 0-form is defined to be a smooth function f. When we integrate a function f over an m-dimensional subspace S of R, we write it as

S f d x 1 d x m . {\displaystyle \int _{S}f\,dx^{1}\cdots dx^{m}.}

(The superscripts are indices, not exponents.) We can consider dx through dx to be formal objects themselves, rather than tags appended to make integrals look like Riemann sums. Alternatively, we can view them as covectors, and thus a measure of "density" (hence integrable in a general sense). We call the dx, …,dx basic 1-forms.

We define the wedge product, "∧", a bilinear "multiplication" operator on these elements, with the alternating property that

d x a d x a = 0 {\displaystyle dx^{a}\wedge dx^{a}=0\,\!}

for all indices a. Note that alternation along with linearity implies dxdx = −dxdx. This also ensures that the result of the wedge product has an orientation.

We define the set of all these products to be basic 2-forms, and similarly we define the set of products of the form dxdxdx to be basic 3-forms. A general k-form is then a weighted sum of basic k-forms, where the weights are the smooth functions f. Together these form a vector space with basic k-forms as the basis vectors, and 0-forms (smooth functions) as the field of scalars. The wedge product then extends to k-forms in the natural way. Over R at most n covectors can be linearly independent, thus a k-form with k > n will always be zero, by the alternating property.

In addition to the wedge product, there is also the exterior derivative operator d. This operator maps k-forms to (k+1)-forms. For a k-form ω = f dx over R, we define the action of d by:

d ω = i = 1 n f x i d x i d x a . {\displaystyle {\mathbf {d}}{\omega }=\sum _{i=1}^{n}{\frac {\partial f}{\partial x_{i}}}dx^{i}\wedge dx^{a}.}

with extension to general k-forms occurring linearly.

This more general approach allows for a more natural coordinate-free approach to integration on manifolds. It also allows for a natural generalisation of the fundamental theorem of calculus, called Stoke's theorem, which we may state as

Ω d ω = Ω ω {\displaystyle \int _{\Omega }{\mathbf {d}}\omega =\int _{\partial \Omega }\omega \,\!}

where ω is a general k-form, and ∂Ω denotes the boundary of the region Ω. Thus in the case that ω is a 0-form and Ω is a closed interval of the real line, this reduces to the fundamental theorem of calculus. In the case that ω is a 1-form and Ω is a 2-dimensional region in the plane, the theorem reduces to Green's theorem. Similarly, using 2-forms, and 3-forms and Hodge duality, we can arrive at Stoke's theorem and the divergence theorem. In this way we can see that differential forms provide a powerful unifying view of integration.

Methods and applications

Computing integrals

The most basic technique for computing integrals of one real variable is based on the fundamental theorem of calculus. It proceeds like this:

  1. Choose a function f(x) and an interval .
  2. Find an antiderivative of f, that is, a function F such that F' = f.
  3. By the fundamental theorem of calculus, provided the integrand and integral have no singularities on the path of integration,
    a b f ( x ) d x = F ( b ) F ( a ) . {\displaystyle \int _{a}^{b}f(x)\,dx=F(b)-F(a).}
  4. Therefore the value of the integral is F(b) − F(a).

Note that the integral is not actually the antiderivative, but the fundamental theorem allows us to use antiderivatives to evaluate definite integrals.

The difficult step is often finding an antiderivative of f. It is rarely possible to glance at a function and write down its antiderivative. More often, it is necessary to use one of the many techniques that have been developed to evaluate integrals. Most of these techniques rewrite one integral as a different one which is hopefully more tractable. Techniques include:

Even if these techniques fail, it may still be possible to evaluate a given integral. The next most common technique is residue calculus, whilst for nonelementary integrals Taylor series can sometimes be used to find the antiderivative. There are also many less common ways of calculating definite integrals; for instance, Parseval's identity can be used to transform an integral over a rectangular region into an infinite sum. Occasionally, an integral can be evaluated by a trick; for an example of this, see Gaussian integral.

Computations of volumes of solids of revolution can usually be done with disk integration or shell integration.

Specific results which have been worked out by various techniques are collected in the list of integrals.

Symbolic algorithms

Main article: Symbolic integration

Many problems in mathematics, physics, and engineering involve integration where an explicit formula for the integral is desired. Extensive tables of integrals have been compiled and published over the years for this purpose. With the spread of computers, many professionals, educators, and students have turned to computer algebra systems that are specifically designed to perform difficult or tedious tasks, including integration. Symbolic integration presents a special challenge in the development of such systems.

A major mathematical difficulty in symbolic integration is that in many cases, a closed formula for the antiderivative of a rather innocently looking function simply does not exist. For instance, it is known that that the antiderivatives of the functions e, x and sin x /x cannot be expressed in the closed form involving only rational and exponential functions, logarithm, trigonometric and inverse trigonometric functions, and the operations of multiplication and composition; in other words, none of the three given functions is integrable in elementary functions. Differential Galois theory provides general criteria that allow one to determine whether the antiderivative of an elementary function is elementary. Unfortunately, it turns out that functions with closed expressions of antiderivatives are the exception rather than the rule. Consequently, computerized algebra systems have no hope of being able to find an antiderivative for a randomly constructed elementary function. On the positive side, if the 'building blocks' for antiderivatives are fixed in advance, it may be still be possible to decide whether the antiderivative of a given function can be expressed using these blocks and operations of multiplication and composition, and to find the symbolic answer whenever it exists. The Risch-Norman algorithm, implemented in Mathematica and the Maple computer algebra systems, does just that for functions and antiderivatives built from rational functions, radicals, logarithm, and exponential functions.

Some special integrands occur often enough to warrant special study. In particular, it may be useful to have, in the set of antiderivatives, the special functions of physics (like the Legendre functions, the hypergeometric function, the Gamma function and so on). Extending the Risch-Norman algorithm so that it includes these functions is possible but challenging.

Most humans are not able to integrate such general formulae, so in a sense computers are more skilled at integrating highly complicated formulae. Very complex formulae are unlikely to have closed-form antiderivatives, so how much of an advantage does this present is a philosophical question that is open for debate.

Numerical quadrature

Main article: numerical integration

The integrals encountered in a basic calculus course are deliberately chosen for simplicity; those found in real applications are not always so accommodating. Some integrals cannot be found exactly, some require special functions which themselves are a challenge to compute, and others are so complex that finding the exact answer is too slow. This motivates the study and application of numerical methods for approximating integrals, which today use floating point arithmetic on digital electronic computers. Many of the ideas arose much earlier, for hand calculations; but the speed of general-purpose computers like the ENIAC created a need for improvements.

The goals of numerical integration are accuracy, reliability, efficiency, and generality. Sophisticated methods can vastly outperform a naive method by all four measures (Dahlquist, Björck & forthcoming harvnb error: no target: CITEREFDahlquistBjörckforthcoming (help); Kahaner, Moler & Nash 1989; Stoer & Bulirsch 2002). Consider, for example, the integral

2 2 1 5 ( 1 100 ( 322 + 3 x ( 98 + x ( 37 + x ) ) ) 24 x 1 + x 2 ) d x , {\displaystyle \int _{-2}^{2}{\tfrac {1}{5}}\left({\tfrac {1}{100}}(322+3x(98+x(37+x)))-24{\frac {x}{1+x^{2}}}\right)dx,}

which has the exact answer ⁄25 = 3.76. (In ordinary practice the answer is not known in advance, so an important task — not explored here — is to decide when an approximation is good enough.) A “calculus book” approach divides the integration range into, say, 16 equal pieces, and computes function values.

Spaced function values
x −2.00 −1.50 −1.00 −0.50  0.00  0.50  1.00  1.50  2.00
f(x)  2.22800  2.45663  2.67200  2.32475  0.64400 −0.92575 −0.94000 −0.16963  0.83600
x   −1.75 −1.25 −0.75 −0.25  0.25  0.75  1.25  1.75
f(x)  2.33041  2.58562  2.62934  1.64019 −0.32444 −1.09159 −0.60387  0.31734
Numerical quadrature methods: ■ Rectangle, ■ Trapezoid, ■ Romberg, ■ Gauss

Using the left end of each piece, the rectangle method sums 16 function values and multiplies by the step width, h, here 0.25, to get an approximate value of 3.94325 for the integral. The accuracy is not impressive, but calculus formally uses pieces of infinitesimal width, so initially this may seem little cause for concern. Indeed, repeatedly doubling the number of steps eventually produces an approximation of 3.76001. However 2 pieces are required, a great computational expense for so little accuracy; and a reach for greater accuracy can force steps so small that arithmetic precision becomes an obstacle.

A better approach replaces the horizontal tops of the rectangles with slanted tops touching the function at the ends of each piece. This trapezoidal rule is almost as easy to calculate; it sums all 17 function values, but weights the first and last by one half, and again multiplies by the step width. This immediately improves the approximation to 3.76925, which is noticeably more accurate. Furthermore, only 2 pieces are needed to achieve 3.76000, substantially less computation than the rectangle method for comparable accuracy.

The Romberg method builds on the trapezoid method to great effect. First, the step lengths are halved incrementally, giving trapezoid approximations denoted by T(h0), T(h1), and so on, where hk+1 is half of hk. For each new step size, only half the new function values need to be computed; the others carry over from the previous size. But the really powerful idea is to interpolate a polynomial through the approximations, and extrapolate to T(0). With this method a numerically exact answer here requires only four pieces (five function values)! The Lagrange polynomial interpolating {hk,T(hk)}k=0…2 = {(4.00,6.128), (2.00,4.352), (1.00,3.908)} is 3.76+0.148h, producing the extrapolated value 3.76 at h = 0.

Gaussian quadrature significantly improves upon these methods. In this example, it computes the function values at just two x positions, ±⁄√3, then doubles each value and sums to get the numerically exact answer. The explanation for this dramatic success lies in error analysis, and a little luck. An n-point Gaussian method is exact for polynomials of degree up to 2n−1. The function in this example is a degree 3 polynomial, plus a term that cancels because the chosen endpoints are symmetric around zero. (Cancellation also benefits the Romberg method.)

Shifting the range left a little, so the integral is from −2.25 to 1.75, removes the symmetry. Nevertheless, the trapezoid method is rather slow, the polynomial interpolation method of Romberg is acceptable, and the Gaussian method requires the least work — if the number of points is known in advance. As well, rational interpolation can use the same trapezoid evaluations as the Romberg method to greater effect.

Quadrature method cost comparison
Method Trapezoid Romberg Rational Gauss
Points 1048577 257 129 36
Rel. Err. −5.3×10 −6.3×10 8.8×10 3.1×10
Value 2.25 1.75 f ( x ) d x = 4.1639019006585897075 {\displaystyle \textstyle \int _{-2.25}^{1.75}f(x)\,dx=4.1639019006585897075\ldots }

In practice, each method must use extra evaluations to ensure an error bound on an unknown function; this tends to offset some of the advantage of the pure Gaussian method, and motivates the popular Gauss–Kronrod hybrid. Symmetry can still be exploited by splitting this integral into two ranges, from −2.25 to −1.75 (no symmetry), and from −1.75 to 1.75 (symmetry). More broadly, adaptive quadrature partitions a range into pieces based on function properties, so that data points are concentrated where they are needed most.

This brief introduction omits higher-dimensional integrals (for example, area and volume calculations), where alternatives such as Monte Carlo integration have great importance.

A calculus text is no substitute for numerical analysis, but the reverse is also true. Even the best adaptive numerical code sometimes requires a user to help with the more demanding integrals. For example, improper integrals may require a change of variable or methods that can avoid infinite function values; and known properties like symmetry and periodicity may provide critical leverage.

See also

References

External links

Online books

Categories:
Integral: Difference between revisions Add topic