# Stone-Weierstrass and an Alternative Proof of Itô's Lemma

In a similar sense to line integrals, stochastic calculus extends the classical tools to working with stochastic processes. One of the most elegant and useful results is the change of variable formula for stochastic integrals, commonly known as Itô’s Lemma (see the end of this post for a discussion of Doeblin’s contribution). While this lemma is quite easy to use, its proof usually relies heavily on technical lemmas, which makes it difficult to develop intuition, especially for a first-time reader.

With this motivation in mind, it was quite pleasant to discover a set of excellent lecture notes by Jason Miller (2016), which contain an alternative proof built on the idea of the Stone-Weierstrass Theorem. We shall see that not only is this proof more interpretable, but the technique also generalizes beyond stochastic calculus. In particular, this blog post intends to illustrate the technique in detail through Itô’s Lemma.

## A Brief Background on Stochastic Calculus

We will introduce (without too much rigour) some basic definitions and results to support the proofs in later sections. The reader need not carefully analyze the technical details here to understand the proofs to come. Readers familiar with stochastic calculus may skip to the next section.

First we let \( (\Omega, \mathcal{F}, \mathbb{P}) \) be a probability space equipped with a filtration \( (\mathcal{F}_t)_{t \geq 0} \) (also satisfying the usual conditions to be rigorous). With this we can define several useful objects.

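**Definition**
A stochastic process
\( (X_t)_{t \geq 0} \) is said to be a **martingale** if

(i) \( \forall t \geq 0 \), we have \( X_t \) is measurable with respect to \( \mathcal{F}_t \), denoted \( X_t \in \mathcal{F}_t \);

(ii) \( \forall \, 0 \leq s \leq t \), we have \( \mathbb{E} [ X_t | \mathcal{F}_s ] = X_s \) a.s.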

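**Definition** We say a random variable
\( \tau : \Omega \to [0, \infty] \) is a **stopping time** if \( \{ \tau \leq t \} \in \mathcal{F}_t \) for all \( t \geq 0 \).

An important property of stopping times is that if \( (X_t)_{t \geq 0} \) is a martingale and \( \tau \) a stopping time, then the stopped process \( (X_{t \wedge \tau})_{t \geq 0} \) is also a martingale.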
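The stopping property can be checked numerically in a discrete setting. The sketch below is only an illustration under stated assumptions: it uses a simple symmetric random walk as a stand-in for a continuous martingale, and the function name `stopped_walk_endpoints`, the level `R`, and the horizon are hypothetical choices made here. The walk is stopped the first time it reaches `R` in absolute value, and the sample mean of the stopped value should stay near the starting value 0.

```python
import numpy as np

def stopped_walk_endpoints(n_paths, n_steps, R, seed=0):
    """Return S_{n ∧ tau} for simple random walks, tau = first time |S| >= R."""
    rng = np.random.default_rng(seed)
    steps = rng.choice([-1, 1], size=(n_paths, n_steps))
    walk = np.cumsum(steps, axis=1)              # S_1, ..., S_n with S_0 = 0
    hit = np.abs(walk) >= R
    # index of the first hit, or the last index if the path never hits
    first_hit = np.where(hit.any(axis=1), hit.argmax(axis=1), n_steps - 1)
    return walk[np.arange(n_paths), first_hit]

ends = stopped_walk_endpoints(n_paths=20_000, n_steps=100, R=5)
print("largest |stopped value|:", np.abs(ends).max())
print("sample mean of stopped value:", ends.mean())
```

The stopped values are bounded by `R` by construction, which is the reason stopping times of this kind are convenient when bounded processes are needed.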

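**Definition**
Let the interval
\( [0, T] \) be partitioned using increments of \( 2^{-n} \),
i.e. \( 0 = t_0^n < t_1^n < t_2^n < \cdots \),
where \( t_k^n = k 2^{-n} \).
Let \( X \) be a continuous martingale,
and \( f \) be a continuous (possibly stochastic) process.
We define the **Itô integral** as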

\[ \int_0^T f_t \, dX_t := \lim_{n\to\infty} \sum_{k=0}^{\lfloor T 2^n \rfloor} f_{t_k^n} (X_{t_{k+1}^n} - X_{t_k^n}), \]

if the limit converges u.c.p. (uniformly on compact intervals in probability to be precise).

**Remark** Observe the above definition uses
a **left Riemann sum** to define the integral,
whereas *other choices* will lead to *different* integrals.
This is in contrast to deterministic integrals,
where all choices are equivalent.
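To see the dependence on the evaluation point concretely, here is a small numerical sketch. It assumes a simulated Brownian path `W` (a specific continuous martingale chosen for illustration) and compares the left-endpoint sum for \( \int_0^T W_t \, dW_t \) with the sum that averages the two endpoints, which is one common Stratonovich-type discretization. All variable names are choices made for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
T, n = 1.0, 12                                # dyadic level, mesh 2**-n
N = int(T * 2**n)
dW = rng.normal(0.0, np.sqrt(T / N), size=N)  # Brownian increments
W = np.concatenate(([0.0], np.cumsum(dW)))    # W at t_0, ..., t_N

left = np.sum(W[:-1] * dW)                    # integrand at left endpoints
avg = np.sum(0.5 * (W[:-1] + W[1:]) * dW)     # integrand averaged over endpoints

# By telescoping, the two sums differ by half the sum of squared increments.
print("left sum:         ", left)
print("(W_T^2 - sum dW^2)/2:", (W[-1] ** 2 - np.sum(dW**2)) / 2)
print("averaged sum:     ", avg)
print("W_T^2 / 2:        ", W[-1] ** 2 / 2)
```

The gap between the two sums is \( \frac{1}{2} \sum_k (W_{t_{k+1}} - W_{t_k})^2 \), which does not vanish as the mesh shrinks, so the two schemes define genuinely different integrals.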

**Definition** Consider the same partition
\( \{ t_k^n \} \) as above.
Let \( M, N \) be two continuous martingales,
we define the **quadratic covariation** as

\[ [M,N]_T := \lim_{n\to\infty} [M,N]^n_T := \lim_{n\to\infty} \sum_{k=0}^{\lfloor T 2^n \rfloor} (M_{t_{k+1}^n} - M_{t_k^n}) (N_{t_{k+1}^n} - N_{t_k^n}), \]

where the limit is also u.c.p.
We also define the **quadratic variation** as
\( [M]_T := [M, M]_T \).
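As a numerical illustration of the definition, the following sketch computes the sum of squared increments of a simulated Brownian path along dyadic partitions of increasing level. It relies on the standard fact that Brownian motion has quadratic variation \( T \) on \( [0, T] \); the choice of Brownian motion, the levels, and the variable names are assumptions made for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
T, n_max = 1.0, 14
N = int(T * 2**n_max)                         # finest dyadic grid
dW = rng.normal(0.0, np.sqrt(T / N), size=N)
W = np.concatenate(([0.0], np.cumsum(dW)))    # Brownian path on the finest grid

qv = {}
for n in (6, 10, 14):
    coarse = W[:: 2 ** (n_max - n)]           # path sampled at t_k = k 2^{-n}
    qv[n] = np.sum(np.diff(coarse) ** 2)      # sum of squared increments
    print(f"level n={n:2d}: sum of squared increments = {qv[n]:.4f}")
```

As the level grows, the printed values should settle near \( T = 1 \), in contrast to a smooth path, where the same sums would shrink to zero.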

Several useful results are stated next.

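**Proposition (Finite Variation)**
Let \( X, A \) be continuous stochastic processes such that
\( A \) has finite variation, i.e.

\[ V_T(A) := \lim_{n\to\infty} \sum_{k=0}^{\lfloor T 2^n \rfloor} | A_{t_{k+1}^n} - A_{t_k^n} |, \]

and \( V_T(A) < \infty \) a.s. Then we have

\[ [X, A]_T = 0. \]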

**Proposition (Itô’s Product Rule)**
Let \( M, N \) be continuous martingales,
then we have

\[ M_t N_t = M_0 N_0 + \int_0^t M_s \, dN_s + \int_0^t N_s \, dM_s + [M, N]_t. \]

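**Proposition (Fundamental Theorem)**
Let \( X, Y, Z \) be continuous martingales,
then we have

\[ \int_0^t X_s \, d \left( \int_0^s Y_u \, dZ_u \right) = \int_0^t X_s Y_s \, dZ_s. \]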

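**Proposition (Kunita-Watanabe Identity)**
Let \( H, M, N \) be continuous martingales, then we have

\[ \left[ \int_0^{\cdot} H_s \, dM_s, N \right]_t = \int_0^t H_s \, d[M, N]_s, \]

where both uses of \( [\cdot, \cdot] \) denote the covariation.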

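**Proposition (Itô’s Isometry)**

Let \( X \) be a continuous martingale, and \( f \) be a continuous stochastic process. Then we have

\[ \mathbb{E} \left[ \left( \int_0^t f_s \, dX_s \right)^2 \right] = \mathbb{E} \left[ \int_0^t f_s^2 \, d[X]_s \right]. \]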

## The Lemma and the Classical Approach

For the purpose of the blog post, we will only state and prove a much simpler version of the lemma, but it is not difficult to adapt to more general conditions.

**Theorem (Itô’s Lemma)**
Let \( X \) be a continuous martingale,
and \( f \in C^2(\mathbb{R}) \).
Then we have

\[ f(X_t) = f(X_0) + \int_0^t \frac{\partial f}{\partial x}(X_s) dX_s + \frac{1}{2} \int_0^t \frac{\partial^2 f}{\partial x^2} (X_s) d[X]_s. \]
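The formula can be sanity-checked on a simulated path. The sketch below is not part of the proof: it assumes \( X \) is a Brownian motion (so that \( d[X]_s = ds \)), takes \( f = \sin \) as an arbitrary smooth test function, and replaces both integrals with left Riemann sums on a fine grid. The grid size and variable names are choices made for this illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
T, N = 1.0, 2**16
dt = T / N
dW = rng.normal(0.0, np.sqrt(dt), size=N)
W = np.concatenate(([0.0], np.cumsum(dW)))    # Brownian path, W_0 = 0

f = np.sin
df = np.cos                                    # first derivative of f
d2f = lambda x: -np.sin(x)                     # second derivative of f

lhs = f(W[-1]) - f(W[0])
ito_integral = np.sum(df(W[:-1]) * dW)         # left sums for the dX integral
correction = 0.5 * np.sum(d2f(W[:-1]) * dt)    # d[W]_s = ds for Brownian motion
rhs = ito_integral + correction

print("f(W_T) - f(W_0):            ", lhs)
print("Ito integral + correction:  ", rhs)
print("Ito integral alone:         ", ito_integral)
```

On a fine grid the first two printed numbers should agree closely, while dropping the second-order term generally leaves a visible gap.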

Here we will sketch the proof from Karatzas and Shreve (1991).

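*proof sketch:*
We start by defining a stopping time
\( \tau_R := \inf \{ t \geq 0 : |X_t| \geq R \text{ or } [X]_t \geq R \} \),
and replace \( X_t \) with \( X_{t \wedge \tau_R} \).
This *localization* technique will allow us
to only consider the function \( f \) in
the interval \( [-R, R] \)
(or a ball in higher dimensions),
on which \( f \) has bounded derivatives.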

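By observing the lemma’s statement, the reader may notice the formula resembles the second order Taylor expansion of \( f \). Indeed, for a partition \( 0 = t_0 < t_1 < \cdots < t_m = t \) and writing \( f', f'' \) for the first and second derivatives, we can write

\[ f(X_t) - f(X_0) = \sum_{k=0}^{m-1} f'(X_{t_k}) (X_{t_{k+1}} - X_{t_k}) + \frac{1}{2} \sum_{k=0}^{m-1} f''(\eta_k) (X_{t_{k+1}} - X_{t_k})^2, \]

where \( \eta_k \), a point between \( X_{t_k} \) and \( X_{t_{k+1}} \), is chosen as part of Taylor’s theorem to satisfy the above equality. It’s not difficult to see the first sum converges to the first stochastic integral, so it remains to show the second sum converges.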

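To this goal, writing \( \Delta_k X := X_{t_{k+1}} - X_{t_k} \) for the increments over a partition \( \{ t_k \} \) of \( [0, t] \), we will define

\[ J_3 := \frac{1}{2} \sum_k f''(\eta_k) (\Delta_k X)^2, \quad J_3^* := \frac{1}{2} \sum_k f''(X_{t_k}) (\Delta_k X)^2, \quad J_4 := \frac{1}{2} \sum_k f''(X_{t_k}) ( [X]_{t_{k+1}} - [X]_{t_k} ), \]

where observe \( J_4 \) converges to the desired integral \( \frac{1}{2} \int_0^t f''(X_s) \, d[X]_s \). Next we will use the following technical inequality. Let \( X \) be a martingale with \( |X_s| \leq R \) for all \( s \leq t \), then we have

\[ \mathbb{E} \left[ \left( \sum_k (\Delta_k X)^2 \right)^2 \right] \leq 6 R^4. \]

Without stating the details, using this and the Cauchy-Schwarz inequality, we can show

\[ \mathbb{E} | J_3 - J_3^* | \leq \frac{\sqrt{6}}{2} R^2 \sqrt{ \mathbb{E} \max_k | f''(\eta_k) - f''(X_{t_k}) |^2 } \to 0 \]

as the mesh of the partition goes to zero. To complete the proof, we will need one more technical lemma. Let \( X \) be a continuous martingale with \( |X_s| \leq R \) for all \( s \leq t \), then we have

\[ \lim_{\max_k |t_{k+1} - t_k| \to 0} \mathbb{E} \sum_k (\Delta_k X)^4 = 0. \]

Then once again omitting the details, we can get, for some constant \( C > 0 \),

\[ \mathbb{E} | J_3^* - J_4 |^2 \leq C \, \| f'' \|_\infty^2 \, \mathbb{E} \left[ \sum_k (\Delta_k X)^4 + [X]_t \max_k ( [X]_{t_{k+1}} - [X]_{t_k} ) \right], \]

which, combined with the previous lemma and the bounded convergence theorem, gives the desired result

\[ J_3 \to \frac{1}{2} \int_0^t f''(X_s) \, d[X]_s. \]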

Putting everything together gives us the desired formula as stated.

**Remark** The propositions
listed in the previous section are used implicitly
in the two technical lemmas stated above,
which is also where most of the difficulty of the proof is hidden.

**Interpretation** This proof naturally leads to
an interpretation of Itô’s Lemma as
a consequence of Taylor’s expansion.
However, this proof provides no clear intuition on why
the second order approximation is the correct order,
and pushes the justification to complicated technical details.
Probably the most troubling consequence is that
a different integration scheme
(e.g. Stratonovich,
which arises from a mid-point Riemann sum)
leads to a different change of variable formula,
so the Taylor expansion intuition can lead
to further confusion.

## Overview of the Alternative Approach

At this point, we will first take a step back from Itô’s Lemma and look at a rough sketch of the proof technique.

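Suppose we want to prove that a collection of functions \( \mathcal{C} \) (e.g. \( C^2(\mathbb{R}) \)) satisfies a certain property \( P \). We will start by defining \( \mathcal{A} \) as the subset of \( \mathcal{C} \) that satisfies the desired property \( P \).

(Step 1) We will identify a certain algebraic structure that \( \mathcal{A} \) is closed under, e.g. for an algebra (over a field) we have: if \( f, g \in \mathcal{A} \) and \( a, b \) are scalars, then \( a f + b g \in \mathcal{A} \) and \( f g \in \mathcal{A} \). In other words, an algebra is a vector space with an associative vector multiplication.

(Step 2) Then we can say that the collection \( \mathcal{C} \) (or a dense subset) is generated by some very simple functions, e.g. under an algebra, the functions \( 1, x \) generate the entire collection of polynomials.

(Step 3) At this point, we use a density argument such as Weierstrass approximation to show \( \mathcal{A} \) is dense in \( \mathcal{C} \). Specifically, \( \forall f \in \mathcal{C} \), \( \exists \{ f_n \} \subset \mathcal{A} \) such that \( f_n \to f \) with respect to some metric \( d \).

(Step 4) Finally, it is sufficient to show \( \mathcal{A} \) is closed under this metric \( d \). I.e. if \( \{ f_n \} \) all satisfy \( P \) and are such that \( f_n \to f \) in \( d \), then \( f \) also satisfies \( P \), hence \( \mathcal{A} = \mathcal{C} \).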

**Remark**
The reader may already recognize that
the sketch above was intentionally phrased
in a very general sense,
so we can observe the flexibility of the technique.
In fact we can even generalize beyond function spaces,
as long as we have an equivalent approximation technique.

## The Proof in Detail

We start by stating the key theorem.

**Theorem (Stone-Weierstrass, Real Numbers)**
Let \( K \) be a compact
Hausdorff space,
and \( \mathcal{A} \subset C(K, \mathbb{R}) \)
an algebra which contains a non-zero constant function.
Then \( \mathcal{A} \) is dense in \( C(K, \mathbb{R}) \)
if and only if it separates points.

Clearly, if we let \( K = [-R, R] \), we have a compact Hausdorff space, and the collection of polynomials contains the functions \( 1, x \) and separates points. Therefore the polynomials are dense in \( C([-R, R]) \) with respect to the sup-norm.
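Density in the sup-norm can be observed numerically. The sketch below uses Bernstein polynomials, which give one constructive route to the classical Weierstrass approximation theorem; this is a different argument from Stone-Weierstrass and is used here only as an illustration. The helper name `bernstein_approx`, the test function \( |x| \), and the degrees are assumptions made for this example.

```python
import math
import numpy as np

def bernstein_approx(f, R, n, x):
    """Evaluate the degree-n Bernstein polynomial of f on [-R, R] at points x."""
    u = (np.asarray(x, dtype=float) + R) / (2 * R)   # map [-R, R] to [0, 1]
    k = np.arange(n + 1)
    coeffs = np.array([float(math.comb(n, j)) for j in range(n + 1)])
    nodes = -R + 2 * R * k / n                        # points where f is sampled
    basis = coeffs * u[:, None] ** k * (1 - u[:, None]) ** (n - k)
    return basis @ f(nodes)

R = 1.0
x = np.linspace(-R, R, 1001)
f = np.abs                                            # continuous, not differentiable at 0
err = {n: np.max(np.abs(bernstein_approx(f, R, n, x) - f(x))) for n in (20, 400)}
for n, e in err.items():
    print(f"degree {n:3d}: sup-norm error on the grid = {e:.4f}")
```

The error decreases as the degree grows, though slowly for a function with a kink; smoother functions are approximated faster.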

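Applying the same theorem to the derivatives, we then have the same result for \( C^2([-R, R]) \) with respect to a similar norm

\[ \| f \|_{C^2} := \| f \|_\infty + \| f' \|_\infty + \| f'' \|_\infty, \]

where \( f', f'' \) denote the first and second derivatives of \( f \).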

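*proof (of Itô’s Lemma):*
We will similarly use a localization argument, i.e. define
\( \tau_R := \inf \{ t \geq 0 : |X_t| \geq R \text{ or } [X]_t \geq R \} \),
and replace \( X_t \) with \( X_{t \wedge \tau_R} \).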

(Step 1, 2) Let \( \mathcal{A} \subset C^2([-R, R]) \) be the collection of functions for which Itô’s Lemma is satisfied. Trivially we have that \( 1, x \) are in \( \mathcal{A} \), and \( \mathcal{A} \) forms a vector space.

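Next we show that \( \mathcal{A} \) forms an algebra. In particular, suppose \( f, g \in \mathcal{A} \), and define \( F_t := f(X_t), G_t := g(X_t) \). Using the product rule gives us

\[ F_t G_t = F_0 G_0 + \int_0^t F_s \, dG_s + \int_0^t G_s \, dF_s + [F, G]_t. \]

Using the Fundamental Theorem and Itô’s Lemma on \( g \), we get

\[ \int_0^t F_s \, dG_s = \int_0^t f(X_s) g'(X_s) \, dX_s + \frac{1}{2} \int_0^t f(X_s) g''(X_s) \, d[X]_s, \]

and observe the same is true switching the order of \( f, g \). Next we use Itô’s Lemma and expand with the Kunita-Watanabe identity to get

\[ [F, G]_t = \left[ \int_0^{\cdot} f'(X_s) \, dX_s, \int_0^{\cdot} g'(X_s) \, dX_s \right]_t = \int_0^t f'(X_s) g'(X_s) \, d[X]_s, \]

where the extra terms are zero because the covariation with one finite variation process is zero, i.e. \( \left[ \int_0^{\cdot} f'(X_s) \, dX_s, \int_0^{\cdot} g''(X_s) \, d[X]_s \right]_t = 0 \) as \( \int_0^{\cdot} g''(X_s) \, d[X]_s \) has finite variation. By grouping the integrals by the integrators (e.g. \( dX_s \) and \( d[X]_s \)), we get that \( fg \) satisfies Itô’s Lemma or simply \( fg \in \mathcal{A} \).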
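As a concrete instance of this step, consider the simplest non-trivial product, \( f(x) = g(x) = x \), so that \( (fg)(x) = x^2 \). This is only a sanity check under the same assumptions (a continuous martingale \( X \)), and the name \( h(x) := x^2 \) is introduced here purely for illustration. The product rule applied to \( X_t \cdot X_t \) gives

\[ X_t^2 = X_0^2 + 2 \int_0^t X_s \, dX_s + [X]_t, \]

while the formula in Itô’s Lemma for \( h \), with first derivative \( 2x \) and second derivative \( 2 \), reads

\[ h(X_t) = h(X_0) + \int_0^t 2 X_s \, dX_s + \frac{1}{2} \int_0^t 2 \, d[X]_s. \]

The two expressions agree because \( \int_0^t d[X]_s = [X]_t \). Repeating the argument with the pair \( x^{n-1} \) and \( x \) shows by induction that every monomial, and hence every polynomial, satisfies the formula.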

(Step 3) Here we can apply *the Stone-Weierstrass Theorem*
to get that \( \mathcal{A} \) is dense in \( C^2([-R, R]) \)
with respect to the norm \( \| \cdot \|_{C^2} \).

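(Step 4) It remains to show that \( \mathcal{A} \) is closed with respect to \( \| \cdot \|_{C^2} \). In particular, let \( \{ f_n \} \) be a sequence in \( \mathcal{A} \) such that \( f_n \to f \) in \( \| \cdot \|_{C^2} \). Then we have

\[ | f_n(X_t) - f(X_t) | \leq \| f_n - f \|_{C^2} \to 0, \quad \left| \int_0^t (f_n'' - f'')(X_s) \, d[X]_s \right| \leq \| f_n - f \|_{C^2} \, [X]_t \to 0. \]

At the same time, we also have by Itô’s Isometry

\[ \mathbb{E} \left[ \left( \int_0^t (f_n' - f')(X_s) \, dX_s \right)^2 \right] = \mathbb{E} \left[ \int_0^t (f_n' - f')^2 (X_s) \, d[X]_s \right] \leq \| f_n - f \|_{C^2}^2 \, \mathbb{E} [X]_t. \]

Since the process is localized we have that \( \mathbb{E} [X]_t < \infty \), and therefore we can pass the limit in the Itô formula and get

\[ f(X_t) = f(X_0) + \int_0^t f'(X_s) \, dX_s + \frac{1}{2} \int_0^t f''(X_s) \, d[X]_s. \]

Finally, since Itô’s Lemma holds for all \( R > 0 \), we can simply take \( R \to \infty \) to complete the proof.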

**Remark**
Clearly the alternative proof is *not necessarily easier*;
however, let us observe a couple of advantages.

Firstly, none of the steps above were very complicated, as most steps followed directly from useful (and well known) propositions. Notably, a first time reader of this subject will have a much easier time following the steps and seeing the bigger picture, rather than getting trapped by technical details.

Secondly, we now have an additional interpretation
of the second integral in the formula,
which clearly arises
as a consequence of Itô’s product rule
and Kunita-Watanabe identity.
For readers who have not seen the proof of the product rule,
it follows almost directly from the definition of the Itô integral,
i.e. it is a direct consequence of choosing *the left Riemann sum*.
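The link between the product rule and the left Riemann sum is already visible at the discrete level, where the identity holds exactly by telescoping \( M_{k+1} N_{k+1} - M_k N_k = M_k \Delta N_k + N_k \Delta M_k + \Delta M_k \Delta N_k \). The sketch below checks this on two simulated paths; the use of independent Gaussian increments and all variable names are assumptions made for this illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
n_steps = 2**12
dM = rng.normal(0.0, np.sqrt(1.0 / n_steps), size=n_steps)
dN = rng.normal(0.0, np.sqrt(1.0 / n_steps), size=n_steps)
M = np.concatenate(([0.0], np.cumsum(dM)))    # first path, M_0 = 0
Nt = np.concatenate(([0.0], np.cumsum(dN)))   # second path, N_0 = 0

left_M_dN = np.sum(M[:-1] * dN)               # left sum for the integral of M dN
left_N_dM = np.sum(Nt[:-1] * dM)              # left sum for the integral of N dM
covariation = np.sum(dM * dN)                 # discrete covariation term

product = M[-1] * Nt[-1]
print("M_T N_T:                    ", product)
print("left sums + covariation sum:", left_M_dN + left_N_dM + covariation)
```

Because both integrals use left endpoints, the cross term \( \Delta M_k \Delta N_k \) is not absorbed into either sum and survives as the covariation.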

## Summary

We have shown that the Stone-Weierstrass Theorem is not only a strong result on its own, but also leads to a powerful technique in general. In particular, we saw a nice alternative proof of Itô’s Lemma with much better interpretations. Ideally, the author would have liked to add another example, but the post is already quite long at this point. Hopefully the readers have still enjoyed an interesting blog post, and added another proof technique to their arsenal.


## An Interesting Story to Wrap Up

For the longest time, the lemma was credited to Kiyosi Itô alone for his 1950 paper. This changed in the 1990s with a resurgence of interest in the late French-German mathematician Wolfgang Doeblin, who was well known to be quite gifted. The interest led to a demand to open the remaining “pli cacheté” (sealed envelope) held by the French Academy of Sciences, which he had submitted shortly before he passed away in 1940; he burned his notes and took his own life so that German soldiers could not take advantage of his work. To everyone’s surprise, Doeblin’s letter contained significant research progress ahead of his time, including a statement of the same change of variables formula! To honour his contribution, the result is sometimes referred to as the Itô-Doeblin Lemma.

For the interested readers, I would strongly recommend an excellent commentary by Bernard Bru and Marc Yor (2002) for further details on this topic.

## References

- Bru, B. & Yor, M. (2002). Comments on the life and mathematical legacy of Wolfgang Doeblin. Finance and Stochastics, 6, 3-47.
- Karatzas, I. & Shreve, S.E. (1991). Brownian Motion and Stochastic Calculus. Springer New York.
- Miller, J. (2016). Stochastic Calculus, Lent 2016 Lecture Notes. Retrieved from http://statslab.cam.ac.uk/~jpm205/teaching/lent2016/lecture_notes.pdf