Lecture 2 - fouriernew

4. Heat equation on the circle¶

Derivation of heat equation¶

In this note we identify the circle $S^1$ with $\mathbb{T}:=\mathbb{R}/\mathbb{Z}$ via $e^{2\pi i x}$ $\leftrightarrow$ $x\in [0,1]$ . When we say the point $x$ on the circle, we mean the point $e^{2\pi i x}\in S^1$ . In our discussion “time” is a real variable $t\in [0,+\infty]$ .

The distribution of temperature on the circle is modeled by a density function $f(x)$ for $x\in \mathbb{T}$ . The temperature on interval $[a,b]$ is given by $\int_{a}^b f(x)dx$ .

Let $u(x,t)$ be the temperature distribution of circle at time $t$ , note that $u(0,t)=u(1,t)$ , so the temperature of an interval $(a,b)$ is given by $\int_{a}^b u(x,t)dx$ . Let $u(x,0)=f(x)$ be the initial temperature. We study the heat diffusion when time increases under the following postulates:

Increasing of temperature of an interval $(a,b)$ during time $\Delta t$ is proportional to increasing of heat on $(a,b)$ . In other words, $[ T(a,b,t+\epsilon)-T(a,b,t)] = c\cdot [H(a,b,t+\epsilon)-H(a,b,t)]$ for some constant $c>0$ , or $\frac{\partial}{\partial t}H(a,b,t) = c\frac{\partial}{\partial t}T(a,b,t)$ if we look at infinitesimal change of time $t$ .
Fix time $t$ , consider heat transfer during an infinitesimal time interval. Let $\omega_t(x)$ be the flux of heat at $x$ , i.e. the amount of heat which goes from leftside to right side at $x$ , so that $\omega_t(x)$ is positive if heat transfers from left to right and negative if heat transfers from right to left. We get
$\frac{\partial}{\partial t}H(a,b,t) = \omega_t(a)-\omega_t(b).$
(1)
Fourier’s law: the heat flux is negatively proportional to the derivative of temperature, i.e. at time $t$ , $\omega(x) = \kappa \lim_{\delta\to 0}\frac{u(x+\delta,t)-u(x,t)}{\delta} = -\kappa \frac{\partial}{\partial x}u(x,t)$ for a constant $\kappa > 0$ . Note that negative because heat transfers from higher temperature to lower temperature.
- For example, if the temperature is higher on the right, so u(x,t) is increasing in x variable, so the partial derivative \frac{\partial}{\partial x}u(x,t) is positive, but in this case the heat will go from right(higher) to the left(lower), so \omega_t(x) < 0 because \omega_t(x) means the amount of heat going from left to right.
Consequently, $\frac{\partial}{\partial x}\omega_t(x) = -\kappa \frac{\partial^2}{\partial x^2}u(x,t)$ .

From the first two postulates, we get $\frac{d}{dt}\int_{a}^bu(x,t)dx = \int_{a}^b \frac{\partial}{\partial t}u(x,t)dx = c(\omega_t(a)-\omega_t(b)) = -c\int_{a}^b\omega_t'(x)dx$ . By the third postulate, $\omega_t'(x)=-\kappa \frac{\partial^2}{\partial x^2}u(x,t)$ , so we get $\int_{a}^b\frac{\partial}{\partial t}u(x,t)dx = \int_{a}^b \frac{\partial^2}{\partial x^2}c\kappa u(x,t)dx$ . Equality holds for any interval so we get the equation

\frac{\partial}{\partial t}u(x,t) = c\kappa \frac{\partial}{\partial x^2}u(x,t).

(2)

We consider the standard form $u_t = \frac{1}{2}u_{xx}$ .

Solving the heat equation using Fourier series¶

Since $u(x,t)$ is periodic on the $x$ variable, Fourier’s idea is to look for solutions of the form $u(x,t) = \sum_{n=-\infty}^{+\infty}c_n(t)e^{2\pi i n x}$ . It follows that $c_n(t) = \int_{0}^1 u(x,t)e^{-2\pi i n x}dx$ . Take $\frac{d}{dt}$ on $c_n(t)$ we get

c_n'(t) = \int_{0}^1 \frac{\partial}{\partial t}u(x,t)e^{-2\pi i n x}dx = \frac{1}{2}\int_{0}^1 u_{xx}(x,t)e^{-2\pi i n x}dx.

(3)

(It makes sense to assume $u$ to be smoothly depending on $t$ so that we can change derivative and integral here, by applying mean value theorem).

By the derivative theorem, $\widehat{g'}(n) = 2\pi i n \hat{g}(n)$ , so apply to $u(x,t)$ in the $x$ variable we get $c_n'(t) = -2\pi^2n^2c_n(t)$ . Solving the ODE we get $c_n(t) = c_n(0)e^{-2\pi^2n^2 t}$ . $c_n(0)$ is the Fourier coefficient of $u(x,0) = f(x)$ , so we get $u(x,t) = \sum_{n = -\infty}^{+\infty}\hat{f}(n)e^{-2\pi^2n^2 t}e^{2\pi i n x}$ .

Write back $\hat{f}(n) = \int_{0}^1 f(y)e^{-2\pi i n y}dy$ , then $u(x,t) = \sum_{n = -\infty}^{+\infty} \int_{0}^1 f(y)e^{-2\pi i n y}e^{2\pi i n x}e^{-2\pi^2n^2 t}dy = \int_{0}^1 f(y)e^{-2\pi^2n^2 t}e^{2\pi i n (x-y)}dy$ , provided we can change the integral and summation. Let $g(x,t) = \sum_{n=-\infty}^{+\infty}e^{-2\pi^2n^2 t}e^{2\pi in x}$ , then the series converges absolutely hence uniformly in $x$ variable for every $t$ , ( $\sum e^{-2\pi n^2 t}<\infty$ as a geometric series), then $g(x,t)$ is a continuous function on the circle, this also justifies the change of integral and summation.

We obtained a simpler form of solution: $u(x,t) = \int_{0}^1g(x-y,t)f(y)dy$ . This type of integrals is called convolution, as we shall study next.

5. Convolution¶

Definition 5.1. Let $f,g$ be integrable periodic functions with period 1. The convolution of $f,g$ is a function determined pointwise by

(f*g)(x) = \int_{0}^1f(x-y)g(y)dy

(4)

Remarks.

By changing variable $y' = x-y$ , we get $\int_{0}^1 f(x-y)g(y)dy = \int_{x}^{x-1}f(y')g(x-y')d(-y') = -\int_{0}^{-1}f(y')g(x-y')dy' = \int_{0}^1g(x-y)f(y)dy$ , so we can also define convolution by $(f*g)(x) = \int_{0}^1 g(x-y)f(y)dy$ .
Our definition of convolution here is periodic convolution. Since convolution means taking average, we need to normalize in period $T$ so that $f*g = \frac{1}{T}\int_{0}^Tf(x-y)g(y)dy$ . The standard notion of convolution is on the real line and plays an important rule in Fourier transforms.
The $N$ -th Fourier partial sum $S_N(f)$ can be written as $f*D_N$ . In fact, $S_N(f)(x) = \sum_{n=-N}^N \hat{f}(n)e^{2\pi inx} = \sum_{n=-N}^N \int_{0}^1f(y)e^{-2\pi i n y} dy \cdot e^{2\pi i n x} =\int_{0}^1f(y) \sum_{n = -N}^N e^{2\pi i n (x-y)} dy = f*D_N(x)$ , here $D_N = \sum_{n=-N}^N e^{2\pi i n x}$ is the Dirichlet kernel on $\mathbb{T}$ .

Properties of convolution¶

Let $f,g, h$ be integrable periodic functions with period 1.

Linearity¶

The linearity of convolutions follows directly from linearity of integrals.

$(f+g)*h = f*h+g*h$ .
$(cf)*g = cf*g$ .

Commutativity & Associativity¶

$f*g = g*f$ . This is just the first remark of Definition 5.1.
$(f*g)*h = f*(g*h)$ . This follows from the Fubini theorem.

Note

The practical application of Fubini theorem usually consists of the following two cases: Let $f(x,y)$ be a function on $[a,b]\times [c,d]$

If $f\geq 0$ then one can always change order, i.e. $\int_{[a,b]\times [c,d]}f(x,y)dxdy = \int_{x\in [a,b]}\int_{y\in [c,d]}f(x,y)dy dx = \int_{y\in[c,d]}\int_{x\in[a,b]}f(x,y)dxdy$ . This is also called Tonelli theorem.
If $f$ takes both positive and negative value, then the condition to change order is the $L^1$ condition $\int_{[a,b]\times [c,d]}|f(x,y)|dxdy < \infty.$ By the Tonelli theorem, this is the case when either $\int_{x\in [a,b]}\int_{y\in [c,d]}|f(x,y)|dy dx$ or $\int_{y\in[c,d]}\int_{x\in[a,b]}|f(x,y)|dxdy$ (they are equal) is $<\infty$ .

From the first properties one may feel that convolution looks quite similar to multiplication. This intuition may be justified in some sense by the fact that convolution and multiplication are Fourier transform of each other.

Convolution theorem¶

Let $f,g$ be integrable period 1, then $\widehat{f*g}(n) = \hat{f}(n)\hat{g}(n)$ .

Proof. By definition of convolution and Fubini theorem, $\widehat{f*g}(n) = \int_{0}^1 (f*g)(x)e^{-2\pi i n x}dx = \int_x\int_y f(x-y)g(y)dy e^{-2\pi i nx}dx = \int_{y}\int_xf(x-y)e^{-2\pi i n(x-y)}dx \; e^{-2\pi i n y }g(y)dy = \hat{f}(n)\hat{g}(n)$ .

The integrand satisfies |f(x-y)g(y)e^{-2\pi i n x}| = |f(x-y)g(y)| and \int_y\int_x |f(x-y)||g(y)|dxdy = \int_y |g(y)|\int_x |f(x)|dx dy = \|f\|_{L^1}\|g\|_{L^1} < \infty because f,g are both L^1 on [0,1] .

Continuty of convolution¶

Another good property of convolution is its regulartiy. The convolution is defined by integration, which are good because integrals usually have more regularity than integrands. In other words, $f*g$ is “nicer” than the nicer one of $f$ and $g$ , that’s why convolution is widely used to do approximations. As an example we present the approximation of integrable functions in detail to let you have some feeling about this technique.

Proposition 5.1. Let $f,g$ be periodic functions with period 1. Then

If $f$ is continuous, $g$ is integrable, then $f*g$ is continuous.
Moreover, if $f$ is integrable, $g$ is bounded, then $f*g$ is also continuous.

Proof. Let $h$ be a real number. $|f*g(x+h) - f*g(x)| = |\int_{0}^1 (f(x+h-y)-f(x-y))g(y)dy|\leq \int_{0}^1|f(x+h-y)-f(x-y)||g(y)|dy$ . For $\epsilon > 0$ , since $f$ is continuous, when $|h|$ is small enough, $|f(x'+h)-f(x')|<\epsilon$ for every $x'\in \mathbb{R}$ . It follows that $|f(x+h-y)-f(x-y)|<\epsilon$ for every $y\in (0,1)$ . Then the right hand side $\leq \epsilon \|g\|_{L^1}$ , where $\|g\|_{L^1}:=\int_{0}^1|g(y)|dy<\infty$ because $g$ is integrable. This shows that $f*g$ is continuous. Actually $f*g$ is uniformly continuous in this case.

To prove the second stronger assertion we need an approximation lemma which will be proved in problem set 1.

Lemma 5.2. Let $f$ be an integrable function on $[0,1]$ , then for $\epsilon > 0$ , there exists a continuous function $\tilde f$ such that $\|f - \tilde f\|_{L^1}< \epsilon$ .

By the approximation lemma, choose a sequence of continuous functions $f_k$ such that $\|f- f_k\|_{L^1} \to 0$ when $k\to \infty$ . Then $|f*g(x) - f_k*g(x)| = |(f-f_k)*g(x)|=\int_{0}^1|(f - f_k)(y)||g(x-y)|dy\leq B\|f-f_k\|_{L^1}$ , where $0<B<\infty$ is an upper bound of $|g|$ . Since the right hand side goes to 0 when $k\to \infty$ and does not depend on $x$ , we conclude that $f_k*g\to f*g$ uniformly. Since $f_k*g$ is continuous because $f_k$ is continuous, this implies that $f*g$ is a uniform limit of continuous functions, which is continuous.

Note

The above argument requires the function to be bounded, this is not an issue for Riemann integrals because Riemann integrable on a finite interval require bounded. In the case of Lebesgue integrals, one can proceed to the unbounded case by approximating the function by bounded functions. WLOG assume $f\geq 0$ , otherwise consider $|f|$ . Let $A_n:=\{f(x)\leq n\}$ and $f_n(x) = \chi_{A_n}(x)f(x)$ . Then $f_n(x)\to f(x)$ increasingly. The monotone convergence theorem implies $\int|f_n - f|dx \to 0$ when $n\to \infty$ . So there exists some $N$ s.t. $\|f_N - f\|\leq \epsilon$ and $|f_N|\leq N$ . This shows that we can approximate an $L^1$ function in the $L^1$ norm using bounded functions.

The approximation theorem¶

Here are the idea to prove Lemma 5.2 for the Riemann intregral case. First we give some simple notions to shorten the discussion.

Definition 5.3. Let $A$ be a set. Let $\chi_A$ given by

\chi_A(x) = \begin{cases}1,x\in A \\ 0,x\notin A\end{cases}

(5)

Definition 5.4. A rect function is a function of the form $\chi_{(a,b)}$ , where $(a,b)$ is an interval. rect stands for rectangle, the graph of a rect function looks like a rectangle hence the name.

Definition 5.5. A step function is given by $\sum_{k=1}^m c_k\chi_{(a_k,b_k)}$ , where $c_k$ are real numbers, $(a_k,b_k)$ are disjoint intervals. Still, the name explains its shape. A step function is a finite linear combination of rect functions of disjoint intervals.

Now let $f$ be a Riemann integrable function.

Step 0. By definition, there exists a step function $h$ such that $\|f - h\|_{L^1}<\epsilon$ . In fact, this is how the Riemann integral is defined. The integral of a function is defined by taking limit of integral of step functions for suitable functions for which the limit makes sense.

Step 1. Approximate step function by continuous functions. That is, find a continuous function $g$ such that $\|g-h\|_{L^1}< \epsilon.$ Since a step function is a linear combination of rect functions, it sufficies to do so for rect functions. I’ll walk you through the proof in problem set 1. The idea is to use convolution.

Step 2. Now we have $\|f - g\|_{L^1} \leq \|f-h\|_{L^1}+\|g-h\|_{L^1}\leq 2\epsilon$ by triangle inequality. We are done.

Note that the approximation lemma holds for the more general Lebesgue integrals, the proof is actually similar. The integrals are designed to be approximated by simple functions.