Normal numbers

We are going to apply the pointwise ergodic theorem to deduce a famous result of Borel that says almost every number is normal in base 10. We will do this by studying the map T(x)=10xmod1 on [0,1) equipped with Lebesgue measure λ on the Borel subsets of [0,1).

First we check that Lebesgue measure is T invariant. It suffices to check that λ(T1([a,b)))=λ([a,b))=ba for all 0a<b1 because the collection of sub-intervals of [0,1) forms, together with the empty set, a π-system. But T1([a,b))=i=09[a+i10,b+i10) is a disjoint union of intervals whose lengths do add up to ba. Thus λ is an invariant measure for T.

Theorem

The map T(x)=10xmod1 is ergodic with respect to Lebesgue measure.

Proof:

Fix B[0,1) that is T invariant. We wish to prove that μ(B) is either zero or one. Suppose it is positive and define a measure ν by ν(E)=λ(BE)λ(B) on the Borel subsets of [0,1). We will check that ν(I)=λ(I) for every interval of the form I=[a10r,a+110r) with rN and 0a<10r. Intervals of the above kind are called decimal intervals. Since such intervals generate the Borel σ-algebra, this will then imply ν=λ and λ(B)=1 by arguments we have seen before.

Fix a decimal interval I. To check the above we note that λ(BI)=λ(B(Tn)1I) for all nN because λ is T invariant and T1(B)=B. Next, fixing ϵ>0 we choose decimal intervals J1,J2, with BJ1J2 and λ(B)+ϵλ(J1)+λ(J2)+λ(J1J2)λ(B) using the definition of Lebesgue measure. We can then estimate |λ(BI)λ(t=1Jt(Tn)1I)|<ϵ for all nN. One calculates for every KN that limnλ(t=1KJt(Tn)1I)=λ(t=1KJt)λ(I) and thus deduces |λ(BI)t=1λ(Jt)λ(B)|<ϵ whence λ(BI) and λ(B)λ(I) are within 2ϵ of each other. Since ϵ>0 was arbitrary we are done.

Now that we know T is ergodic we can apply the pointwise ergodic theorem. Let us first apply the theorem with f=1[0,0.1] to get limN1Nn=0N11[0,0.1](Tn(x))=1[0,0.1]dλ=110 for almost every x[0,1). This means that almost every point x[0,1) has the property that the sequence nTn(x) spends about 10% of its time in the interval [0,0.1]. Formally, the pointwise ergodic theorem gives us a set Ω0[0,1) with λ(Ω0)=1 such that every xΩ0 has the desired property.

We can reinterpret this by expressing x=j=1dj(x)10j for natural numbers d1(x),d2(x), between zero and nine i.e., by writing the decimal expansion of x. In those terms we have x[0,0.1) if and only if d1(x)=0. Moreover T(x)=10(d1(x)10+d2(x)102+d3(x)103+)mod1=d1(x)+d2(x)10+d3(x)102+mod1=d2(x)10+d3(x)102+ and T(x) belongs to [0,0.1] if and only if the second digit in the decimal expansion of x is a zero. Repeatedly applying T as above, we see that n=0N11[0,0.1](Tnx) counts how many of the first N digits in the decimal expansion of x are equal to 0 and the pointwise ergodic theorem is telling us that almost every point x[0,1] has, asymptotically, the property that about 10% of its decimal digits are equal to 0.

We can, for example, repeat the above argument with f=1[0.47,0.48]. The sum n=0N11[0.47,0.48](Tnx) counts the occurrences of 47 amongst the decimal digits of x and the pointwise ergodic theorem provides us with a set Ω47 of full measure such that every point in x has 47 appear in its decimal expansion 1% of the time.

In fact, for every finite string σ of digits between zero and nine we get a set Ωσ with μ(Ωσ)=1 and the property that every xΩσ has the expected frequency - 1/10 where is the number of digits in σ - of appearences of σ in its decimal expansion. A number with this property is sometimes called normal in base 10 and the above argument proves the following theorem.

Theorem (Borel's normal number theorem)

There is a set N[0,1) with λ(N)=1 and the property that every xN is normal in base 10.

Proof:

Let Σ be the set of all finite strings of digits between zero and nine. The intersection σΣΩσ is a countable intersection of sets with full measure and therefore has full measure itself. Every number in the intersection is normal.

Despite the theorem guaranteeing an abundance of normal numbers - one would, for example, have to be unlucky not to come across a normal number by randomly choosing digits to define a number's decimal expansion - the theorem does not provide us with a specific example of a normal number! The somewhat artificial number 0.1234567891011121314151617 known as Champernowne's constant is normal by direct verification, and the artificial but interesting number 0.23579111317192329313741 was proved normal by Copeland and Erdős. It is an infamous problem to determine whether 2 or π or e are normal.

One can show that the map T(x)=1xmod1 is ergodic with respect to the measure ν(E)=1log21Eϕdλ where ϕ(x)=(1+x)1. Applying the pointwise ergodic theorem to functions such as [1k+1,1k] one can show that almost every number x[0,1) has a normal continued fraction expansion, if one appropriately weights each digit according to the measure ν.