MathIsimple

Probability Theory – Problem 55: Prove that there exist constants and such that converges in distribution to the standard normal distribution

Question

Let ξk,kZ\xi_{k},k\in Z be independent and identically distributed with P(ξ1=1)=P(ξ1=0)=1/2P(\xi_{1}=1)=P(\xi_{1}=0)=1/2. Define ηn={1,if ξn=1,ξn+1=0;0,otherwise.\eta_{n}=\left\{\begin{array}{l l}{{1,}}&{{\mathrm{if}\ \xi_{n}=1,\xi_{n+1}=0;}}\\ {{0,}}&{{\mathrm{otherwise.}}}\end{array}\right. Let Sn=η1++ηn\displaystyle S_{n}=\eta_{1}+\cdots+\eta_{n}. Prove that there exist constants μ\mu and σ\sigma such that (Snnμ)/(σn)(S_{n}-n\mu)/(\sigma{\sqrt{n}}) converges in distribution to the standard normal distribution.

Step-by-step solution

Step 1. Proof that {ηk}\{\eta_k\} is a stationary sequence. By hypothesis, ξk (kZ)\xi_k\ (k\in\mathbb{Z}) are i.i.d., and ηk=I(ξk=1,ξk+1=0)\eta_k = I(\xi_k=1, \xi_{k+1}=0) (where I()I(\cdot) denotes the indicator function, taking value 1 when the condition holds and 0 otherwise). For any integer m1m\geq1 and k1<k2<<kmk_1<k_2<\cdots<k_m, the joint distribution of (ηk1,ηk2,,ηkm)(\eta_{k_1}, \eta_{k_2}, \dots, \eta_{k_m}) depends only on the joint distribution of ξ\xi: ηki=I(ξki=1,ξki+1=0)\eta_{k_i} = I(\xi_{k_i}=1, \xi_{k_i+1}=0), and its joint probability is determined by the i.i.d. property of ξk1,ξk1+1,,ξkm,ξkm+1\xi_{k_1},\xi_{k_1+1},\dots,\xi_{k_m},\xi_{k_m+1}. After a time shift, (ηk1+t,ηk2+t,,ηkm+t)(\eta_{k_1+t}, \eta_{k_2+t}, \dots, \eta_{k_m+t}) corresponds to ξk1+t,,ξkm+t+1\xi_{k_1+t},\dots,\xi_{k_m+t+1}; since ξ\xi is stationary, the two joint distributions are identical. Therefore, {ηk}\{\eta_k\} is a stationary sequence. Step 2. Compute the constant μ=E[ηk]\mu = \mathbb{E}[\eta_k]. Since ηk\eta_k is a 0-1 random variable, E[ηk]=P(ηk=1)=P(ξk=1,ξk+1=0)\mathbb{E}[\eta_k] = P(\eta_k=1) = P(\xi_k=1, \xi_{k+1}=0). By the i.i.d. property of ξk\xi_k with P(ξ1=1)=P(ξ1=0)=1/2P(\xi_1=1)=P(\xi_1=0)=1/2: P(ξk=1,ξk+1=0)=P(ξk=1)P(ξk+1=0)=1212=14P(\xi_k=1, \xi_{k+1}=0) = P(\xi_k=1) \cdot P(\xi_{k+1}=0) = \frac{1}{2} \cdot \frac{1}{2} = \frac{1}{4} Hence μ=14\mu = \frac{1}{4}. Step 3. Compute the autocovariance function γ(j)=Cov(η1,η1+j)\gamma(j) = \text{Cov}(\eta_1, \eta_{1+j}) and verify absolute summability. The covariance is γ(j)=E[η1η1+j]E[η1]E[η1+j]\gamma(j) = \mathbb{E}[\eta_1\eta_{1+j}] - \mathbb{E}[\eta_1]\mathbb{E}[\eta_{1+j}]. Cases for j0j\geq0: - When j=0j=0: γ(0)=Var(η1)\gamma(0) = \text{Var}(\eta_1). Since η12=η1\eta_1^2 = \eta_1, E[η12]=1/4\mathbb{E}[\eta_1^2] = 1/4, so γ(0)=14(14)2=316\gamma(0) = \frac{1}{4} - \left(\frac{1}{4}\right)^2 = \frac{3}{16} - When j=1j=1: η1=I(ξ1=1,ξ2=0)\eta_1 = I(\xi_1=1, \xi_2=0), η2=I(ξ2=1,ξ3=0)\eta_2 = I(\xi_2=1, \xi_3=0). If η1=1\eta_1=1 then ξ2=0\xi_2=0, forcing η2=0\eta_2=0, so η1η2=0\eta_1\eta_2=0. Thus E[η1η2]=0\mathbb{E}[\eta_1\eta_2] = 0 and γ(1)=116\gamma(1) = -\frac{1}{16}. - When j2j\geq2: The index sets {ξ1,ξ2}\{\xi_1,\xi_2\} and {ξ1+j,ξ2+j}\{\xi_{1+j},\xi_{2+j}\} do not overlap; by independence, η1\eta_1 and η1+j\eta_{1+j} are independent, so γ(j)=0\gamma(j) = 0. Absolute sum: j=γ(j)=316+2116=516<\sum_{j=-\infty}^\infty |\gamma(j)| = \frac{3}{16} + 2 \cdot \frac{1}{16} = \frac{5}{16} < \infty Step 4. Proof that {ηk}\{\eta_k\} is ergodic. Since ξk\xi_k is i.i.d., it is ergodic. Since ηk=I(ξk=1,ξk+1=0)\eta_k = I(\xi_k=1, \xi_{k+1}=0) is a measurable function of (ξk,ξk+1)(\xi_k, \xi_{k+1}), and a measurable transformation of a stationary ergodic sequence remains ergodic, {ηk}\{\eta_k\} is ergodic. Step 5. Application of the CLT for stationary ergodic sequences. For the stationary ergodic sequence {ηk}\{\eta_k\}, if (1) E[ηk]=μ\mathbb{E}[\eta_k] = \mu is finite and (2) the autocovariance γ(j)\gamma(j) is absolutely summable, then Snnμnσ2dN(0,1)\frac{S_n - n\mu}{\sqrt{n \cdot \sigma^2}} \xrightarrow{d} N(0,1) where σ2=γ(0)+2j=1γ(j)\sigma^2 = \gamma(0) + 2\sum_{j=1}^\infty \gamma(j). Computing: σ2=316+2(116)=116\sigma^2 = \frac{3}{16} + 2\left(-\frac{1}{16}\right) = \frac{1}{16} Hence σ=14\sigma = \frac{1}{4}. Step 6. Conclusion. There exist constants μ=14\mu = \frac{1}{4} and σ=14\sigma = \frac{1}{4} such that SnnμσndN(0,1).\frac{S_n - n\mu}{\sigma \sqrt{n}} \xrightarrow{d} N(0,1).

Final answer

QED.

Marking scheme

The following is the grading rubric for this statistics/probability problem.


1. Checkpoints (max 7 pts total)

Shared prerequisites (Calculation of Moments) [max 5 pts]

*Award these points regardless of the theoretical approach used (Ergodicity vs. mm-dependence).*

  • Mean μ\mu computation (1 pt)
  • Correctly computing E[ηk]=P(ξk=1)P(ξk+1=0)=1/4\mathbb{E}[\eta_k] = P(\xi_k=1)P(\xi_{k+1}=0) = 1/4.
  • Autocovariance structure analysis (3 pts)
  • Variance term (γ(0)\gamma(0)): Correctly computing Var(ηk)=3/16\text{Var}(\eta_k) = 3/16 [additive].
  • Adjacent covariance (γ(1)\gamma(1)): Correctly computing Cov(ηk,ηk+1)=1/16\text{Cov}(\eta_k, \eta_{k+1}) = -1/16.
  • *Note: Must demonstrate the key logic that ηkηk+1=0\eta_k \eta_{k+1} = 0 (mutually exclusive events); if only the result is stated without justification, deduct 0.5.* [additive]
  • Uncorrelatedness (γ(j),j2\gamma(j), j \ge 2): Stating that when the gap j2j \ge 2, ηk\eta_k and ηk+j\eta_{k+j} are independent (or uncorrelated), i.e., the covariance is 0 [additive].
  • Asymptotic variance σ\sigma computation (1 pt)
  • Correctly applying σ2=γ(0)+2j=1γ(j)\sigma^2 = \gamma(0) + 2\sum_{j=1}^\infty \gamma(j) or computing via limn1nVar(Sn)\lim_{n\to\infty} \frac{1}{n}\text{Var}(S_n) to obtain σ2=1/16\sigma^2 = 1/16 (σ=1/4\sigma=1/4).
  • *If preceding covariance computations have errors but the formula and substitution logic are correct, this point may still be awarded (follow-through).*

Score exactly one chain below for the Theoretical Proof:

Chain A: Stationary Ergodic Sequence Path [max 2 pts]

  • Sequence property identification: Explicitly stating that {ηn}\{\eta_n\} is stationary and ergodic. *Derived from the i.i.d. property of ξn\xi_n and the fact that ηn\eta_n is a measurable function thereof.* [1 pt]
  • Theorem citation: Citing the CLT for stationary ergodic sequences and indicating that the conditions are satisfied (e.g., finite second moment, absolute summability of autocovariance). [1 pt]

Chain B: mm-Dependent Sequence Path [max 2 pts]

  • Sequence property identification: Explicitly stating that {ηn}\{\eta_n\} is a 11-dependent sequence (mm-dependent with m=1m=1). *Variables separated by more than 1 index are independent.* [1 pt]
  • Theorem citation: Citing the CLT for mm-dependent sequences. [1 pt]

Total (max 7)


2. Zero-credit items

  • Incorrect premise: Assuming {ηn}\{\eta_n\} is i.i.d. and directly applying the i.i.d. CLT (yielding σ2=Var(η1)=3/16\sigma^2 = \text{Var}(\eta_1) = 3/16).
  • Merely copying the problem: Only restating definitions or given conditions with no derivation.
  • Irrelevant theorem: Citing the SLLN to prove convergence to a constant without discussing convergence in distribution.

3. Deductions

  • Arithmetic error: Errors in simple arithmetic steps leading to incorrect μ\mu or σ\sigma. [-1 flat]
  • Symbol confusion: Confusing σ\sigma (standard deviation) with σ2\sigma^2 (variance), e.g., computing σ2=1/16\sigma^2=1/16 but stating σ=1/16\sigma=1/16. [-0.5 flat]
  • Logical gap: When computing γ(1)\gamma(1), failing to explain why E[η1η2]=0\mathbb{E}[\eta_1 \eta_2]=0 and directly giving an incorrect nonzero value. [-1 flat]
Ask AI ✨