Question
For random variables $X$ and $Y$, define
$$d(X, Y) = E\left[\frac{|X-Y|}{1+|X-Y|}\right].$$
(1) Prove that $d$ defines a distance (metric) on the space of random variables (with random variables identified when they are equal almost surely). That is, show that $d$ satisfies positive definiteness, symmetry, and the triangle inequality.
(2) Prove that the random variables $X_n$ converge to $X$ in probability if and only if $d(X_n, X) \to 0$ as $n \to \infty$.
Step-by-step solution
Step 1. We first prove that $d$ satisfies the definition of a metric. (i) Non-negativity and positive definiteness: Since $|X-Y| \ge 0$, the integrand satisfies $\frac{|X-Y|}{1+|X-Y|} \ge 0$, so $d(X,Y) \ge 0$ after taking expectations. If $d(X,Y) = 0$, then the non-negative random variable $\frac{|X-Y|}{1+|X-Y|}$ has expectation zero, which implies that it equals zero almost surely. Consequently $|X-Y| = 0$ almost surely, i.e., $X = Y$ a.s. (ii) Symmetry: Since $|X-Y| = |Y-X|$, we have $d(X,Y) = d(Y,X)$. (iii) Triangle inequality: Consider the function $f(t) = \frac{t}{1+t}$. For $t \ge 0$, its derivative is $f'(t) = \frac{1}{(1+t)^2} > 0$, so $f$ is strictly increasing on $[0, \infty)$. For any real numbers $a, b$, the absolute value inequality gives $|a+b| \le |a| + |b|$. Let $a = X - Z$ and $b = Z - Y$, so that $a + b = X - Y$. Using the monotonicity of $f$ together with the inequality $\frac{s+t}{1+s+t} = \frac{s}{1+s+t} + \frac{t}{1+s+t} \le \frac{s}{1+s} + \frac{t}{1+t}$ for $s, t \ge 0$, we obtain:
$$\frac{|X-Y|}{1+|X-Y|} \le \frac{|X-Z|+|Z-Y|}{1+|X-Z|+|Z-Y|} \le \frac{|X-Z|}{1+|X-Z|} + \frac{|Z-Y|}{1+|Z-Y|}.$$
Taking expectations on both sides yields $d(X,Y) \le d(X,Z) + d(Z,Y)$. In summary, $d$ defines a metric on the space of random variables (where equality is understood in the almost sure sense).
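The metric properties proved above can be illustrated numerically. The following sketch (an illustration, not part of the proof; the distributions of $X$, $Y$, $Z$ are arbitrary choices) estimates $d$ by a sample average of the integrand. Note that symmetry, $d(X,X)=0$, and the triangle inequality all hold pathwise for the integrand, so they hold exactly for the empirical averages as well:

```python
# Numerical illustration of the metric properties of
# d(X, Y) = E[|X - Y| / (1 + |X - Y|)] via Monte Carlo sampling.
import numpy as np

rng = np.random.default_rng(0)

def d_hat(x, y):
    """Sample estimate of d(X, Y): average of |x-y| / (1 + |x-y|)."""
    t = np.abs(x - y)
    return np.mean(t / (1.0 + t))

n = 100_000
X = rng.normal(0.0, 1.0, n)      # illustrative distributions
Y = rng.normal(1.0, 2.0, n)
Z = rng.exponential(1.0, n)

# Positive definiteness: d(X, X) = 0.
assert d_hat(X, X) == 0.0

# Symmetry: |X - Y| = |Y - X|, so the estimates agree exactly.
assert d_hat(X, Y) == d_hat(Y, X)

# Triangle inequality: holds pathwise for the integrands,
# hence also for their sample averages.
assert d_hat(X, Y) <= d_hat(X, Z) + d_hat(Z, Y)
```

Because each property holds sample-by-sample, the assertions hold exactly and not merely up to Monte Carlo error.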
Step 2. Proof of sufficiency: Suppose $d(X_n, X) \to 0$. For any given $\varepsilon > 0$, consider the event $\{|X_n - X| \ge \varepsilon\}$. On this event, by the monotonicity of $f(t) = \frac{t}{1+t}$, we have $\frac{|X_n - X|}{1+|X_n - X|} \ge \frac{\varepsilon}{1+\varepsilon}$. By a generalization of Chebyshev's inequality (or directly from properties of expectation):
$$d(X_n, X) = E\left[\frac{|X_n - X|}{1+|X_n - X|}\right] \ge \frac{\varepsilon}{1+\varepsilon}\, P(|X_n - X| \ge \varepsilon).$$
Rearranging gives $P(|X_n - X| \ge \varepsilon) \le \frac{1+\varepsilon}{\varepsilon}\, d(X_n, X)$. As $n \to \infty$, since $d(X_n, X) \to 0$, it follows that $P(|X_n - X| \ge \varepsilon) \to 0$. Hence $X_n$ converges to $X$ in probability.
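The Chebyshev-type bound used in Step 2 can also be checked on samples. In the sketch below, $X_n = X + $ noise is an illustrative choice; since $f(t) \ge \frac{\varepsilon}{1+\varepsilon}\,\mathbf{1}\{t \ge \varepsilon\}$ holds pathwise, the empirical version of the bound holds exactly:

```python
# Sample check of the bound from Step 2:
#   P(|X_n - X| >= eps) <= (1 + eps)/eps * d(X_n, X).
import numpy as np

rng = np.random.default_rng(1)
n = 200_000
X = rng.normal(size=n)
Xn = X + rng.normal(scale=0.3, size=n)   # X_n = X + noise (illustrative)

t = np.abs(Xn - X)
d_emp = np.mean(t / (1.0 + t))           # empirical d(X_n, X)

for eps in (0.1, 0.5, 1.0):
    p_emp = np.mean(t >= eps)            # empirical P(|X_n - X| >= eps)
    # The pathwise inequality f(t) >= eps/(1+eps) on {t >= eps}
    # makes this hold exactly for the sample averages.
    assert p_emp <= (1.0 + eps) / eps * d_emp
```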
Step 3. Proof of necessity: Suppose $X_n \to X$ in probability, i.e., $P(|X_n - X| \ge \varepsilon) \to 0$ for every $\varepsilon > 0$. For any given $\varepsilon > 0$, we split the expectation into two parts:
$$d(X_n, X) = E\left[\frac{|X_n - X|}{1+|X_n - X|};\ |X_n - X| \ge \varepsilon\right] + E\left[\frac{|X_n - X|}{1+|X_n - X|};\ |X_n - X| < \varepsilon\right].$$
For the first term, the integrand is always bounded above by 1, so the first term is at most $P(|X_n - X| \ge \varepsilon)$. For the second term, when $|X_n - X| < \varepsilon$, we have $\frac{|X_n - X|}{1+|X_n - X|} < \frac{\varepsilon}{1+\varepsilon} \le \varepsilon$, so the second term is at most $\varepsilon$. Thus $d(X_n, X) \le P(|X_n - X| \ge \varepsilon) + \varepsilon$. Since $X_n \to X$ in probability, as $n \to \infty$ we have $\limsup_{n \to \infty} d(X_n, X) \le \varepsilon$. Since $\varepsilon > 0$ was arbitrary, we conclude $d(X_n, X) \to 0$.
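The necessity direction can be observed numerically. In this sketch (an illustration under an assumed model, not part of the proof) we take $X_n = X + Z/n$ with $Z$ standard normal, so $|X_n - X| = |Z|/n \to 0$, a fortiori in probability, and the empirical $d(X_n, X)$ shrinks toward 0:

```python
# Illustration of Step 3: if X_n -> X in probability, then d(X_n, X) -> 0.
# Here X_n = X + Z/n (an assumed toy model), so |X_n - X| = |Z|/n.
import numpy as np

rng = np.random.default_rng(2)
m = 100_000
Z = rng.normal(size=m)

def d_emp(n):
    """Empirical d(X_n, X) for X_n = X + Z/n."""
    t = np.abs(Z) / n                    # |X_n - X|
    return np.mean(t / (1.0 + t))

ds = [d_emp(n) for n in (1, 10, 100, 1000)]

# d(X_n, X) decreases along this sequence and becomes small,
# consistent with d(X_n, X) -> 0.
assert all(a > b for a, b in zip(ds, ds[1:]))
assert ds[-1] < 0.01
```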
Final answer
QED.
Marking scheme
The following is the scoring rubric based on the official solution (maximum 7 points).
I. Checkpoints (Total max 7)
Part 1: Proving that $d$ is a metric (max 2 pts)
- Positive definiteness and symmetry [additive]
- State that $d(X,Y) \ge 0$ and $d(X,Y) = 0 \iff X = Y$ a.s. (almost sure equality), and briefly justify symmetry $d(X,Y) = d(Y,X)$.
- 1 pt
- Triangle inequality [additive]
- Use the monotonicity or subadditivity of the function $f(t) = \frac{t}{1+t}$ (i.e., $f(s+t) \le f(s) + f(t)$ for $s, t \ge 0$) to derive $d(X,Y) \le d(X,Z) + d(Z,Y)$.
- *If only the triangle inequality formula is stated without proving the key algebraic inequality, no credit is awarded.*
- 1 pt
Part 2: Proving the equivalence with convergence in probability (max 5 pts)
- Sufficiency ($d(X_n, X) \to 0 \Rightarrow X_n \to X$ in probability): [additive]
- Use Chebyshev's inequality (or Markov's inequality) to establish the connection between $d(X_n, X)$ and $P(|X_n - X| \ge \varepsilon)$.
- Core argument: obtain $P(|X_n - X| \ge \varepsilon) \le \frac{1+\varepsilon}{\varepsilon}\, d(X_n, X)$, or note that on the event $\{|X_n - X| \ge \varepsilon\}$ the integrand has the lower bound $\frac{\varepsilon}{1+\varepsilon}$.
- 2 pts
- Necessity ($X_n \to X$ in probability $\Rightarrow d(X_n, X) \to 0$):
- Score exactly one chain; take the maximum subtotal among chains; do not add points across chains.
- `Chain A (Truncation/decomposition method)`
- Decompose the expectation: Split $d(X_n, X)$ into integrals over the regions $\{|X_n - X| \ge \varepsilon\}$ and $\{|X_n - X| < \varepsilon\}$ (or a similar approach). [1 pt]
- Bounding and taking limits: Correctly bound both parts (the first part by $P(|X_n - X| \ge \varepsilon)$, the second part by $\frac{\varepsilon}{1+\varepsilon} \le \varepsilon$), and let $n \to \infty$ and then $\varepsilon \to 0$ to show the limit is 0. [2 pts]
- `Chain B (Convergence theorem method)`
- Transfer of convergence in probability: Note that $X_n \to X$ in probability implies $\frac{|X_n - X|}{1+|X_n - X|} \to 0$ in probability. [1 pt]
- Cite a theorem: Invoke the Dominated Convergence Theorem (DCT) (dominated by 1) or the Bounded Convergence Theorem to conclude $d(X_n, X) = E\left[\frac{|X_n - X|}{1+|X_n - X|}\right] \to 0$. [2 pts]
Total (max 7)
II. Zero-credit items
- Merely copying the definition of the metric or the definition of convergence in probability from the problem statement without performing any derivation.
- In proving the triangle inequality, simply asserting that $|X-Y| \le |X-Z| + |Z-Y|$ implies the corresponding inequality for expectations, without addressing the effect of the denominator $1 + |X-Y|$.
- In Part 2, merely stating that "convergence in expectation implies convergence in probability" or vice versa, without providing a proof specific to the nonlinear metric $d$.
III. Deductions
- Omitting almost sure equality (a.s.): In proving positive definiteness, if it is not stated that $X = Y$ holds only in the "almost sure" sense (or $P(X = Y) = 1$), deduct 1 point.
- Confusing convergence concepts: In the necessity proof, if pointwise convergence or almost sure convergence of $X_n$ to $X$ is erroneously assumed in order to directly interchange limit and integral, without mentioning subsequences or properties of convergence in probability, deduct 1 point (in Chain B, if DCT is used, the version under convergence in probability must be made explicit or the subsequence principle must be invoked; otherwise this is treated as a logical gap).
- Circular reasoning: If the conclusion to be proved is used as a premise in the proof (e.g., directly using the fact that $d$ metrizes convergence in probability to prove the convergence property), that part receives 0 points.