MathIsimple

Probability Theory – Problem 69: Compute the value of ;

Question

A random variable XX has probability density function p(x)=cxe(lnx)22p(x)=\frac{c}{x}e^{-\frac{(\ln x)^{2}}{2}}, x>0x>0. (1) Compute the value of cc; (2) Determine the distribution of lnX\ln X; (3) Discuss whether (X, lnX)(X,\ \ln X) is a continuous random vector.

Step-by-step solution

1. Write the normalization condition and set up the integral The given density is p(x)=cxe(lnx)22,x>0,p(x)=\dfrac{c}{x}e^{-\dfrac{(\ln x)^2}{2}},\quad x>0, and the necessary condition for it to be a probability density function is 0+p(x)dx=1.\int_{0}^{+\infty}p(x)\,dx=1.

Substituting p(x)p(x): 0+cxe(lnx)22dx=1.\int_{0}^{+\infty}\dfrac{c}{x}e^{-\dfrac{(\ln x)^2}{2}}\,dx=1.

2. Evaluate the integral via substitution

Let t=lnxt=\ln x, so x=etx=e^t and dx=etdtdx=e^t dt. Then dxx=etdtet=dt.\dfrac{dx}{x}=\dfrac{e^t dt}{e^t}=dt.

As xx ranges from 00 to ++\infty, t=lnxt=\ln x ranges from -\infty to ++\infty, so

0+cxe(lnx)22dx=c0+1xe(lnx)22dx=c+et22dt.\int_{0}^{+\infty}\dfrac{c}{x}e^{-\dfrac{(\ln x)^2}{2}}\,dx =c\int_{0}^{+\infty}\dfrac{1}{x}e^{-\dfrac{(\ln x)^2}{2}}\,dx =c\int_{-\infty}^{+\infty}e^{-\dfrac{t^2}{2}}\,dt.

It is well known that +et22dt=2π.\int_{-\infty}^{+\infty}e^{-\dfrac{t^2}{2}}\,dt=\sqrt{2\pi}.

Therefore the normalization condition becomes c2π=1.c\sqrt{2\pi}=1.

3. Solve for the constant cc

c=12π.c=\dfrac{1}{\sqrt{2\pi}}.

Hence c=12πc = \dfrac{1}{\sqrt{2\pi}}.

1. Use the one-dimensional transformation formula to find the density

Let Y=lnXY=\ln X, and write g(x)=lnxg(x)=\ln x, so x=eyx=e^y is the inverse function. Since X>0X>0, the range of YY is the entire real line, i.e., y(,+)y\in(-\infty,+\infty).

The transformation formula for a continuous random variable under a monotone invertible map states: if Y=g(X)Y=g(X) and gg is monotone and invertible, then fY(y)=fX(x)dxdyx=g1(y).f_Y(y)=f_X(x)\left|\dfrac{dx}{dy}\right|\bigg|_{x=g^{-1}(y)}.

Here x=eyx=e^y and dxdy=ey\dfrac{dx}{dy}=e^y, so fY(y)=fX(ey)ey.f_Y(y)=f_X(e^y)\cdot e^y.

2. Substitute the known fXf_X and simplify

From Part (1), fX(x)=12π1xe(lnx)22,x>0.f_X(x)=\dfrac{1}{\sqrt{2\pi}}\dfrac{1}{x}e^{-\dfrac{(\ln x)^2}{2}},\quad x>0.

Substituting x=eyx=e^y:

fY(y)=12π1eye(lney)22×ey=12πey22,yR.f_Y(y)=\dfrac{1}{\sqrt{2\pi}}\dfrac{1}{e^y}e^{-\dfrac{(\ln e^y)^2}{2}}\times e^y =\dfrac{1}{\sqrt{2\pi}}e^{-\dfrac{y^2}{2}},\quad y\in\mathbb{R}.

This is precisely the probability density function of the standard normal distribution.

3. State the conclusion

Therefore Y=lnXY=\ln X follows the standard normal distribution, i.e., lnXN(0,1).\ln X\sim \text{N}(0,1).

In terms of the distribution function, P(lnXy)=Φ(y),yR,P(\ln X\le y)=\Phi(y),\quad y\in\mathbb{R}, where Φ(y)\Phi(y) denotes the standard normal distribution function.

1. Recall the definition of a continuous (bivariate) random vector

A random vector (U,V)(U,V) is called a continuous random vector if there exists a nonnegative integrable function fU,V(u,v)f_{U,V}(u,v) such that for every measurable set BR2B\subset\mathbb{R}^2, P((U,V)B)=BfU,V(u,v)dudv,P\big((U,V)\in B\big)=\iint_B f_{U,V}(u,v)\,du\,dv, and R2fU,V(u,v)dudv=1.\iint_{\mathbb{R}^2} f_{U,V}(u,v)\,du\,dv=1.

Under this definition, for any set BB of planar Lebesgue measure zero, P((U,V)B)=0.P\big((U,V)\in B\big)=0.

2. Analyze the support of (X,lnX)(X,\ln X)

Let Y=lnXY=\ln X. Then P(Y=lnX)=1,P\big(Y=\ln X\big)=1, meaning the random vector (X,Y)(X,Y) almost surely satisfies y=lnxy=\ln x.

Therefore (X,Y)(X,Y) takes values almost entirely within the set A={(x,y)R2: x>0, y=lnx},A=\big\{(x,y)\in\mathbb{R}^2:\ x>0,\ y=\ln x\big\}, i.e., P((X,Y)A)=1.P\big((X,Y)\in A\big)=1.

However, the set AA is a smooth curve in the plane; it is a one-dimensional curve in R2\mathbb{R}^2 and has two-dimensional Lebesgue measure zero.

3. Compare with the definition to reach the conclusion

If (X,Y)(X,Y) were a continuous random vector, then for any set BB of Lebesgue measure zero we would have P((X,Y)B)=0P\big((X,Y)\in B\big)=0. Yet here there exists a measure-zero set AA with P((X,Y)A)=1P\big((X,Y)\in A\big)=1.

This contradicts the property of continuous random vectors. Therefore: (X, lnX)(X,\ \ln X) is not a continuous random vector.

Final answer

1. c=12πc=\dfrac{1}{\sqrt{2\pi}}; 2. lnXN(0,1)\ln X\sim \text{N}(0,1), with density flnX(y)=12πey22f_{\ln X}(y)=\dfrac{1}{\sqrt{2\pi}}e^{-\dfrac{y^2}{2}}; 3. The values of (X, lnX)(X,\ \ln X) are almost surely concentrated on the curve y=lnxy=\ln x, which does not satisfy the definition of a continuous random vector; therefore it is not a continuous random vector.

Marking scheme

The following is the marking rubric based on the official solution (total: 7 points).


1. Checkpoints (Total 7 pts)

Part 1: Computing the value of cc (2 pts)

  • Setting up the integral and substitution [1 pt]
  • Write the normalization condition 0+cxe(lnx)22dx=1\int_{0}^{+\infty} \frac{c}{x}e^{-\frac{(\ln x)^2}{2}} dx = 1, and perform the variable substitution t=lnxt=\ln x (or dx/x=dtdx/x = dt), transforming the integral into the Gaussian integral form +et2/2dt\int_{-\infty}^{+\infty} e^{-t^2/2} dt.
  • *Note: If the substitution process is not explicitly shown but the log-normal distribution definition is directly used to correctly identify the normalization coefficient, this point is also awarded.*
  • Obtaining the result [1 pt]
  • Correctly compute c=12πc = \frac{1}{\sqrt{2\pi}}.

Part 2: Computing the distribution of lnX\ln X (3 pts)

*Note: Score exactly one path below; do not add points across paths.*

  • Path A: Density function transformation method
  • Jacobian / derivative term [1 pt]: Write the inverse function x=eyx=e^y of the transformation y=lnxy=\ln x and its derivative (or Jacobian determinant) dxdy=ey\frac{dx}{dy} = e^y.
  • Substitution and simplification [1 pt]: Correctly substitute x=eyx=e^y and the derivative term into the density transformation formula fY(y)=p(ey)eyf_Y(y) = p(e^y) \cdot e^y, and simplify to obtain cey2/2c e^{-y^2/2} or 12πey2/2\frac{1}{\sqrt{2\pi}} e^{-y^2/2}.
  • Final conclusion [1 pt]: Explicitly state that lnX\ln X follows the standard normal distribution N(0,1)N(0,1), or write the complete standard normal probability density function (with domain yRy\in\mathbb{R}).
  • Path B: Distribution function method
  • Definition and conversion [1 pt]: Write the distribution function definition FY(y)=P(lnXy)=P(Xey)=0eyp(x)dxF_Y(y) = P(\ln X \le y) = P(X \le e^y) = \int_{0}^{e^y} p(x) dx.
  • Integral substitution [1 pt]: Use the substitution t=lnxt=\ln x to transform the limits and integrand into the standard normal distribution function form ycet2/2dt\int_{-\infty}^{y} c e^{-t^2/2} dt.
  • Final conclusion [1 pt]: Identify the integral form as the standard normal distribution; conclusion same as Path A.
  • Shared prerequisite [max 1 pt]: If the student made an error in Part 1 leading to an incorrect value of cc, but the derivation logic in this part is entirely correct, only the result point is deducted; process points are retained (follow-through).

Part 3: Discussing whether (X,lnX)(X, \ln X) is a continuous random vector (2 pts)

  • Identifying the support set / measure [1 pt]
  • State that the probability mass of the random vector (X,lnX)(X, \ln X) is concentrated on the plane curve y=lnxy = \ln x; or state that its support set has two-dimensional Lebesgue measure zero.
  • Determination and conclusion [1 pt]
  • Based on the above reasoning (measure zero yet probability 1, or impossibility of writing a bivariate probability density function f(x,y)f(x,y)), conclude: it is not a continuous random vector.

Total (max 7)


2. Zero-credit items

  • Copying the problem: Merely copying the given conditions or formula names (e.g., "use the density formula") without any concrete substitution or computation.
  • No justification in Part 3: In Part 3, answering only "yes" or "no" without any mathematical argument.
  • Conceptual confusion: In Part 3, arguing "because the marginal distributions of XX and lnX\ln X are both continuous, the joint distribution is also continuous" (this conclusion is false; award 0 pts).
  • Incorrectly assuming independence: In Part 3, attempting to construct a density function via f(x,y)=fX(x)fY(y)f(x,y) = f_X(x) \cdot f_Y(y) (the problem does not give independence, and the variables are in fact perfectly dependent; this approach receives 0 pts).

3. Deductions

*Apply at most one most severe deduction; total score shall not fall below 0.*

  • Computational / arithmetic error (-1 pt):
  • Errors in constant handling during integration (e.g., omitting 2π\sqrt{2\pi}, miscalculating coefficients), leading to an incorrect final value of cc or distribution parameters.
  • Missing domain specification (-1 pt):
  • When writing the probability density function expression as a final result, failing to specify the variable range (e.g., yRy \in \mathbb{R} or <y<+-\infty < y < +\infty); however, this deduction is waived if the text explicitly states "follows the standard normal distribution."
  • Logical / notational confusion (-1 pt):
  • Severely confusing random variable notation (uppercase XX) with value notation (lowercase xx), or writing integration limits with confused logic (e.g., 00 to lnx\ln x), rendering the mathematical expression meaningless.
Ask AI ✨