MathIsimple

Probability Theory – Problem 64: Compute the value of ;

Question

The random variable XX has probability density function p(x)=cxe(lnx)22p(x)=\frac{c}{x}e^{-\frac{(\ln x)^{2}}{2}}, x>0x>0. (1) Compute the value of cc; (2) Determine the distribution of lnX\ln X; (3) Discuss whether (X, lnX)(X,\ \ln X) is a continuous random vector.

Step-by-step solution

1. Write the normalization condition and set up the integral Given the density p(x)=cxe(lnx)22p(x)=\dfrac{c}{x}e^{-\dfrac{(\ln x)^2}{2}}, x>0x>0, the necessary condition for it to be a probability density function is 0+p(x)dx=1.\int_{0}^{+\infty}p(x)\,dx=1.

Substituting p(x)p(x): 0+cxe(lnx)22dx=1.\int_{0}^{+\infty}\dfrac{c}{x}e^{-\dfrac{(\ln x)^2}{2}}\,dx=1.

2. Evaluate the integral via substitution

Let t=lnxt=\ln x, so x=etx=e^t and dx=etdtdx=e^t dt, giving dxx=etdtet=dt.\dfrac{dx}{x}=\dfrac{e^t dt}{e^t}=dt.

As xx ranges from 00 to ++\infty, t=lnxt=\ln x ranges from -\infty to ++\infty, so 0+cxe(lnx)22dx=c+et22dt.\int_{0}^{+\infty}\dfrac{c}{x}e^{-\dfrac{(\ln x)^2}{2}}\,dx =c\int_{-\infty}^{+\infty}e^{-\dfrac{t^2}{2}}\,dt.

Using the known result +et22dt=2π,\int_{-\infty}^{+\infty}e^{-\dfrac{t^2}{2}}\,dt=\sqrt{2\pi},

the normalization condition becomes c2π=1.c\sqrt{2\pi}=1.

3. Solve for the constant cc

c=12π.c=\dfrac{1}{\sqrt{2\pi}}.

1. Use the one-dimensional transformation formula to find the density

Let Y=lnXY=\ln X, and set g(x)=lnxg(x)=\ln x, so x=eyx=e^y is the inverse function. Since X>0X>0, YY takes values on the entire real line, i.e., y(,+)y\in(-\infty,+\infty).

The one-dimensional invertible transformation formula for continuous random variables states: if Y=g(X)Y=g(X) and gg is monotone and invertible, then fY(y)=fX(x)dxdyx=g1(y).f_Y(y)=f_X(x)\left|\dfrac{dx}{dy}\right|\bigg|_{x=g^{-1}(y)}.

Here x=eyx=e^y and dxdy=ey\dfrac{dx}{dy}=e^y, so fY(y)=fX(ey)ey.f_Y(y)=f_X(e^y)\cdot e^y.

2. Substitute the known fXf_X and simplify

From Part 1: fX(x)=12π1xe(lnx)22,x>0.f_X(x)=\dfrac{1}{\sqrt{2\pi}}\dfrac{1}{x}e^{-\dfrac{(\ln x)^2}{2}},\quad x>0.

Substituting x=eyx=e^y: fY(y)=12π1eye(lney)22×ey=12πey22,yR.f_Y(y)=\dfrac{1}{\sqrt{2\pi}}\dfrac{1}{e^y}e^{-\dfrac{(\ln e^y)^2}{2}}\times e^y =\dfrac{1}{\sqrt{2\pi}}e^{-\dfrac{y^2}{2}},\quad y\in\mathbb{R}.

This is precisely the standard normal probability density function.

3. State the conclusion

Therefore Y=lnXY=\ln X follows the standard normal distribution: lnXN(0,1).\ln X\sim \text{N}(0,1).

In terms of the distribution function: P(lnXy)=Φ(y),yR,P(\ln X\le y)=\Phi(y),\quad y\in\mathbb{R}, where Φ(y)\Phi(y) is the standard normal distribution function.

1. Recall the definition of a continuous two-dimensional random vector

A random vector (U,V)(U,V) is called a continuous random vector if there exists a non-negative integrable function fU,V(u,v)f_{U,V}(u,v) such that for any measurable set BR2B\subset\mathbb{R}^2, P((U,V)B)=BfU,V(u,v)dudv,P\big((U,V)\in B\big)=\iint_B f_{U,V}(u,v)\,du\,dv, and R2fU,V(u,v)dudv=1.\iint_{\mathbb{R}^2} f_{U,V}(u,v)\,du\,dv=1.

Under this definition, if fU,Vf_{U,V} exists, then for any set BB with two-dimensional Lebesgue measure 00, P((U,V)B)=0.P\big((U,V)\in B\big)=0.

2. Analyze the support of (X,lnX)(X,\ln X)

Let Y=lnXY=\ln X. Then P(Y=lnX)=1,P\big(Y=\ln X\big)=1, meaning the random vector (X,Y)(X,Y) almost surely satisfies y=lnxy=\ln x.

Therefore (X,Y)(X,Y) takes values almost entirely on the set A={(x,y)R2: x>0, y=lnx},A=\big\{(x,y)\in\mathbb{R}^2:\ x>0,\ y=\ln x\big\}, i.e., P((X,Y)A)=1.P\big((X,Y)\in A\big)=1.

However, the set AA is a smooth curve in the plane; it is a one-dimensional curve in R2\mathbb{R}^2 and has two-dimensional Lebesgue measure 00.

3. Compare with the definition to reach the conclusion

If (X,Y)(X,Y) were a continuous random vector, then for any set BB with Lebesgue measure 00, we should have P((X,Y)B)=0P\big((X,Y)\in B\big)=0. However, here there exists a set AA of measure 00 with P((X,Y)A)=1P\big((X,Y)\in A\big)=1.

This contradicts the property of continuous random vectors. Therefore: (X, lnX)(X,\ \ln X) is not a continuous random vector.

Final answer

1. c=12πc=\dfrac{1}{\sqrt{2\pi}}; 2. lnXN(0,1)\ln X\sim \text{N}(0,1), with density flnX(y)=12πey22f_{\ln X}(y)=\dfrac{1}{\sqrt{2\pi}}e^{-\dfrac{y^2}{2}}; 3. The values of (X, lnX)(X,\ \ln X) are almost surely concentrated on the curve y=lnxy=\ln x, which does not satisfy the definition of a continuous random vector; therefore it is not a continuous random vector.

Marking scheme

The following is the complete marking scheme for this probability theory problem (maximum 7 points).


1. Checkpoints (Total 7 pts)

Part 1: Computing the value of cc (2 points)

*Note: This part tests the use of integral normalization to find the constant.*

  • Setting up the integral and substitution [1 pt]
  • Write the normalization condition 0+cxe(lnx)22dx=1\int_{0}^{+\infty} \frac{c}{x}e^{-\frac{(\ln x)^2}{2}} dx = 1 and perform the substitution t=lnxt=\ln x (or dx/x=dtdx/x = dt), transforming the integral into the Gaussian integral form +et2/2dt\int_{-\infty}^{+\infty} e^{-t^2/2} dt.
  • *Note: If the substitution process is not explicitly shown but the normalization coefficient relationship is correctly identified using the definition of the lognormal distribution, this point may still be awarded.*
  • Solving for the result [1 pt]
  • Correctly compute c=12πc = \frac{1}{\sqrt{2\pi}}.

Part 2: Distribution of lnX\ln X (3 points)

*Note: Score exactly one path below | do not add across paths. This part tests the distribution transformation of a function of a random variable.*

  • Path A: Density function transformation method
  • Jacobian/derivative term [1 pt]: Write the inverse function x=eyx=e^y of the transformation y=lnxy=\ln x and its derivative (or Jacobian determinant) dxdy=ey\frac{dx}{dy} = e^y.
  • Substitution and simplification [1 pt]: Correctly substitute x=eyx=e^y and the derivative term into the density transformation formula fY(y)=p(ey)eyf_Y(y) = p(e^y) \cdot e^y, and simplify to obtain cey2/2c e^{-y^2/2} or 12πey2/2\frac{1}{\sqrt{2\pi}} e^{-y^2/2}.
  • Final conclusion [1 pt]: Explicitly state that lnX\ln X follows the standard normal distribution N(0,1)N(0,1), or write the complete standard normal probability density function (with domain yRy\in\mathbb{R}).
  • Path B: Distribution function method
  • Definition and transformation [1 pt]: Write the distribution function definition FY(y)=P(lnXy)=P(Xey)=0eyp(x)dxF_Y(y) = P(\ln X \le y) = P(X \le e^y) = \int_{0}^{e^y} p(x) dx.
  • Integral substitution [1 pt]: Use the substitution t=lnxt=\ln x to transform the integration limits and integrand into the standard normal distribution function form ycet2/2dt\int_{-\infty}^{y} c e^{-t^2/2} dt.
  • Final conclusion [1 pt]: Identify from the integral form that this is the standard normal distribution; conclusion same as Path A.
  • Shared prerequisite [max 1 pt]: If the student computed cc incorrectly in Part 1 but the derivation logic in this part is entirely correct, only the result point is deducted; process points are retained (follow-through).

Part 3: Whether (X,lnX)(X, \ln X) is a continuous random vector (2 points)

*Note: This part tests understanding of the definition of a two-dimensional continuous random vector.*

  • Identifying the support / measure [1 pt]
  • State that the probability mass of the random vector (X,lnX)(X, \ln X) is concentrated on the plane curve y=lnxy = \ln x; or state that its support has two-dimensional Lebesgue measure 0.
  • Determination and conclusion [1 pt]
  • Based on the above reasoning (measure 0 yet probability 1, or impossibility of writing a two-dimensional probability density f(x,y)f(x,y)), conclude: it is not a continuous random vector.

Total (max 7)


2. Zero-credit items

  • Copying the problem: Merely copying the given conditions or formula names (e.g., "use the density formula") without any concrete substitution or computation.
  • Unjustified guess in Part 3: In Part 3, merely answering "yes" or "no" without any mathematical argument.
  • Conceptual confusion: In Part 3, arguing "since the marginal distributions of XX and lnX\ln X are both continuous, the joint distribution is also continuous" (this conclusion is false; receives 0 points).
  • Incorrect independence assumption: In Part 3, attempting to construct a density via f(x,y)=fX(x)fY(y)f(x,y) = f_X(x) \cdot f_Y(y) (the problem does not give independence, and in fact the variables are perfectly dependent; this approach receives 0 points).

3. Deductions

*Apply at most one of the following (the most severe); total score cannot go below 0.*

  • Computation/arithmetic error (-1 pt):
  • Errors in constant handling during integration (e.g., missing 2π\sqrt{2\pi}, incorrect coefficients), leading to wrong values of cc or distribution parameters.
  • Missing domain (-1 pt):
  • When writing the probability density function as the final result, failing to specify the variable range (e.g., yRy \in \mathbb{R} or <y<+-\infty < y < +\infty). However, this deduction is waived if the text explicitly states "follows the standard normal distribution."
  • Logical/notational confusion (-1 pt):
  • Severely confusing the random variable notation (uppercase XX) with the value notation (lowercase xx), or writing integration limits with confused logic (e.g., 00 to lnx\ln x), rendering the mathematical expression meaningless.
Ask AI ✨