Master the foundational concepts of mathematical statistics from population theory to statistical inference
Mathematical statistics is the science of reasoning and decision-making under uncertainty, focusing on inferring unknown population characteristics from observed sample data.
Infer population distribution from random sample
Uses probability theory as foundation, develops inference procedures with quantifiable reliability
| Aspect | Probability Theory | Mathematical Statistics |
|---|---|---|
| Direction | Population → Sample (Forward) | Sample → Population (Inverse) |
| Known Information | Distribution F is known | Distribution F is unknown |
| Question Type | What samples will we get? | What is the population? |
A statistical population is the complete collection of all individuals or units under study, characterized mathematically by its cumulative distribution function:
A simple random sample is a collection of random variables:
that are independent and identically distributed (i.i.d.), each with same distribution as population.
1. Independence
2. Identical Distribution
Unbiasedness
Variance
Why n-1? Bessel's correction makes an unbiased estimator:
Glivenko-Cantelli Theorem
Empirical distribution converges uniformly to true distribution almost surely
Therefore, the probability limit is 0.
Let . Then , .
The MGF of near 0 is .
is the MGF of . By the continuity theorem for MGFs, the distribution converges.
Problem: For a population with mean and variance , find the expected value and variance of the sample mean for a sample of size .
Solution:
By the properties of sample mean:
Problem: Given a sample , construct the empirical distribution function .
Solution:
The empirical CDF is a step function:
Problem: For a population with and , approximate using CLT.
Solution:
By CLT, since .
Test your understanding with 10 multiple-choice questions
Probability theory works forwards from known distributions to predict sample behavior, while mathematical statistics works backwards from observed samples to infer unknown population characteristics. Probability asks "what samples will we get?", while statistics asks "what is the population?".
Dividing by (Bessel's correction) makes the sample variance an unbiased estimator of the population variance. This correction accounts for the fact that we're using the sample mean rather than the true population mean, which introduces a slight underestimation that corrects.
I.i.d. stands for "independent and identically distributed". It means each observation comes from the same distribution and doesn't depend on other observations. This assumption is crucial because it allows us to apply powerful statistical theorems like the Law of Large Numbers and Central Limit Theorem.
A statistical population is the complete collection of all individuals or units under study, characterized by a distribution function . It can be finite (e.g., all students in a school) or infinite (e.g., all possible measurements of a physical quantity). The population distribution typically contains unknown parameters we want to estimate.
Sample size depends on several factors: desired precision, population variability, confidence level, and the specific inference goal. Generally, larger samples provide more precise estimates. Rules of thumb include for CLT applications, but power analysis provides more rigorous sample size determination for specific tests.