Master the fundamental principles of statistical hypothesis testing: from basic concepts and error analysis to advanced methods and real-world applications in statistical inference.
The baseline hypothesis under test, typically containing '=', '≥', or '≤', representing the status quo or no effect condition.
The hypothesis that contradicts H₀, typically containing '≠', '>', or '<', representing what we're trying to detect.
The probability of rejecting H₀ when it is actually true (false positive). Controlled by significance level.
The probability of failing to reject H₀ when H₁ is true (false negative). Related to statistical power.
H₀ and H₁ must be mutually exclusive and collectively exhaustive
H₀ typically represents the current belief, no change, or no effect
H₁ represents what requires evidence to establish (burden of proof)
Choose one-sided or two-sided based on research question
Structure: H₀: θ = θ₀ vs H₁: θ ≠ θ₀
Rejection Region: T < c₁ or T > c₂
Example: Testing if population mean differs from specified value
Applications:
Structure: H₀: θ ≤ θ₀ vs H₁: θ > θ₀
Rejection Region: T > c
Example: Testing if new process increases efficiency
Applications:
Structure: H₀: θ ≥ θ₀ vs H₁: θ < θ₀
Rejection Region: T < c
Example: Testing if new method reduces error rate
Applications:
Control the maximum Type I error probability at level α, and among all such tests, choose the one with minimum Type II error (maximum power).
Very strong evidence required
Standard in most fields
Exploratory analysis
Among all tests with same significance level, choose the one with highest power (Uniformly Most Powerful when exists)
Clearly state H₀ and H₁ based on research question
H₀: μ = 50 vs H₁: μ ≠ 50
Select appropriate statistic based on data type and assumptions
T = (X̄ - μ₀)/(S/√n) for normal population with unknown variance
Based on H₁ direction and significance level α
For α = 0.05, two-sided: |T| > t₀.₀₂₅(n-1)
Compute statistic value using sample data
t = (15.2 - 15.0)/(0.8/√25) = 1.25
Compare statistic to critical value and conclude
Since |1.25| < 2.064, fail to reject H₀
Find probability of observed result or more extreme under H₀
P-value = 2 × P(t₂₄ > 1.25) = 0.224
Compare test statistic to critical value
Advantages: Direct comparison, Clear decision boundary
Disadvantages: Doesn't show strength of evidence
Compare P-value to significance level
Advantages: Shows strength of evidence, More informative
Disadvantages: Can be misinterpreted
Reject H₀:
Strong evidence against H₀ in favor of H₁
Fail to Reject H₀:
Insufficient evidence to reject H₀ (not proof of H₀)
Common Mistakes:
Scenario:
Testing population mean with known variance
Hypotheses:
Assumptions:
Test Statistic:
Rejection Regions:
Example Application:
Testing if mean height = 170cm with σ = 5cm known
Scenario:
Testing population mean with unknown variance
Hypotheses:
Assumptions:
Test Statistic:
Rejection Regions:
Example Application:
Testing if new teaching method improves test scores
Scenario:
Testing population variance
Hypotheses:
Assumptions:
Test Statistic:
Rejection Regions:
Example Application:
Testing if process variability meets specifications
Scenario:
Comparing means with known variances
Hypotheses:
Assumptions:
Test Statistic:
Example Application:
Comparing treatment effects in clinical trial with known population variances
Scenario:
Comparing means with unknown but equal variances
Hypotheses:
Assumptions:
Test Statistic:
Pooled Variance:
Example Application:
Comparing test scores between two teaching methods
Scenario:
Comparing population variances
Hypotheses:
Assumptions:
Test Statistic:
Rejection Regions:
Example Application:
Testing if two processes have equal variability before pooling data
Motivation:
When optimal tests don't exist or are unknown, GLRT provides a systematic approach
Principle:
Compare maximum likelihood under full parameter space to maximum likelihood under null hypothesis constraint
where r = () - ()
There's a one-to-one correspondence between confidence intervals and hypothesis tests at the same confidence/significance level
From acceptance regions to confidence sets
Explanation: The confidence set contains all parameter values that would not be rejected by the test
From confidence sets to acceptance regions
Explanation: Accept H₀: θ = θ₀ if and only if θ₀ lies within the confidence interval