Master the core learning framework for rule extraction. Learn the divide-and-conquer strategy, greedy algorithm characteristics, and step-by-step examples with loan approval classification.
Sequential covering is a divide-and-conquer approach to learning rule sets. The algorithm learns one rule at a time, removes all samples covered by that rule (both positive and negative), and repeats the process on the remaining samples until all samples are covered or no effective rules can be learned.
Each learned rule should cover as many positive examples as possible while avoiding negative examples. Once a rule is learned, all covered samples (positive and negative) are removed from the training set, and the process continues with the remaining samples.
Why Remove Covered Samples?
Sequential covering is a greedy algorithm: at each step, it selects the locally optimal rule (best for current samples) without considering the global impact on the final rule set.
If the algorithm first learns rule "approve ← income ≥ $50,000", it may miss a better rule "approve ← income ≥ $40,000 ∧ credit_score ≥ 750" that covers more samples with higher accuracy. This is why advanced algorithms like RIPPER use global optimization to refine rules.
Apply sequential covering to learn a rule set for loan approval. The training set contains 100 samples: 60 approve (positive) and 40 reject (negative).
| Step | Learned Rule | Samples Removed | Remaining |
|---|---|---|---|
| 1 | IF income ≥ $50,000 AND credit_score ≥ 700 THEN approve | 45 | 55 |
| 2 | IF employment_years ≥ 2 AND credit_score ≥ 650 THEN approve | 30 | 25 |
| 3 | IF debt_ratio < 0.3 AND savings ≥ $10,000 THEN approve | 20 | 5 |
| 4 | Default: reject | 5 | 0 |
After 4 steps, all 100 samples are covered. Rules are evaluated in order (1 → 2 → 3 → default).
Training set: 60 approve, 40 reject (100 total). Learn first rule covering high-income, high-credit applicants. Rule covers 45 samples (40 approve, 5 reject). Remove all 45 samples.
Remaining: 20 approve, 35 reject (55 total). Learn rule for stable employment cases. Rule covers 30 samples (25 approve, 5 reject). Remove all 30 samples.
Remaining: 5 approve, 25 reject (30 total). Learn rule for low-debt, high-savings cases. Rule covers 20 samples (15 approve, 5 reject). Remove all 20 samples.
Remaining: 0 approve, 5 reject (5 total). Add default rule "reject" to handle remaining samples. All samples now covered.
Four rules learned sequentially. Each rule handles a distinct subset of samples. Rules are evaluated in order, and the first matching rule's prediction is used.
Rule Coverage
Number of samples (positive and negative) that satisfy the rule body. A rule with high positive coverage and low negative coverage is desirable.
Rule Accuracy
Proportion of covered samples that are positive:
The algorithm stops when one of the following conditions is met: