Naive Bayes

"Training" a Naive Bayes Classifier

  • Recall the naive Bayes classifier:

      ŷ = argmax_c P(C = c) ∏_i P(A_i = a_i | C = c)

  • To perform classification we need the:

    • Class priors: P(C = c)
    • Class-conditional attribute probabilities: P(A_i = a_i | C = c), for every attribute A_i and value a_i.
  • These were the tallies from our in-class exercise:

    Spy Golfer Fedora Count
    T T T 1
    T T F 3
    T F T 1
    T F F 0
    F T T 4
    F T F 3
    F F T 6
    F F F 2

Relevant Distributions

  • From this we can easily estimate our priors:

      P(Spy = T) = 5/20 = .25,  P(Spy = F) = 15/20 = .75
  • We can also calculate the (full) class-conditional probability distributions:
    Golfer Fedora P(Golfer, Fedora | Spy = True)
    T      T      1/5 = .2
    T      F      3/5 = .6
    F      T      1/5 = .2
    F      F      0/5 = 0

    Golfer Fedora P(Golfer, Fedora | Spy = False)
    T      T      4/15 ≈ .27
    T      F      3/15 = .2
    F      T      6/15 = .4
    F      F      2/15 ≈ .13
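
A minimal sketch (not from the slides) of these estimates in Python; the names (tallies, priors, joint) are made up for illustration:

```python
from collections import defaultdict

# (Spy, Golfer, Fedora) -> count, copied from the tally table above
tallies = {
    (True,  True,  True):  1,
    (True,  True,  False): 3,
    (True,  False, True):  1,
    (True,  False, False): 0,
    (False, True,  True):  4,
    (False, True,  False): 3,
    (False, False, True):  6,
    (False, False, False): 2,
}

total = sum(tallies.values())        # 20 observations
class_counts = defaultdict(int)      # observations per class
for (spy, _, _), n in tallies.items():
    class_counts[spy] += n           # 5 spies, 15 non-spies

# Class priors: P(Spy = s)
priors = {s: n / total for s, n in class_counts.items()}
print(priors)  # {True: 0.25, False: 0.75}

# Full class-conditional joint: P(Golfer = g, Fedora = f | Spy = s)
joint = {(s, g, f): n / class_counts[s] for (s, g, f), n in tallies.items()}
print(joint[(True, True, True)])    # 0.2 = 1/5
print(joint[(False, False, True)])  # 0.4 = 6/15
```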

Naive Bayes Distributions

  • Under the naive assumption we only need per-attribute conditionals, found by summing the tally counts over the other attribute:

    Golfer P(Golfer | Spy = True)
    T      4/5 = .8
    F      1/5 = .2

    Golfer P(Golfer | Spy = False)
    T      7/15 ≈ .47
    F      8/15 ≈ .53

    Fedora P(Fedora | Spy = True)
    T      2/5 = .4
    F      3/5 = .6

    Fedora P(Fedora | Spy = False)
    T      10/15 ≈ .67
    F      5/15 ≈ .33
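
Continuing the sketch above (and assuming the tallies and class_counts it defined), these naive tables fall out of a one-line marginalization:

```python
# P(attribute = value | Spy = spy); attr_index 1 = Golfer, 2 = Fedora
def conditional(attr_index, value, spy):
    count = sum(n for key, n in tallies.items()
                if key[0] == spy and key[attr_index] == value)
    return count / class_counts[spy]

print(conditional(1, True, True))   # P(Golfer = T | Spy = T) = 4/5 = 0.8
print(conditional(2, True, False))  # P(Fedora = T | Spy = F) = 10/15 ≈ 0.67
```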

Performing Classification (Non-Naive):

  • Assume we have a suspect who is a golfer and who wears a fez (not a fedora, so Fedora = F): are they a spy?
    • We could apply (non-naive) Bayes rule, reading P(Golfer, Fedora | Spy) off the full joint tables:

      P(Spy = T | G = T, F = F) = P(G = T, F = F | Spy = T) P(Spy = T) / Σ_s P(G = T, F = F | Spy = s) P(Spy = s)
                                = (.6)(.25) / [(.6)(.25) + (.2)(.75)]
                                = .15 / .30 = .5
Performing Classification (Naive-Bayes):

  • Under the naive assumption the attributes are conditionally independent given the class, so we use the per-attribute tables instead:

      P(Spy = T) P(G = T | Spy = T) P(F = F | Spy = T) = .25 × .8 × .6 = .12
      P(Spy = F) P(G = T | Spy = F) P(F = F | Spy = F) = .75 × 7/15 × 5/15 ≈ .117

    • Normalizing: P(Spy = T | G = T, F = F) = .12 / (.12 + .117) ≈ .51, close to the non-naive answer here.
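
A sketch of both rules side by side, assuming the priors, joint, and conditional helpers from the earlier snippets:

```python
# Posterior P(Spy = T | golfer, fedora) from the full joint tables
def non_naive_posterior(golfer, fedora):
    num = joint[(True, golfer, fedora)] * priors[True]
    den = num + joint[(False, golfer, fedora)] * priors[False]
    return num / den

# Same posterior under the naive (conditional independence) assumption
def naive_posterior(golfer, fedora):
    scores = {s: priors[s]
                 * conditional(1, golfer, s)
                 * conditional(2, fedora, s)
              for s in (True, False)}
    return scores[True] / (scores[True] + scores[False])

print(non_naive_posterior(True, False))  # 0.5
print(naive_posterior(True, False))      # ≈ 0.507
```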

Properties of Naive-Bayes

  • Pros:
    • Provides a meaningful class probability, not just a class label
    • Works in the face of missing attributes (just don't include them in the calculation; see the sketch after this list)
    • Relatively easy to interpret: we can examine the class-conditional probabilities for individual attributes.
  • Cons:
    • Classification performance may be worse than that of other classifiers: most real classification tasks violate the independence assumption to some extent.
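
As a sketch of the missing-attribute point above (again assuming the earlier priors and conditional helpers), omitting an unobserved attribute just drops its factor from the product:

```python
# None marks an attribute we did not observe for this suspect
def naive_posterior_partial(golfer=None, fedora=None):
    scores = {}
    for s in (True, False):
        p = priors[s]
        if golfer is not None:
            p *= conditional(1, golfer, s)
        if fedora is not None:
            p *= conditional(2, fedora, s)
        scores[s] = p
    return scores[True] / (scores[True] + scores[False])

print(naive_posterior_partial(golfer=True))  # headwear unknown: ≈ 0.36
```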

Implementation Issues

  • Naive Bayes classifier:

      ŷ = argmax_c P(c) ∏_i P(a_i | c)

  • Each P(a_i | c) is less than 1.
  • What is the product of 100 such values? Of 1,000? (Small enough to underflow floating-point precision.)
  • Recall that log(ab) = log(a) + log(b)
  • Also, the log function is monotonic: if a > b then log(a) > log(b)
  • So, practical implementations generally work with logs:

      ŷ = argmax_c [ log P(c) + Σ_i log P(a_i | c) ]
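
A sketch of the log-space version, reusing the hypothetical helpers from the earlier snippets:

```python
import math

# Same decision as naive_posterior, computed as a sum of logs
# (math.log(0) raises an error -- one more reason zeros are a problem; see below)
def naive_log_scores(golfer, fedora):
    return {s: math.log(priors[s])
               + math.log(conditional(1, golfer, s))
               + math.log(conditional(2, fedora, s))
            for s in (True, False)}

scores = naive_log_scores(True, False)
print(max(scores, key=scores.get))  # True: the same argmax as before
```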

Implementation Issues

  • How to handle zeros for some attributes?

    • If P(a_i | c) = 0 for some attribute value a_i, then the entire product becomes 0
    • This means P(c | a_1, ..., a_n) = 0 regardless of other evidence
    • Problem: A single zero probability can dominate the classification
  • Solution: Laplace Smoothing (Add-one smoothing)

    • Instead of: P(a_i | c) = count(a_i, c) / count(c)
    • Use: P(a_i | c) = (count(a_i, c) + 1) / (count(c) + K)
    • Where K is the number of possible values for attribute A_i
  • Example: If we never saw "Golfer=True, Spy=True" in training:

    • Without smoothing: P(Golfer = T | Spy = T) = 0/5 = 0
    • With Laplace (K = 2): P(Golfer = T | Spy = T) = (0 + 1) / (5 + 2) = 1/7 ≈ .14
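
A sketch of the smoothed estimate, again using the tallies and class_counts defined earlier:

```python
# Add-one smoothed estimate: (count + 1) / (class count + K)
def smoothed_conditional(attr_index, value, spy, k=2):
    count = sum(n for key, n in tallies.items()
                if key[0] == spy and key[attr_index] == value)
    return (count + 1) / (class_counts[spy] + k)

# The slide's hypothetical zero count would become (0 + 1) / (5 + 2) ≈ 0.14;
# with the actual tallies, smoothing just shrinks the estimate toward 1/2:
print(smoothed_conditional(1, True, True))  # (4 + 1) / (5 + 2) ≈ 0.71
```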