Loading [MathJax]/jax/output/CommonHTML/fonts/TeX/fontdata.js

Maximum Likelihood Learning

Bayesian Learning

Maximum a Posteriori Learning

Maximum Likelihood Learning

Turning our single layer network into a probability distribution...

Aside: Derivative of the Sigmoid/Logistic Function

The logistic function has a simple derivative:

σ(x)=σ(x)(1σ(x))

One Weird Trick...

Maximum Likelihood for Logistic Regression:

Why not just use MSE?