Density Estimation — Gaussian Mixtures, EM & Vowel Classification
Summary
Models probability distributions with a Mixture of Gaussians (MoG) trained via the Expectation-Maximization (EM) algorithm. Covers the full EM loop (E-step soft responsibility assignments; M-step parameter updates for the means, covariances, and mixing weights), applied to the Peterson & Barney vowel formant dataset (F1 and F2 frequencies). Builds a Maximum Likelihood classifier from two class-conditional GMMs, visualizes its decision boundaries on a meshgrid, confronts the covariance-singularity problem caused by linearly dependent features, and resolves it with regularization. Achieves 95.07% accuracy with K=3 components and 95.72% with K=6.
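The EM loop described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the notebook's implementation: the Peterson & Barney data is not bundled here, so two synthetic vowel-like (F1, F2) clusters with hypothetical centers stand in for it, and the ridge term `reg` shows the regularization trick that guards the M-step against singular covariances.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for (F1, F2) formant pairs; the cluster centers
# below are illustrative, not values from the Peterson & Barney data.
X = np.vstack([
    rng.normal([300.0, 2300.0], [40.0, 120.0], size=(100, 2)),  # /i/-like
    rng.normal([700.0, 1200.0], [60.0, 100.0], size=(100, 2)),  # /a/-like
])
N, D = X.shape
K = 2

# Initialization: split on the F1 median so each component starts near
# one cluster; any reasonable init (e.g. k-means) would also work.
lo = X[:, 0] < np.median(X[:, 0])
means = np.stack([X[lo].mean(axis=0), X[~lo].mean(axis=0)])
covs = np.stack([np.cov(X.T)] * K)
weights = np.full(K, 1.0 / K)
reg = 1e-6 * np.eye(D)  # ridge on the covariances: avoids singularities

def gauss_pdf(X, mu, Sigma):
    """Multivariate normal density evaluated at each row of X."""
    d = X - mu
    inv = np.linalg.inv(Sigma)
    norm = np.sqrt((2.0 * np.pi) ** D * np.linalg.det(Sigma))
    return np.exp(-0.5 * np.einsum("nd,de,ne->n", d, inv, d)) / norm

for _ in range(50):
    # E-step: soft responsibilities r[n, k] ∝ pi_k * N(x_n | mu_k, Sigma_k)
    dens = np.stack([w * gauss_pdf(X, m, S)
                     for w, m, S in zip(weights, means, covs)], axis=1)
    r = dens / dens.sum(axis=1, keepdims=True)

    # M-step: responsibility-weighted re-estimates of the parameters
    Nk = r.sum(axis=0)
    means = (r.T @ X) / Nk[:, None]
    for k in range(K):
        d = X - means[k]
        covs[k] = (r[:, k, None] * d).T @ d / Nk[k] + reg
    weights = Nk / N
```

Turning this into the Maximum Likelihood classifier is then a matter of fitting one such mixture per vowel class and assigning a new point to whichever class-conditional GMM gives it the higher density.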
Materials
Density Estimation & the EM Algorithm
How Gaussian mixtures model complex distributions, why EM works through soft assignments, and how density models become classifiers.
Density Estimation Notebook — Vowel Classification with GMMs
Implement EM for Gaussian mixtures, build a Maximum Likelihood vowel classifier from formant data, and solve the singularity problem.
Includes notebook