Unsupervised, Recommenders & Reinforcement
Clustering and anomaly detection foundations from the URRL sequence, with in-depth notes and interactive labs.
Concepts Covered
Welcome
Course map: unsupervised learning first, recommender systems next, reinforcement learning after that.
What Is Clustering?
Clustering finds structure in unlabeled data by grouping similar points together.
K-Means Intuition
K-means alternates between assigning points to nearest centroids and moving centroids to cluster means.
K-Means Algorithm
Formal K-means procedure with assignment equations, centroid updates, and empty-cluster handling.
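The alternating assignment and update steps can be sketched in NumPy (a minimal illustration, not a reference implementation; re-seeding an empty cluster at a random data point is one common handling choice):

```python
import numpy as np

def kmeans(X, k, n_iters=100, rng=None):
    """Plain k-means: alternate assignment and centroid-update steps."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(rng)
    # Initialize centroids at k distinct randomly chosen data points.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Assignment step: each point joins the cluster of its nearest centroid.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update step: move each centroid to the mean of its assigned points.
        for j in range(k):
            members = X[labels == j]
            if len(members) == 0:
                # Empty-cluster handling: re-seed at a random data point.
                centroids[j] = X[rng.choice(len(X))]
            else:
                centroids[j] = members.mean(axis=0)
    # Final assignment so labels match the returned centroids.
    labels = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2).argmin(axis=1)
    return centroids, labels
```

With well-separated data, a call like `kmeans(X, 2)` recovers the cluster centers up to ordering.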
Optimization Objective
K-means minimizes distortion: average squared distance from each point to its assigned centroid.
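The distortion objective can be computed directly from the assignments; a minimal sketch:

```python
import numpy as np

def distortion(X, centroids, labels):
    # J = (1/m) * sum_i ||x_i - mu_{c(i)}||^2 :
    # average squared distance from each point to its assigned centroid.
    diffs = X - centroids[labels]
    return np.mean(np.sum(diffs ** 2, axis=1))
```

Each k-means iteration can only decrease (or leave unchanged) this quantity, which is why it is a useful convergence check.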
Initializing K-Means
Initialization quality strongly affects final clustering; multi-start runs improve robustness.
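Multi-start can be sketched as: run k-means from several random initializations and keep the run with the lowest distortion (a minimal NumPy sketch; `n_starts` and the iteration count are illustrative choices):

```python
import numpy as np

def kmeans_once(X, k, rng, n_iters=50):
    # One k-means run from a random initialization; also returns its distortion.
    c = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iters):
        labels = np.linalg.norm(X[:, None] - c[None], axis=2).argmin(axis=1)
        for j in range(k):
            pts = X[labels == j]
            if len(pts):
                c[j] = pts.mean(axis=0)
    labels = np.linalg.norm(X[:, None] - c[None], axis=2).argmin(axis=1)
    cost = np.mean(np.sum((X - c[labels]) ** 2, axis=1))
    return c, labels, cost

def kmeans_multistart(X, k, n_starts=20, seed=0):
    # Keep the solution with the lowest distortion across all starts.
    rng = np.random.default_rng(seed)
    return min((kmeans_once(X, k, rng) for _ in range(n_starts)),
               key=lambda run: run[2])
```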
Choosing the Number of Clusters
Choosing K is often ambiguous; combine elbow hints with downstream business tradeoffs.
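One way to read an elbow is to compute distortion for a range of K values and look for where the curve flattens. A sketch using SciPy's `scipy.cluster.vq.kmeans` (which reports the mean distance to the nearest centroid; the three-blob data is synthetic):

```python
import numpy as np
from scipy.cluster.vq import kmeans

rng = np.random.default_rng(0)
# Three well-separated blobs; the elbow should appear around K = 3.
X = np.vstack([rng.normal(c, 0.3, (50, 2)) for c in ((0, 0), (5, 0), (0, 5))])

distortions = {}
for k in range(1, 7):
    _, d = kmeans(X, k, seed=0)  # best of several random restarts
    distortions[k] = d
    print(k, round(d, 3))
```

Distortion always decreases as K grows, so the elbow is a hint, not a rule; the final choice still depends on what the clusters are for.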
Finding Unusual Events
Anomaly detection learns normal behavior and flags low-probability events for inspection.
Gaussian (Normal) Distribution
Gaussian distributions model feature likelihood via mean and variance, forming the basis of simple anomaly scoring.
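The per-feature fit is just the sample mean and variance (the maximum-likelihood estimates); a minimal sketch:

```python
import numpy as np

def fit_gaussian(X):
    # Maximum-likelihood estimates: per-feature sample mean and variance.
    return X.mean(axis=0), X.var(axis=0)

def gaussian_density(x, mu, var):
    # Univariate normal pdf, evaluated feature-wise.
    return np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
```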
Anomaly Detection Algorithm
Fit one Gaussian per feature, multiply the per-feature densities into p(x), then flag x as anomalous when p(x) falls below a threshold epsilon.
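The full scoring pipeline can be sketched end to end (the training distribution, epsilon value, and test points here are illustrative):

```python
import numpy as np

def fit_params(X):
    # Per-feature Gaussian parameters estimated from (assumed mostly normal) data.
    return X.mean(axis=0), X.var(axis=0)

def p_x(x, mu, var):
    # p(x) = product over features of the univariate normal densities.
    dens = np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
    return np.prod(dens, axis=-1)

rng = np.random.default_rng(0)
X_train = rng.normal([10.0, 4.0], [1.0, 0.5], size=(500, 2))
mu, var = fit_params(X_train)

epsilon = 1e-4  # in practice, tuned on a labeled cross-validation set
x_normal = np.array([10.2, 3.9])
x_odd = np.array([10.0, 9.0])  # second feature far outside the normal range
print(p_x(x_normal, mu, var) >= epsilon, p_x(x_odd, mu, var) < epsilon)  # prints: True True
```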
Developing and Evaluating an Anomaly Detection System
Use a labeled cross-validation set containing known anomalies to tune epsilon and features; evaluate with skew-aware metrics such as precision, recall, and F1.
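Epsilon selection by F1 on a labeled cross-validation set can be sketched as follows (the threshold grid of 1000 candidates is an illustrative choice):

```python
import numpy as np

def f1_score(y_true, y_pred):
    # F1 from true/false positive and false negative counts;
    # robust to skew where plain accuracy is not.
    tp = np.sum((y_pred == 1) & (y_true == 1))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    prec = tp / (tp + fp) if tp + fp else 0.0
    rec = tp / (tp + fn) if tp + fn else 0.0
    return 2 * prec * rec / (prec + rec) if prec + rec else 0.0

def choose_epsilon(p_cv, y_cv):
    # Scan thresholds spanning the CV probabilities; keep the best F1.
    best_eps, best_f1 = 0.0, -1.0
    for eps in np.linspace(p_cv.min(), p_cv.max(), 1000):
        score = f1_score(y_cv, (p_cv < eps).astype(int))
        if score > best_f1:
            best_eps, best_f1 = eps, score
    return best_eps, best_f1
```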
Anomaly Detection vs Supervised Learning
Pick anomaly detection for rare and evolving positives; pick supervised learning when positives are sufficiently labeled and stable.
Choosing What Features to Use
Feature shaping and engineering are critical in anomaly detection; transform skewed variables and iterate via error analysis.
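A common transform for a right-skewed feature is log(x + c), which makes it look more Gaussian before fitting; a sketch on synthetic lognormal data (the constant c = 1e-3 and sample size are illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.lognormal(mean=0.0, sigma=1.0, size=10_000)  # heavily right-skewed

def skewness(v):
    # Sample skewness: third standardized moment (0 for a symmetric distribution).
    return np.mean((v - v.mean()) ** 3) / v.std() ** 3

x_log = np.log(x + 1e-3)  # log transform pulls the long right tail in
print(round(skewness(x), 2), round(skewness(x_log), 2))
```

Checking a histogram (or the skewness, as here) before and after each candidate transform is a quick way to iterate during error analysis.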