Guided Starter Example
Google News uses unsupervised clustering to group stories about the same event from hundreds of different news sources โ no one manually labelled which articles go together.
Finding hidden structure in data with no labels โ clustering, anomaly detection, and more.
Andrew Ng's framing: 'Don't let the name fool you โ unsupervised learning is just as super as supervised learning.'
In unsupervised learning, your dataset has no output labels Y โ just input features X. You're not trying to predict anything specific. Instead, the algorithm must discover hidden structure, patterns, or groupings all by itself.
The most common type is clustering โ grouping similar data points together. Real example from the topic: Google News. Every day, thousands of news articles are published. Google News groups stories about the same event across hundreds of different sources โ no human manually labels which articles belong together. The clustering algorithm discovers the groups by text similarity.
Another example from the topic: DNA microarray data โ researchers measure gene activity across thousands of people and cluster them to discover which individuals share similar genetic patterns. No one told the algorithm what to look for.
Other unsupervised types you'll learn later: anomaly detection (fraud detection) and dimensionality reduction (compressing data while preserving structure).
Source-backed reinforcement: these points are extracted from the session source note to strengthen your theory intuition.
Source-backed reinforcement: these points add detail beyond short-duration UI hints and emphasize production tradeoffs.
First-time learner note: Read each model as a dataflow system: inputs become representations, representations become scores, and scores become decisions through a chosen loss and thresholding policy.
Production note: Track three things relentlessly in ML systems: data shape contracts, evaluation methodology, and the operational meaning of the model's errors. Most expensive failures come from one of those three.
Exhaustive coverage points to ensure complete topic understanding without missing core concepts.
Google News uses unsupervised clustering to group stories about the same event from hundreds of different news sources โ no one manually labelled which articles go together.
Guided Starter Example
Google News uses unsupervised clustering to group stories about the same event from hundreds of different news sources โ no one manually labelled which articles go together.
Source-grounded Practical Scenario
Finding hidden structure in data with no labels โ clustering, anomaly detection, and more.
Source-grounded Practical Scenario
Which is a type of unsupervised learning algorithm, takes data without labels and tries to automatically group them into clusters.
Concept-to-code walkthrough checklist for this topic.
Questions an interviewer is likely to ask about this topic. Think through your answer before reading the senior angle.
Test yourself before moving on. Flip each card to check your understanding โ great for quick revision before an interview.
Drag to reorder the architecture flow for Unsupervised Learning. This is designed as an interview rehearsal for explaining end-to-end execution.
Start flipping cards to track your progress
What is the core difference between supervised and unsupervised learning?
tap to reveal โSupervised: every training example has both input X and output label Y. You tell the algorithm the right answers. Unsupervised: data has only X โ no labels. The algorithm must find structure, patterns, or groupings by itself. Andrew Ng: 'We're not trying to supervise the algorithm to give some right answer for every input.'