From the course: Machine Learning with Python: k-Means Clustering
Unlock this course with a free trial
Join today to access over 25,300 courses taught by industry experts.
What is clustering? - Python Tutorial
From the course: Machine Learning with Python: k-Means Clustering
What is clustering?
- [Instructor] Clustering is an approach in machine learning that is used to categorize or segment data into subgroups based on similarity. These subgroups are known as clusters. Clustering is an unsupervised machine learning approach. This means that we use clustering when working with previously unlabeled data. There are two primary goals to clustering. The first is to ensure that the items within a particular cluster are as similar as possible. This is known as high-intraclass similarity. The second goal to clustering is to make sure the items within one cluster are as different as possible from items in another cluster. This is known as low-interclass similarity. Clustering is a widely used machine learning approach. It is sometimes used in the domain of network security as a way to detect anomalous behavior in computer networks. Clustering is also sometimes used to automatically group or categorize documents based on…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.