article thumbnail

Fundamentals of Data Mining

Data Science 101

Data mining is the process of discovering these patterns among the data and is therefore also known as Knowledge Discovery from Data (KDD). This data alone does not make any sense unless it’s identified to be related in some pattern. Strong patterns, if found, will likely generalize to make accurate predictions on future data.

article thumbnail

Density-Based Clustering

Domino Data Lab

Due to its importance in both theory and applications, this algorithm is one of three algorithms awarded the Test of Time Award at the KDD conference in 2014. The anomalous points pull the cluster centroid towards them, making it harder to classify them as anomalous points. neighborhoods. The general idea behind ?-neighborhoods away from p.

Metrics 116