The textbook referred to on this page is:
"Introduction to Data Mining (2nd Edition)".
By Pang-Ning Tan, Michael Steinbach, Anuj Karpatne, and Vipin Kumar.
Pearson. 2019. ISBN-13: 978-0133128901 ISBN-10: 0133128903.
(See the book's link above for book slides and other resources.)
Need to know all of the following concepts, what they are and how to use them:
Approach | Definition of Outlier
(state full definition) |
Anomaly score function | How does the approach work?
(in general) |
Example |
Statistical-based | Probabilistic definition of outlier | |||
Proximity-based | Proximity-based definition of outlier using
distance to k-nearest neighbor |
|||
Density-based | Density-based definition of outlier using
|
|||
Clustering-based | Clustering-based definition of outlier |