WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS548 Knowledge Discovery and Data Mining 
Quiz/Exam Topics and Sample Questions

PROF. CAROLINA RUIZ 

Warning: This page is provided just as a guide for you to study for the quizzes/tests. But there is no guarantee that the quiz/test questions will only come from those suggested here. You need to read the assigned materials in full before each quiz/test.
------------------------------------------

The textbook referred to on this page is:
"Introduction to Data Mining (2nd Edition)".
By Pang-Ning Tan, Michael Steinbach, Anuj Karpatne, and Vipin Kumar.
Pearson. 2019. ISBN-13: 978-0133128901 ISBN-10: 0133128903.
(See the book's link above for book slides and other resources.)

  1. Introduction to Data Mining and Knowledge Discovery in Databases:
    All materials covered in class, online lecture notes, and Chapter 1 + Slides of the Textbook

  2. Data and Data Preprocessing:
    All materials covered in class, online lecture notes, and Chapter 2 + Slides + Appendix B of the Textbook

  3. Classification: ZeroR, OneR, Decision Trees:
    All materials covered in class, online lecture notes, and Sections 3.1-3.3 + Slides + Appendix B of the Textbook

  4. Regression: Linear Regression, Regression Trees, Model Trees:
    All materials covered in class, online lecture notes, and online Appendix B.1 of the Textbook

  5. Model Evaluation and Model Comparison: (for prediction models)
    All materials covered in class, online lecture notes, and Sections 3.4-3.9 + Slides of the Textbook

  6. Artificial Neural Networks and Deep Learning:
    All materials covered in class, online lecture notes, and Sections 4.7-4.8 + Slides of the Textbook

    Need to know all of the following concepts, what they are and how to use them:

  7. Bayesian Classifiers:
    All materials covered in class, online lecture notes, and Section 4.5 + Slides of the Textbook

  8. Rule-Based Classifiers:
    All materials covered in class, online lecture notes, and Section 4.2 + Slides Textbook

  9. Association Analysis:
    All materials covered in class, online lecture notes, and Sections 5.1, 5.2, 5.3, 5.7. + Textbook Slides

  10. Clustering:
    All materials covered in class, online lecture notes, and Chapter 7 + Slides

  11. Anomaly Detection:
    All materials covered in class, online lecture notes, and Chapter 9

  12. Text Mining:
    All materials covered in class and online lecture notes

  13. Web Mining:
    All materials covered in class and online lecture notes

  14. Data Visualization:
    All materials covered in class and online lecture notes