WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS548 Knowledge Discovery and Data Mining 
Schedule of Classes - Fall 2019

PROF. CAROLINA RUIZ 

WARNING: Changes to this schedule may be made during the course of the semester. 
------------------------------------------

WEEK DATE DUE TOPIC READINGS
Tan, Steinbach, Kumar's Textbook
0 Aug. 22   Introduction to KDD & Data Mining   Chp. 1
1 Aug. 27 & 29   Data & Data Preprocessing
  • Concepts, instances, attributes
  • Data sampling
  • Missing values
  • Attribute discretization
  • Dimensionality reduction:
  • Feature Selection
  •   Chp. 2
    2 Sept. 3
    No class on Sept. 5
      Data & Data Preprocessing (cont.)
  • Dimensionality reduction:
  • Feature Extraction
  •   Online Appendix B.1
    3 Sept. 10 & 12 Project 1
    & Test 1
    Classification
  • Decision trees
    Showcase: Decision Trees
    Project 1 discussion and Test 1
  •   Sect. 3.1-3.3.
    4 Sept. 17 & 19 HW Regression
  • linear regression
  • model trees
  • regression trees
    Showcase: Model and Regression Trees
  •   Online Appendix D and
    all numeric prediction materials on Ruiz' lecture notes
    5 Sept. 24 & 26 HW Model construction and evaluation
  • Training and Testing
  • Cross validation
  • Performance evaluation
  •   Sect. 3.4-3.8
    6 Oct. 1 & 3 HW Model Comparison
    Experimental Design
    Deep Learning Networks
  • Neural Networks
  • Deep Learning
  •   Sect. 3.9, 4.7-4.8
    7 Oct. 8 & 10 Project 2
    & Test 2
    HW
    Deep Learning Networks (cont.)
  • Neural Networks
  • Deep Learning
    Showcase: Neural Networks and Deep Learning
    Project 2 discussion and Test 2
  •   Sect. 4.7-4.8
      Oct. 15 & 17   Semester Break  
    8 Oct. 22 & 24 HW Cluster Analysis
  • partitioning methods
  • hierarchical methods
  • density-based methods
    Showcase: Clustering
  •   Chp. 7
    9 Oct. 29 & 31 Project 3
    & Test 3
    HW
    Cluster Analysis (cont.)
  • grid-based methods
  • model-based methods
    Project 3 discussion and Test 3
  •   Chp. 7
    10 Nov. 5 & 7 HW Anomaly Detection
  • statistical approaches
  • proximity-based approaches
  • density-based approaches
  • reconstruction-based approaches
    Showcase: Anomaly Detection
  •   Chp. 9
    11 Nov. 12 & 14 Project 4
    & Test 4
    HW
    Advanced topics
  • Text mining
    Showcase: Text Mining
    Project 4 discussion and Test 4
  •  
    text mining materials marked with ** on Ruiz' lecture notes
    12 Nov. 19 & 21 HW Advanced topics (cont.)
  • Sequence mining
  • Data Visualization
  • Multimedia data mining
    Showcase: Data Visualization
  •   Sect. 7.4
    all visualization materials on Ruiz' lecture notes
      Nov. 26
    No class on Nov. 28
      Advanced topics (cont.)
  • Web mining
  • Ethical and Societal Implications of Data Mining
    Showcase: Web Mining
  • all web mining materials on Ruiz' lecture notes
    13 Dec. 3 & 5 Project 5
    & Test 5
    Project 5 presentations and Test 5  
    14 Dec. 12
    Makeup class (Dec 10)
      Project 5 presentations and discussion (cont.)
    Final remarks