WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS548 Knowledge Discovery and Data Mining 
Schedule of Classes - Spring 2014

PROF. CAROLINA RUIZ 

WARNING: Changes to this schedule may be made during the course of the semester. 
------------------------------------------

WEEK DATE DUE TOPIC READINGS
1 Jan. 17   Introduction to KDD & Data Mining   Chp. 1 & 2
2 Jan. 21 & 24   Data & Data Preparation
  • Concepts, instances, attributes
  • Data preprocessing
  • Attribute selection
  •   Chp. 2 & 3
    3 Jan. 28 & 31   Data & Data Preparation (cont.)
  • Data integration
  • Data warehousing & OLAP
  • Dimensionality reduction
    Showcases
  •   Chp. 2 & 3, Appendix B
    4 Feb. 4 & 7 Project 1 Mining process
  • Training and Testing
  • Cross validation
  • Performance evaluation
    Project 1 discussion and test
    Showcases
  •   Sect. 4.5
    5 Feb. 11 & 14   Classification
  • Decision trees
    Showcases
  •   Sect. 4.1-4.4.
    6 Feb. 18 & 21   Numeric Predictions
  • linear regression
  • model trees
  • regression trees
    Showcases
  •   Appendix D, assigned readings
    7 Feb. 25 & 28   Association Analysis
  • association rules
    Showcases
  •   Sec. 6.1-6.3, 6.7-6.9.
    8 Mar. 4 & 7 Project 2 Project 2 discussion and test
    Association Analysis (cont.)
  • association rules
    Showcases
  •   Chp. 6
    9 Mar. 18 & 21   Cluster Analysis
  • partitioning methods
  • hierarchical methods
  • density-based methods
    Showcases
  •   Chp. 8
    10 Mar. 25 & 28 Project 3 Cluster Analysis (cont.)
  • grid-based methods
  • model-based methods
    Project 3 discussion and test
    Showcases
  •   Chp. 8
    11 Apr. 1 & 4   Anomaly Detection
  • model-based methods
  • proximity-based methods
  • density-based methods
    Showcases
  •   Chp. 10
    12 Apr. 8 & 11 Project 4 Advanced topics
  • Visualization
  • Web mining
    Project 4 discussion and test
    Showcases
  •   assigned papers
    13 Apr. 15 & 18   Advanced topics (cont.)
  • Sequence mining
  • Multimedia data mining
    Showcases
  •   assigned papers
    14 Apr. 22 & 25 Project 5 **** Classroom for April 22nd: Goddard Hall GH012 ****
    Advanced topics (cont.)
  • Text mining
  • Industrial applications of data mining
  • Scientific applications of data mining
    Project 5 presentations and discussion
    Showcases
  •   assigned papers
    15 Apr. 29   Project 5 presentations and discussion (cont.)
    Final remarks
    Showcases
      assigned papers