WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS525D Knowledge Discovery and Data Mining 
Schedule of Classes - Spring 2008

PROF. CAROLINA RUIZ 

WARNING: Changes to this schedule may be made during the course of the semester. 
------------------------------------------

WEEK DATE DUE TOPIC READINGS
1 Jan. 15 & 17   Introduction to KDD & Data Mining   Chp. 1 & 9, assigned papers
2 Jan. 22 & 24   Data & Data Preparation
  • Concepts, instances, attributes
  • Data integration
  • Data preprocessing
  •   2, 7.1-7.3, 10.1-10.3, 10.8
    3 Jan. 29 & 31   Data & Data Preparation (cont.)
  • Data warehousing & OLAP
  • Attribute selection
  • Dimensionality reduction
  •   2, 7.1-7.3, 10.1-10.3, 10.8
    4 Feb. 5 & 7 Project 1 Mining process
  • Training and Testing
  • Cross validation
  • Performance evaluation
    Project 1 presentations
  •   5.1-5.4, 5.8
    5 Feb. 12 & 14   Classification
  • Decision trees
  •   3.2, 4.3, 6.1
    6 Feb. 19 & 21   Numeric Predictions
  • model trees
  • regression trees
  •   3.7, 4.6, 6.5, 5.8
    7 Feb. 26 & 28 Project 2 Project 2 presentations
    Association Analysis
  • association rules
  •   3.4, 4.5, assigned papers
    8 Mar. 4 & 6 Project 3 Association Analysis (cont.)
  • association rules
    Project 3 presentations
  •   3.4, 4.5, assigned papers
    9 Mar. 11 & 13 Project 4 Cluster Analysis
  • partitioning methods
  • hierarchical methods
  • density-based methods
    Project 4 presentations
  •   3.9, 4.8, 6.6
    10 Mar. 18 & 20   Cluster Analysis (cont.)
  • grid-based methods
  • model-based methods
  •   3.9, 4.8, 6.6
    11 Mar. 25 & 27   Anomaly Detection
  • model-based methods
  • proximity-based methods
  • density-based methods
  •   7.4, assigned papers
    12 April 1 & 3 Project 5 Advanced topics
  • Visualization
  • Web mining
    Project 5 presentations
  •   8, assigned papers
    13 April 8 & 10   Advanced topics (cont.)
  • Similarity search
  • Sequence mining
  • Multimedia data mining
  •   8, assigned papers
    14 April 15 & 17 Project 6 Advanced topics (cont.)
  • Text mining
  • Industrial applications of data mining
  • Scientific applications of data mining
    Project 6 presentations
  •   8, assigned papers
    15 April 22 & 24   make-up week for snow or other cancellations