WEEK | DATE | DUE | TOPIC |
1 | Aug. 30 | | Introduction to data and text mining in bioinformatics; Biological and biomedical data and text sources, ontologies |
2 | Sept. 3 - 6 | PbmSet1 | Data preprocessing, feature selection; Visualization; Clustering; Hierarchical clustering, phylogenetic trees; Problem set discussion |
3 | Sept. 10 - 13 | PbmSet2 | Bayesian models; Expectation Maximization (EM); Gibbs sampling; Problem set discussion |
4 | Sept. 17 - 20 | PbmSet3 | Markov chains; Hidden Markov Models (HMMs); Problem set discussion |
5 | Sept. 24 - 27 | | Kernel methods: Support Vector Machines (SVMs); Sequence mining |
6 | Oct. 1 - 4 | PbmSet4 | Statistical Natural Language Processing; Preprocessing for text mining; Principal Components Analysis (PCA); Problem set discussion |
7 | Oct. 8 - 11 | PbmSet5 | Text clustering; Text classification; Pattern discovery from text; Ontologies; Integrating data and text for mining; Problem set discussion |
8 | Oct. 15 | PbmSet6 | EXAM |
9 - 15 | Oct. 28 - Dec. 20 | Final Project | Advanced graduate topics and readings |