Course web page: http://www.mcb.mcgill.ca/~perkins/COMP76602/COMP76602.html
Location: Arts Building 210
Time: 10:35 am - 11:25 am, MWF
Office hours: Mon 11:45-12:45, Thu 1-2
Lecture | Date | Topic | Handouts / Slides |
---|---|---|---|
1 | Jan 5 | Introduction | [lecture slides] [Course info in html] [Course info in ps] |
Summarization / Unsupervised learning | |||
2 | Jan 7 | Clustering | [lecture slides] |
3 | Jan 9 | Clustering, Self-organizing maps | [lecture slides] Single-linkage clustering example 1 Complete-linkage clustering example 1 Group average clustering example 1 Single-linkage clustering example 2 Complete-linkage clustering example 2 Group average clustering example 2 |
4 | Jan 12 | Read and critique: Eisen, Spellman, Brown, Botstein (1998) "Cluster analysis and display of genome-wide expression patterns." PNAS Vol. 95 pp. 14863-14868. | |
5 | Jan 14 | Read and critique: Yeung, Haynor, Ruzzo (2001) "Validating clustering for gene expression data." Bioinformatics Vol. 17 No. 4 pp. 309-318 | |
6 | Jan 19 | Basic statistics review, Principal components analysis | [lecture slides] |
7 | Jan 21 | Gradient descent, Multi-dimensional scaling, Self-organizing maps | [lecture slides] |
8 | Jan 23 | Read and critique: Landgrebe, Wurst, Welzl (2002) "Permutation-validated principal components analysis of microarray data" Genome Biology Vol. 3 No. 4 | |
9 | Jan 26 | Read and critique: Grishin and Grishin (2002) "Euclidian space and grouping of biological objects" Bioinformatics Vol. 18 no. 11 pp. 1523-1533 | |
10 | Jan 28 | Read and critique: Draghici and Potter (2003) "Predicting HIV drug resistance with neural networks" Bioinformatics Vol. 19 No. 1 pp. 98-107 | |
Prediction | |||
11 | Jan 30 | Prediction problems, Nearest neighbor | [lecture slides] |
12 | Feb 2 | Nearest neighbor, Linear regression | [lecture slides] |
13 | Feb 4 | Linear regression | [lecture slides] |
14 | Feb 6 | Perceptrons (linear classification) | [lecture slides] |
15 | Feb 9 | Read and critique: Huang and Li (2004) "Prediction of protein subcellular locations using fuzzy k-NN method" Bioinformatics Vol. 20 No. 1 pp. 21-28 | |
16 | Feb 11 | Read and critique: Chuzhanov, Jones, Margetts (1998) "Feature selection for genetic sequence classification" Bioinformatics Vol. 14 No. 2 pp. 139-143 | |
17 | Feb 13 | Read and critique: Wu, Schmidler, Hastie, Brutlag (1998) "Regression analysis of multiple protein structures" Journal of Computational Biology Vol. 5 No. 3 pp. 597-607 | |
18 | Feb 16 | Logistic regression | [example logistic fit] [training to minimize sum squared error] [training to minimize cross-entropy] |
19 | Feb 18 | Artificial neural networks | [training ANN with 2 hidden units] |
20 | Feb 20 | Read and critique: Hatzigeorgiou (2002) "Translation initiation start prediction in human cDNAs with high accuracy" Bioinformatics Vol. 18 No. 2 pp. 343-350 | |
spring break | |||
21 | Mar 3 | Support vector machines 1 | [lecture slides] |
22 | Mar 5 | Support vector machines 2 | |
23 | Mar 8 | Tree-structured predictors | [lecture slides] |
24 | Mar 10 | Read and critique: Aliferis et al. (2002) "Machine learning models for lung cancer classification using array comparative genomic hybridization" Proc. AMIA Symposium 2002 pp. 7-11 | |
Probabilistic Modeling | |||
25 | Mar 12 | Review of probability theory 1 | [lecture slides] |
26 | Mar 15 | Review of probability theory 2 | |
27 | Mar 17 | Maximum likelihood estimation of p.d.f.'s | [lecture slides] |
28 | Mar 22 | Read and critique: Henikoff and Henikoff (1996) "Using substitution probabilities to improve position-specific scoring matrices" Computer Applications in the Biosciences Vol. 12 No. 2 pp. 135-143 | |
29 | Mar 24 | Tests of statistical [in]dependence | [lecture slides] |
30 | Mar 26 | Bayesian networks | [lecture slides] |
31 | Mar 29 | Read and critique: Segal et a. (2002) "From Promoter Sequence to Expression: A Probabilistic Framework" RECOMB 2002 | |
32 | Mar 31 | Learning Bayesian Networks 1 | [lecture slides] |
33 | Apr 2 | Learning Bayesian Networks 2 |