36-350: Data Mining (Fall 2003)
Lectures: Monday and Wednesday, 10:30-11:20, CFA 211
Computer labs: Friday, 10:30-11:20, Baker 140F
Instructor: Tom Minka, Statistics Dept
Teaching Assistant: Fang Chen
Syllabus
Revision policy
R home page
Data Mining links
2003 Car data
Date   Topic                                               Assignment
8/25   Searching for information by similarity             hw1
8/27   Multi-dimensional scaling
8/29   Lab 1                                               mining.zip, lab1_docs.rda, lab1.r, politics3.txt
9/3    Searching for images by similarity                  hw2
9/5    Lab 2                                               mining.zip, lab2_imgs.rda, lab2.r
9/8    Information content of words                        hw3
9/10   Interactions and "20 questions"
9/12   Lab 3                                               mining.zip, lab3_docs.zip, lab3.r
9/15   Partitioning data into clusters                     hw4
9/17   Partitioning images and video
9/19   Lab 4                                               mining.zip, lab2_imgs.rda, lab4_img.rda, lab4.r
9/22   PCA projection of Cars                              hw5
9/24   Interpreting PCA projection
9/26   Lab 5                                               mining.zip, lab5.r
9/29   Visualizing subgroups with informative projections  hw6
10/1   Parallel-coordinate plots
10/3   Lab 6                                               mining.zip, lab6.r
10/6   Trend lines, slice plots, regression projection     hw7
10/8   Contour plots
10/10  Lab 7                                               mining.zip, lab7.r
10/13  Regression trees                                    hw8
10/15  More with trees
10/20  Linear regression for marketing research            hw9
10/22  Selecting variables for linear regression
10/24  Lab 9                                               mining.zip, lab9.r, lab9.rda
10/27  Adding interaction terms to a linear model          hw10
10/29  Assessing the quality of a regression model
10/31  Lab 10                                              mining.zip, lab10.r, lab9.rda
11/3   Categorical predictors and response tables          hw11 (greyscale version)
11/5   Contingency tables
11/7   Lab 11                                              mining.zip, lab11.r, crash-head.rda
11/10  Classification trees                                hw12
11/12  Tree pruning
11/14  Lab 12                                              mining.zip, lab12.r, Credit.rda, data description
11/17  Logistic regression                                 hw13
11/19  Quadratic expansion
11/21  Lab 13                                              mining.zip, lab12.r, Credit.rda, data description
11/24  Modeling time-series data
12/1   Time-series of purchases                            hw14
12/3   Final review
12/5   Lab 14                                              mining.zip, lab14.r, uspop.rda