|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Announcements
Course Description
Data mining, or intelligent analysis of information stored in data sets, has recently gained a substantial interest among practitioners in a variety of fields and industries. This course will introduce the process of knowledge discovery and the basic theory of automatically extracting models from data, validating those models, solving the problems of how to extract valid, useful, and previously unknown interesting patterns from a source (database or web) which contains an overwhelming amount of information. Students will be introduced to various models (decision trees, association rules, linear model, clustering, Bayesian network, neural network) and learn how to apply them in practice. Algorithms applied include searching for patterns in the data, using machine learning, and applying artificial intelligence techniques. Students will learn how to implement several relevant algorithms and use existing tools to mine real-world data.
Course Information and Prerequisites
Course Instructor
Schedule |
Date | Lecture Topic(s) | Reading Assignment | Homework |
W-Aug 22 Th-Aug 23 |
Introduction slides | Chapter 1 - Han Book | Assignment 0 - due before next class |
M-Aug 27 Tu-Aug 28 |
Getting to Know your Data slides | Chapter 2 - Han Book | Assignment 1 Sign-Up for Paper Presentations (Sign-up link will be emailed to you this afternoon) |
W-Aug 29 Tu-Sep 4 |
Data Preprocessing slides In-Class Activity 1 |
Chapter 3 - Han Book | |
W-Sep 5 Th-Sep 6 |
More Data Prepropressing slides In-Class Activity 2 |
Chapter 3 - Han Book | Assignment 2 Install WEKA v3.8 (Stable) by 9/11 |
M-Sep 10 Tu-Sep 11 |
Mining Frequent Patterns slides | Chapter 6 | |
W-Sep 12 Th-Sep 13 |
Introduction to WEKA - Tutorial In-Class Activity 3 |
Assignment 3 Group Project Information |
|
M-Sep 17 Tu-Sep 18 |
More Mining Frequent Patterns slides | Chapter 6 | |
W-Sep 19 Th-Sep 20 |
Intro to Classification slides | Section 8.1 - 8.2 | |
M-Sep 24 Tu-Sep 25 |
More Classification slides | Sections 8.3-8.4 | Assignment 4 |
W-Sep 26 Th-Sep 27 |
Classification: Evaluation & Model Selection slides | Sections 8.5-8.7 | |
M-Oct 1 Tu-Oct 2 |
Study Guide for Midterm | Group Project Proposals Due | |
W-Oct 3 Th-Oct 4 |
Midterm | ||
M-Oct 8 Tu-Oct 9 |
Clustering Basics slides | Section 10.1-10.3 | |
W-Oct 10 Th-Oct 11 |
WEKA Activities Classification Activity Clustering Activity |
colic.arff zoo.arff | |
M-Oct 15 Tu-Oct 16 |
Fall Break - No Class | ||
W-Oct 17 Th-Oct 18 |
More on Clustering slides | Section 10.4 | Assignment 5 |
M-Oct 22 Tu-Oct 23 |
Still More on Clustering slides | Section 10.5-10.7 | |
W-Oct 24 Th-Oct 25 |
Putting it all together In-Class Activity 6 | Weka ARFF Instructions skiresortdata.csv | |
M-Oct 29 Tu-Oct 30 |
K-Nearest Neighbor Classification slides In-Class Activity 7 |
Section 9.5.1 | |
W-Oct 31 Th-Nov 1 |
Work Period | ||
M-Nov 5 Tu-Nov 6 |
Link Analysis & PageRank slides | MMDS Chapter 5 | Project Checkpoint Paper Due Watch MapReduce Videos & Take MapReduce Quiz (Moodle) |
W-Nov 7 Th-Nov 8 |
More on PageRank slides | Assignment 6 | |
M-Nov 12 Tu-Nov 13 |
More on PageRank slides | ||
W-Nov 14 Th-Nov 15 |
Recommender Systems slides | MMDS Chapter 9 | |
M-Nov 19 Tu-Nov 20 |
More on Recommender Systems slides In-Class Activity 8 |
||
W-Nov 21 Th-Nov 22 |
Thanksgiving Break - No Class | ||
M-Nov 26 Tu-Nov 27 |
More on Recommender Systems slides | ||
W-Nov 28 Th-Nov 29 |
Group Project Presentations: Sahil, Geoffrey, Michael Mary, Noah Negusu, Jimmy, Allante Conrad, Jillian, Clare |
Thursday Presentations: Marcus, Liam, Benjamin Jimmy, Julie, Sophie Braith, William Carter, Thomas, Will |
Group Project Presentation Grading Rubric |
M-Dec 3 Tu-Dec 4 |
Group Project Presentations Adam, Will, Tanner William, Connor, Hieu Caroline, Tim, Shane |
Tuesday Presentations: Rachel, Emma, Hannah, Natalia Zach, Will, Chandler Jack, Jun, Jose |
|
W-Dec 5 | Final Project Paper Due | ||
F-Dec 7 M-Dec 10 |
Final Exam, 5:30-8pm |