COMP 345
Data Mining
(CRN 19746/19839, Fall 2018)

Announcements


  • No class or office hours on Thursday, August 30th.
  • You should install WEKA by Tuesday, September 11th.
  • The first lecture will be held on August 22/23, 2018.
  •  

    Course Description


    Data mining, or intelligent analysis of information stored in data sets, has recently gained a substantial interest among practitioners in a variety of fields and industries. This course will introduce the process of knowledge discovery and the basic theory of automatically extracting models from data, validating those models, solving the problems of how to extract valid, useful, and previously unknown interesting patterns from a source (database or web) which contains an overwhelming amount of information. Students will be introduced to various models (decision trees, association rules, linear model, clustering, Bayesian network, neural network) and learn how to apply them in practice. Algorithms applied include searching for patterns in the data, using machine learning, and applying artificial intelligence techniques. Students will learn how to implement several relevant algorithms and use existing tools to mine real-world data.

    The syllabus for this course can be found at http://cs.rhodes.edu/welshc/COMP345_F18/syllabus.pdf.

    Course Information and Prerequisites


    Location: Briggs 119
    Time: MW or TR 2-3:15PM
    URL: http://cs.rhodes.edu/welshc/COMP345_F18/
    Prerequisite: COMP 241


    Course Instructor


    Instructor: Catie Welsh
    Office: Briggs 208
    Email: welshc@rhodes.edu (please include "CS 345" somewhere in the subject)
    Office Hours: Mon, Tues and Thurs 3:30-5pm, Briggs 208







    Schedule


    This is a tentative schedule and subject to change as needed.

    Date Lecture Topic(s) Reading Assignment Homework
    W-Aug 22
    Th-Aug 23
    Introduction slides Chapter 1 - Han Book Assignment 0 - due before next class
    M-Aug 27
    Tu-Aug 28
    Getting to Know your Data slides Chapter 2 - Han Book Assignment 1
    Sign-Up for Paper Presentations (Sign-up link will be emailed to you this afternoon)
    W-Aug 29
    Tu-Sep 4
    Data Preprocessing slides
    In-Class Activity 1
    Chapter 3 - Han Book  
    W-Sep 5
    Th-Sep 6
    More Data Prepropressing slides
    In-Class Activity 2
    Chapter 3 - Han Book Assignment 2
    Install WEKA v3.8 (Stable) by 9/11
    M-Sep 10
    Tu-Sep 11
    Mining Frequent Patterns slides Chapter 6
    W-Sep 12
    Th-Sep 13
    Introduction to WEKA - Tutorial
    In-Class Activity 3
      Assignment 3
    Group Project Information
    M-Sep 17
    Tu-Sep 18
    More Mining Frequent Patterns slides Chapter 6  
    W-Sep 19
    Th-Sep 20
    Intro to Classification slides Section 8.1 - 8.2  
    M-Sep 24
    Tu-Sep 25
    More Classification slides Sections 8.3-8.4 Assignment 4
    W-Sep 26
    Th-Sep 27
    Classification: Evaluation & Model Selection slides Sections 8.5-8.7  
    M-Oct 1
    Tu-Oct 2
    Study Guide for Midterm   Group Project Proposals Due
    W-Oct 3
    Th-Oct 4
    Midterm
    M-Oct 8
    Tu-Oct 9
    Clustering Basics slides Section 10.1-10.3  
    W-Oct 10
    Th-Oct 11
    WEKA Activities
    Classification Activity
    Clustering Activity
      colic.arff zoo.arff
    M-Oct 15
    Tu-Oct 16
    Fall Break - No Class
    W-Oct 17
    Th-Oct 18
    More on Clustering slides Section 10.4 Assignment 5
    M-Oct 22
    Tu-Oct 23
    Still More on Clustering slides Section 10.5-10.7  
    W-Oct 24
    Th-Oct 25
    Putting it all together In-Class Activity 6   Weka ARFF Instructions  skiresortdata.csv
    M-Oct 29
    Tu-Oct 30
    K-Nearest Neighbor Classification slides
    In-Class Activity 7
    Section 9.5.1  
    W-Oct 31
    Th-Nov 1
    Work Period    
    M-Nov 5
    Tu-Nov 6
    Link Analysis & PageRank slides MMDS Chapter 5 Project Checkpoint Paper Due
    Watch MapReduce Videos & Take MapReduce Quiz (Moodle)
    W-Nov 7
    Th-Nov 8
    More on PageRank slides   Assignment 6
    M-Nov 12
    Tu-Nov 13
    More on PageRank slides    
    W-Nov 14
    Th-Nov 15
    Recommender Systems slides MMDS Chapter 9  
    M-Nov 19
    Tu-Nov 20
    More on Recommender Systems slides
    In-Class Activity 8
       
    W-Nov 21
    Th-Nov 22
    Thanksgiving Break - No Class
    M-Nov 26
    Tu-Nov 27
    More on Recommender Systems slides    
    W-Nov 28
    Th-Nov 29
    Group Project Presentations:
    Sahil, Geoffrey, Michael
    Mary, Noah
    Negusu, Jimmy, Allante
    Conrad, Jillian, Clare
    Thursday Presentations:
    Marcus, Liam, Benjamin
    Jimmy, Julie, Sophie
    Braith, William
    Carter, Thomas, Will
    Group Project Presentation Grading Rubric
    M-Dec 3
    Tu-Dec 4
    Group Project Presentations
    Adam, Will, Tanner
    William, Connor, Hieu
    Caroline, Tim, Shane
    Tuesday Presentations:
    Rachel, Emma, Hannah, Natalia
    Zach, Will, Chandler
    Jack, Jun, Jose
     
    W-Dec 5     Final Project Paper Due
    F-Dec 7
    M-Dec 10
    Final Exam, 5:30-8pm