Kent State University 
CS 63015/73015: Data Mining Techniques 
Fall 2007

Instructor: Ruoming Jin

Tuesdays 5:30PM-8:15PM; Rm. MSB 276
Office Hours (MSB 264): Tuesdays 4:00PM-5:00PM

---------------------------------------------------------

Prerequisites

CS 4/56101 Algorithms                                                                                                      CS 4/53005 Introduction to Database Systems
CS 33001 Data Structures                                                                                                      or Consent of the Instructor

Course Overview

This course teaches the fundamental concepts and techniques of data mining. We will cover a set of interesting topics, including pattern discovery/association rule mining, clustering, classification, information theory, decision theory/Bayesian inference, graphical models, kernel methods/support vector machine, spectral clustering, semi-supervised learning, etc. 

 

Each student will be expected to present a paper and lead the discussion following his/her presentation and do a project on selected topics. There will be neither homework nor exam. There will be two or three in-class exercise-sessions.

Textbook

P.-N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining, Addison Wesley, 2005. 

Christopher M. Bishop, Pattern Recognition and Machine Learning, Springer, 2006

 

Other references:


[1] Data Mining --- Concepts and techniques, by Han and Kamber, Morgan Kaufmann, 2001. (ISBN:1-55860-489-8)
[2] Principles of Data Mining, by Hand, Mannila, and Smyth, MIT Press, 2001. (ISBN:0-262-08290-X)
[3] The Elements of Statistical Learning --- Data Mining, Inference, and Prediction, by Hastie, Tibshirani, and Friedman, Springer, 2001. (ISBN:0-387-95284-5)
[4] Mining the Web --- Discovering Knowledge from Hypertext Data, by Chakrabarti, Morgan Kaufmann, 2003. (ISBN:1-55860-754-4)

[5] I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, 2nd ed. 2005.

Additional materials will include papers supplied by the instructor

 

Requirements & Grading Policy

A student's grade will be determined as a weighted average of project (40%), class participation (20%), and presentation (40%).

 

Lectures

Ø     8/28/07: Introduction to Data Mining

Ø     9/4/07: Association Rule Mining

Ø     9/18/07: Frequent Itemset Mining Implementation , Advanced Frequent Itemset Mining and Beyond

Ø     9/25/07: Summarization of FPM , Classification and Decision Tree Construction 

Ø     10/2/07: Paper Presentation List, Clustering

Ø     10/9/07: Page Rank

Ø     10/11/07: Paper Presentation Assignment and Date

Ø     10/16/07: Exercise, Weka Tutorial by Chibuike Muoh

Ø     10/23/07: Response Paper (Due Nov. 27th) Links: (How to write a response paper?) 

Ø     11/13/07: Clustering Visualization Project (Example)