S.#
Date
Day
Topics
Download
1-2
30/8/14
Saturday
Course Overview, What is Data Mining and its Origin, Typical Data Mining Tasks, Data Mining Applications/Examples,
Data Mining vs. OLAP and Statistics, Introduction to Classification/Decision Trees, Model Interpretation, Measures of Node Impurity, Computation of GINI Index

3-4
6/9/14
Saturday
Computation of Entropy and Misclassification Error, Induction of Classification Trees, Handling of Continuous and Multi-State Variables, Data Preparation, Normalization, Outlier Detection, Discretization using Value Reduction

5-6
13/9/14
Saturday
Overview of Chi Square Test, ChiMerge Discretization, KNIME Demo (using German Credit Card Data), Model Evaluation, Accuracy, Weighted Accuracy, Recall and Precision, Receiver Operating Characteristics (ROC Curve)

7-8
20/9/14
Saturday
Lift and Gain Charts, Bayes Theorem, Naive Bayes Classifier, KNIME Discussion


27/9/14

Midterm 1 Week

9-10
4/10/14
Saturday
Assignment 1 Presentations

11-12
11/10/14
Saturday
PAKDD 2010 Case Study, Hypothesis Testing (One Mean and Two Mean), Artificial Neural Networks, Motivation, History, Multi-layer Feedforward Network, Backpropagation Algorithm

13-14
18/10/14
Saturday
Performance Evaluation of SVM, NB, NN, DT, etc. in Non-linear Classification, Model Evaluation (Holdout, k-Cross Validation), Sampling with Replacement (Bootsrapping), Ensemble Methods (Bagging and Boosting), Stacking

15-16
25/10/14
Saturday
Lazy Learner vs. Eager Learner, k-Nearest Neighbor: Pros and Cons, Clustering: Basic Concepts and Popular Types, Applications, K-Means: Concepts, Working, Limitations, Schemes to Handle Initial Centroid Problems in K-Means, Hierarchical Clustering: Simple/Complete/Average Linkages, Validity of Clusters: External and Internal Metrics

17-18
1/11/14
Saturday
KNIME Demo (K-Means and Fuzzy c-Means Clustering with Relative Index and External Index, Hierarchical Clustering), Distance Computation for Mixed Type Variables: Interval-Scaled, Symmetric and Asymmetric Binary, Categorical and Ordinal, Fuzzy c-Means

19
8/11/14
Saturday
Assignment 2 Presentations

20
14/11/14
Friday
Kohonen Self Organizing Map, Recap and Examples of Bagging, Boosting, Bootstrap Sampling, Hands-On Clustering

21-22
15/11/14
Saturday
Text Analytics, Part-of-Speech Tagging, Bag of Words, Term Frequency, Inverse Document Frequency, TF-IDF


22/11/14
Saturday
Midterm 2 Week

23-24
29/11/14
Saturday
Association Rule Mining, Apriori Algorithm, Frequent Itemsets and Rules Generation, Support, Confidence, Interest and Lift, Handling of Continuous and Categorical Data, min-Apriori, Multi-level Association Rules, KNIME Demo

25
19/12/14
Friday
Principal Component Analysis, Big Data Overview

26-27
20/12/14
Saturday
Assignment 3 Presentations