MC7403 DATA WAREHOUSING AND DATA MINING NOTES ANNA UNIVERSITY

MC7403 DATA WAREHOUSING AND DATA MINING NOTES ANNA UNIVERSITY

UNIT I DATAWAREHOUSE

  • Data Warehousing
  • Operational Database Systems vs. Data Warehouses
  • Multidimensional Data Model
  • Schemas for Multidimensional Databases
  • OLAP Operations
  • Data Warehouse Architecture
  • Indexing
  • OLAP queries & Tools

UNIT II DATAMINING & DATA PREPROCESSING

  • Introduction to KDD process
  • Knowledge Discovery from Databases
  • Need for Data Preprocessing
  • Data Cleaning
  • Data Integration and Transformation
  • Data Reduction
  • Data Discretization and
  • Concept Hierarchy Generation.

UNIT III ASSOCIATION RULE MINING

  • Introduction
  • Data Mining Functionalities
  • Association Rule Mining
  • Mining Frequent Itemsets with and without Candidate Generation
  • Mining Various Kinds of Association Rules
  • Constraint-Based Association Mining.

UNIT IV CLASSIFICATION & PREDICTION

  • Classification vs. Prediction
  • Data preparation for Classification and Prediction
  • Classification by Decision Tree Introduction
  • Bayesian Classification
  • Rule Based Classification
  • Classification by Back Propagation
  • Support Vector Machines
  • Associative Classification
  • Lazy Learners
  • Other Classification Methods
  • Prediction
  • Accuracy and Error Measures
  • Evaluating the Accuracy of a Classifier or Predictor
  • Ensemble Methods
  • Model Section.

UNIT V CLUSTERING

  • Cluster Analysis:
  • Types of Data in Cluster Analysis
  • A Categorization of Major Clustering Methods
  • Partitioning Methods
  • Hierarchical methods
  • Density-Based Methods
  • Grid-Based Methods
  • Model-Based Clustering Methods
  • Clustering High Dimensional Data
  • Constraint-Based Cluster Analysis
  • Outlier Analysis.

Download Data Mining Notes

Unit 1
Unit 2
Unit 3
Unit 4
Unit 5
Two Mark Questions with Answers
Extras : Unit 1
Extras : Unit 2
Previous Year Question Paper

From Unit 1 to Unit 5 Source : vidyarthiplus.com