Bachelor of Computer Science & Engineering 3rd Year (V -VI Semester) Aktu Lecture Notes

 

NCS- 066 Data Mining And DataWareHousing


Unit I:
Data Warehousing: Overview, Definition, Data Warehousing Components, Building a Data Warehouse, Warehouse Database, Mapping the Data Warehouse to a Multiprocessor Architecture, Difference between Database System and Data Warehouse, Multi Dimensional Data Model, Data Cubes, Stars, Snow Flakes, Fact Constellations, Concept hierarchy, Process Architecture, 3 Tier Architecture, Data Marting.
 



Unit II:
Data Warehouse Process and Technology: Warehousing Strategy, Warehouse /management and Support Processes, Warehouse Planning and Implementation,Hardware and Operating Systems for Data Warehousing, Client/Server
Computing Model & Data Warehousing. Parallel Processors & Cluster Systems, Distributed DBMS implementations, Warehousing Software, Warehouse Schema Design, Data Extraction, Cleanup & Transformation Tools, Warehouse Metadata
 

Unit III:
Data Mining: Overview, Motivation, Definition & Functionalities, Data Processing, Form of Data  Preprocessing, Data Cleaning: Missing Values, Noisy Data,(Binning, Clustering, Regression, Computer and Human inspection),Inconsistent Data, Data Integration and Transformation. Data Reduction:-Data Cube Aggregation, Dimensionality reduction, Data Compression, Numerosity Reduction, Discretization and Concept hierarchy generation, Decision Tree.


Unit IV: Classification: Definition, Data Generalization, Analytical Characterization, Analysis of attribute relevance, Mining Class comparisons, Statistical measures in large Databases, Statistical-Based Algorithms, Distance-Based Algorithms, Decision Tree-Based Algorithms. Clustering: Introduction, Similarity and Distance Measures, Hierarchical
and Partitional Algorithms. Hierarchical Clustering- CURE and Chameleon. Density Based Methods-DBSCAN,OPTICS. Grid Based Methods- STING, CLIQUE. Model Based Method –Statistical Approach, Association rules: Introduction, Large Itemsets, Basic Algorithms, Parallel and Distributed Algorithms, Neural Network approach. 


Unit V :
Data Visualization and Overall Perspective: Aggregation, Historical information, Query Facility, OLAP function and Tools. OLAP Servers, ROLAP,MOLAP, HOLAP, Data Mining interface, Security, Backup and Recovery, Tuning Data Warehouse, Testing Data Warehouse. Warehousing applications and Recent Trends: Types of Warehousing Applications, Web Mining, Spatial Mining and Temporal Mining.
 



Textbooks:
1. Alex Berson, Stephen J. Smith “Data Warehousing, Data-Mining & OLAP”, TMH
2. Mark Humphries, Michael W. Hawkins, Michelle C. Dy, “ Data Warehousing:Architecture and Implementation”, Pearson
3. Margaret H. Dunham, S. Sridhar,”Data Mining:Introductory and Advanced Topics” Pearson Education
4. Arun K. Pujari, “Data Mining Techniques” Universities Press
5. Pieter Adriaans, Dolf Zantinge, “Data-Mining”, Pearson Education 

DMDW PREVIOUS Year Question Papers(LAST 10 Years) [Preview ##eye##]  [DOWNLOAD ##download##]


 


Unit II: [Preview ##eye##] Click Here to Download







Unit III: [Preview ##eye##]Click Here to Download



loading...