Lecture OLAP & Operation
Lecture OLAP & Operation
Lecture OLAP & Operation
• Data warehousing: Introduction to Data Warehouse, Statistical Observation on Data, Data Types, DBMS
Schemas for Decision Support, Data Mart, Data Extraction, Transformation and Load (ETL) Operations,
Metadata; Online Analytical Processing (OLAP), Online Transaction Processing (OLTP), ROLAP,
MOLAP, HOLAP and their Operations, Bitmap Indexing, Join Indexing, Attribute Selection Measure,
BUC Cubing Method, Data Cubing, Star Tree Construction, Inverted Index.
• Data Mining: Introduction Data Mining & Applications, Types of Data, Pre-Processing, KDD Process.
• Association Rule Mining (ARM): Interestingness of Patterns, Mining Frequent Patterns, K-Frequent
Item Set Mining, A-Priori Algorithm, Associations and Correlations Mining, Correlation Analysis,
Constraint Based Association Mining.
• Classification and Prediction: Basic Concepts, Entropy, Decision Tree, Naïve Bayes Algorithm, Neural
Networks, Back Propagation, Support Vector Machines, Associative Classification, Lazy Learners,
Prediction.
• Clustering: Basic Concepts, Cluster Analysis, K-Means, Partitioning Methods, Hierarchical Clustering,
Expectation Maximization, Density based Clustering, Web Mining, Text Mining, Spatial Mining.
• Case Study: Case Studies on Various Data Mining Techniques with Varying Data Sets.
• History of OLAP
• OLAP Cube
• OLAP Operations
• Benefits of OLAP
Introduction
extract Query/Reporting
transform
load serve
refresh
etc. e.g., ROLAP
Operational
DB’s Data Mining
serve
Data Marts
CS 336 14
OLTP vs. OLAP
OLAP Cube
• Takes the current aggregation level of fact values and does a further
aggregation on one or more of the dimensions.
• Equivalent to doing GROUP BY to this dimension by using attribute
hierarchy.
• SELECT [attribute list], SUM [attribute names] FROM [table list]
WHERE [condition list] GROUP BY [grouping list]
Example
After Roll up
Example of Roll up
Drill- down
• Rotates the data axis to view the data from different perspectives.
• Groups data with different dimensions
Pivot
• Sort
• Sort brings the cube back where the members of a dimension were sorted.
• Add Measure
• This OLAP operation one is able to add new measures to a cube.
• Drop Measure
• In contrast to Add Measure, it’s also possible to get rid of a measure from a
data cube if it's not necessary.
• Union
• Due to an opportunity of Union, you can unite a number of cubes which have
the same scheme but separate instances.
• Difference
• Difference eliminates the cells in a cube which are owned by another one.
These two cubes must possess the same scheme.
Union
Add and Drop Measure
Sort
Benefits