ETL Testing
ETL Testing
Testing
Agenda
What is Operational Database/System ?
Introduction of Data warehouse
Data warehouse Architecture
Data Extraction, Data Transformation ,Data
Operational Database/System
An operational database, as the name implies, is the
Operational Database/System
Data Extraction:
Data from different source systems is converted into
one consolidated data warehouse format which is ready
for transformation processing.
Data Transformation:
In transforming the data, the following tasks may
involve.
Applying business rules (for example calculating new
measures and dimensions)
Cleaning (for example Mapping NULL to 0 or "Male"
to "M" and "Female" to "F" etc)
Filtering (for example selecting only certain columns to
load),
Splitting a column into multiple columns and vice versa
Data Loading:
Loading data into the data warehouse.
End users directly access data derived from
several source systems through the data
warehouse
OLAP (Online Analytical Processing) are being
used aggressively by organizations to discover
valuable business trends from data marts and
data warehouses.
Data Mining:
Data mining, the extraction of hidden predictive
information from large databases, is the process
of analyzing data from different perspectives
and summarizing it into useful information
Data mining tools predict future trends and
behaviors, allowing businesses to make
proactive, knowledge-driven decisions.
It allows users to analyze data from many
different dimensions or angles, categorize it,
and summarize the relationships identified.
Technically, data mining is the process of finding
correlations or patterns among dozens of fields
in large relational databases.
Subject Oriented
Operational
Data
Time
Warehouse
Customer
Products
Location
Sales
Subject Oriented
Data is stored subject
wise
Independent
Data Warehouse stores data
subject wise of
Application
Application oriented
Data is dependant on
the Application
Integrated Data:
All the relevant data from various applications
Integrated
Data
Warehouse
Operational
Sales
Departmental
Within a department
Finance
Procure
ment
Integrated
Data is integrated across
Enterprise
One version of truth
NonVolatile Data:
The data in the data warehouse is primarily for query and
time period.
The time-variant nature of the data in a data warehouse
Allows for analysis of the past
Related information to the present
Enables forecasts for the future
Non-volatile
insert
change
Data
Warehouse
Operational
delet
e
insert
load
change
read only
access
Time Variant
Operational
Data
Warehouse
Snapshot data
time horizon : 5-10 years
data warehouse stores historical
data
OLTP
OLAP
OLAP stands for On-Line Analytical Processing.
OLAP has been growing in popularity due to
OLAP TOOLS
A
Subject oriented,
Integrated,
Non-volatile,
Time-variant
data store containing
detailed and aggregate
corporate data Data stored for longer
It is a read only
database and data is
always inserted but
not modified
duration of time
An Example
Current
Current
/Recent
/Recent
Informatio
Informatio
nn
Historical
Historical
Informatio
Informatio
nn
OLTP
OLAP
OLAP
Source of
data
OLTPs are the original source OLAP data comes from the
of the data.(Operational data) various OLTP Databases
(Consolidation )
Purpose of
Data
Inserts and
Updates
Queries
Processing
Speed
Database
Design
DSS
Q&A
Thank You