Copy of 2m Unit1
Copy of 2m Unit1
A Data Warehouse (DW) is a relational database that is designed for query and analysis rather than
transaction processing. It includes historical data derived from transaction data from single and multiple sources.
1. Business User: Business users require a data warehouse to view summarized data from the past. Since these
people are non-technical, the data may be presented to them in an elementary form.
2. Store historical data: Data Warehouse is required to store the time variable data from the past. This input is
made to be used for various purposes.
3. Make strategic decisions: Some strategies may be depending upon the data in the data warehouse. So, data
warehouse contributes to making strategic decisions.
4. For data consistency and quality: Bringing the data from different sources at a commonplace, the user can
effectively undertake to bring the uniformity and consistency in data.
5. High response time: Data warehouse has to be ready for somewhat unexpected loads and types of queries,
which demands a significant degree of flexibility and quick response time.
1. It is used for Online Transactional Processing 1. It is used for Online Analytical Processing (OLAP).
(OLTP) but can be used for other objectives such as This reads the historical information for the customers
Data Warehousing. This records the data from the for business decisions.
clients for history.
2. The tables and joins are complicated since they are 2. The tables and joins are accessible since they are de-
normalized for RDBMS. This is done to reduce normalized. This is done to minimize the response
redundant files and to save storage space. time for analytical queries.
4. Entity: Relational modeling procedures are used 4. Data: Modeling approach are used for the Data
for RDBMS database design. Warehouse design.
6. Performance is low for analysis queries. 6. High performance for analytical queries.
7. The database is the place where the data is taken 7. Data Warehouse is the place where the application
as a base and managed to get available fast and data is handled for analysis and reporting objectives.
efficient access.
Operational systems are designed to support high-volume transaction Data warehousing systems are typically
processing. designed to support high-volume analytical
processing (i.e., OLAP).
Operational systems are usually concerned with current data. Data warehousing systems are usually
concerned with historical data.
Data within operational systems are mainly updated regularly according Non-volatile, new data may be added
to need. regularly. Once Added rarely changed.
It is designed for real-time business dealing and processes. It is designed for analysis of business
measures by subject area, categories, and
attributes.
Data warehouses and their architectures very depending upon the elements of an organization's situation.
A set of data that defines and gives information about other data.Meta Data summarizes necessary information
about data, which can make finding and work with particular instances of data more accessible. For example, author,
data build, and data changed, and file size are examples of very basic document metadata.
Data warehouses use a staging area (A place where data is processed before entering the warehouse).
Data Warehouse Staging Area is a temporary location where a record from source systems is copied.
A staging area simplifies data cleansing and consolidation for operational method coming from multiple source
systems, especially for enterprise data warehouses where all relevant data of an enterprise is consolidated.
A data mart is a segment of a data warehouses that can provided information for reporting and analysis on a
section, unit, department or operation in the company, e.g., sales, payroll, production, etc.
14.Properties of Data Warehouse Architectures
1. Separation
2. Scalability
3. Extensibility
4. Security
5. Administerability
17. Define Meta data Repository (or) What is the role of meta data repository in data warehouse?
The Metadata Repository stores information that defines DW objects. It includes the following parameters and
information for the middle and the top-tier applications:
A Modern Data Warehouse is a cloud-based solution that gathers and stores that information. Organizations can
process this data to make intelligent decisions. That’s why various organizations use a Modern Data Warehouse to
improve their finances, human resources, and operations business processes. Ex: Quality cloud-based warehouse
departments
Data Acquisition
Data Engineering
Data Management Governance
Reporting and Business Intelligence
Data Science