Datawarehousing Interview Questions
For Complete Study Material For Interview Preparations Click On Below Link
https://www.instamojo.com/interview_questions
What is the main difference between the Inmon and Kimball philosophies of Data
Warehousing?
The two differ in how the Data Warehouse is built.
• Kimball views the Data Warehouse as a collection of data marts. Data marts are focused on
delivering business objectives for departments in an organization, and the Data Warehouse
ties the data marts together through conformed dimensions. Hence, a unified view of the
enterprise can be obtained from dimensional modelling done at the local departmental level.
• Inmon advocates creating the Data Warehouse on a subject-by-subject area basis. Hence, the
development of the Data Warehouse can start with data from the online store. Other
subject areas can be added to the Data Warehouse as the need arises. Point-of-sale (POS)
data can be added later if management decides that it is necessary.
Data warehouse: stores the complex and general form of the data.
Transactional system: stores the updated daily transactions, workloads, etc.
What is the very basic difference between data warehouse and operational databases?
A data warehouse contains historical information that is made available for analysis of the
business whereas an operational database contains current information that is required to
run the business.
What is ODS?
ODS stands for Operational Data Store. It is essentially a repository of real-time operational
data.
Data is from original data source
Data is from various data sources
What Is Schema?
A schema is a collection of database objects owned by a user.
What is a dimension in data warehousing? What are the primary functions of
dimensions?
A dimension is a classification that categorizes the measures and facts in
an orderly fashion. Dimensions give the facts and measures their context, which helps users
find the answers they need about business operations.
For example:
The common dimensions that are used are:
1. People
2. Products
3. Place
4. Time, etc.
The primary functions of the dimensions are as follows:
1. Filtering
2. Grouping
3. Labelling
Usually, these functions are all used in slicing and dicing the data, where
slicing refers to filtering the data and dicing refers to grouping the data.
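Slicing and dicing as described above can be sketched in a few lines of Python. This is only an illustration: the sales rows and their column order are made up, and "dicing" follows the usage in this answer (grouping).

```python
from collections import defaultdict

# Hypothetical sales facts: (product, place, year, units_sold)
sales = [
    ("Laptop", "Pune", 2023, 5),
    ("Laptop", "Delhi", 2023, 3),
    ("Phone", "Pune", 2023, 7),
    ("Phone", "Pune", 2024, 4),
]

# Slicing: filter on one value of the Place dimension.
pune_rows = [row for row in sales if row[1] == "Pune"]

# Dicing: group the filtered rows by the Product dimension.
units_by_product = defaultdict(int)
for product, _place, _year, units in pune_rows:
    units_by_product[product] += units

print(dict(units_by_product))  # {'Laptop': 5, 'Phone': 11}
```

The product name printed next to each total is the labelling function of the dimension.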
In the image, I have shown which are fact tables and which are dimension tables. You will be
able to see there are four dimensions:
1. Time
2. Location
3. Item
4. Branch
1. Additive:
Measures that can be added across any dimension.
2. Non-additive:
Measures that cannot be added across any dimension.
3. Semi-additive:
Measures that can be added across some dimensions but not across others.
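A small sketch may make the difference clearer. The snapshot rows below are hypothetical: deposits behave as an additive measure, while an account balance is a typical semi-additive measure (it can be summed across accounts, but not across dates).

```python
# Hypothetical daily snapshot rows: (date, account, deposits_today, balance)
rows = [
    ("2024-01-01", "A", 100, 100),
    ("2024-01-01", "B", 50, 50),
    ("2024-01-02", "A", 20, 120),
    ("2024-01-02", "B", 0, 50),
]

# Additive: deposits can be summed across accounts and across dates.
total_deposits = sum(r[2] for r in rows)

# Semi-additive: balances can be summed across accounts for ONE date...
balance_on_jan2 = sum(r[3] for r in rows if r[0] == "2024-01-02")

# ...but summing balances across dates counts the same money twice.
misleading_total = sum(r[3] for r in rows)

print(total_deposits, balance_on_jan2, misleading_total)  # 170 170 320
```

A ratio such as a margin percentage would be an example of a non-additive measure: it has to be recomputed from its parts rather than summed.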
Real Example:
Following is a fact table which contains the keys of the dimension tables (as foreign keys)
and the added measure for ITEM, i.e. Product Sold.
ITEM KEY | Time key | Product key | Date key | Product Sold
The fact table contains foreign keys to the time, product and customer
dimensions, plus the measurement values. Following are some examples of common facts:
number of units sold, margin and sales revenue. The dimension tables are customer, time and
product etc., which are used to analyse the data.
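The relationship between a fact table and its dimension tables can be sketched with Python's built-in sqlite3 module. The table and column names here (fact_sales, dim_product, dim_time) are assumptions made for the example, not names taken from the text above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, product_name TEXT);
CREATE TABLE dim_time (time_key INTEGER PRIMARY KEY, year INTEGER);
-- The fact table holds foreign keys to the dimensions plus the measure.
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    time_key INTEGER REFERENCES dim_time(time_key),
    units_sold INTEGER
);
INSERT INTO dim_product VALUES (1, 'Laptop'), (2, 'Phone');
INSERT INTO dim_time VALUES (10, 2023), (11, 2024);
INSERT INTO fact_sales VALUES (1, 10, 5), (2, 10, 7), (2, 11, 4);
""")

# Analyse the measure by joining the fact table to a dimension table.
units_per_product = conn.execute("""
    SELECT p.product_name, SUM(f.units_sold)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_key = f.product_key
    GROUP BY p.product_name
    ORDER BY p.product_name
""").fetchall()

print(units_per_product)  # [('Laptop', 5), ('Phone', 11)]
```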
Granularity in a table represents the level of detail of the information stored in the table.
In BI, granularity is a very important concept when checking table data. Granularity can be
high or low. High granularity data carries a high level of detail; transaction-level data, for
example, is high granularity data. Low granularity data carries only summarized, low-detail
information. Fact tables are usually kept at the lowest, most detailed level, so that the data
can be rolled up later.
Following 2 points are important in defining granularity:
1. Determining the dimensions that are to be included
2. Determining where in the hierarchy of each dimension the information is to be placed.
Real life Example:
Date Dimension Granularity level :
Year, month, quarter, period, week, day
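As a rough illustration of moving between these levels, the sketch below rolls hypothetical day-level facts up to month and year level. Detailed data can always be summarized this way, while the reverse is not possible.

```python
from collections import Counter
from datetime import date

# Hypothetical day-level facts: (sale_date, units_sold)
daily = [
    (date(2024, 1, 5), 3),
    (date(2024, 1, 20), 2),
    (date(2024, 2, 1), 4),
]

# Roll up from day level to month level.
monthly = Counter()
for sale_date, units in daily:
    monthly[(sale_date.year, sale_date.month)] += units

# Roll up again from month level to year level.
yearly = Counter()
for (year, _month), units in monthly.items():
    yearly[year] += units

print(dict(monthly))  # {(2024, 1): 5, (2024, 2): 4}
print(dict(yearly))   # {2024: 9}
```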
What is normalization?
Normalization is also known as “Database Normalization”.
It is the process of organizing the columns and the tables of a relational database so as
to reduce data redundancy and improve data integrity.
Further, this process also helps in simplifying the database design so that an optimal
structure is achieved. In short, normalization splits the data into additional tables, each
holding one kind of information, while still allowing the data to be retrieved easily.
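A minimal sketch of the idea, using made-up order data: the customer's city is repeated on every order row, and normalizing moves it into a separate customers table that the orders refer to by key.

```python
# Hypothetical denormalized rows: the city repeats on every order.
orders_flat = [
    {"order_id": 1, "customer": "Asha", "city": "Pune", "amount": 250},
    {"order_id": 2, "customer": "Asha", "city": "Pune", "amount": 100},
    {"order_id": 3, "customer": "Ravi", "city": "Delhi", "amount": 75},
]

customers = {}  # one entry per customer, so the city is stored once
orders = []     # orders keep only a reference to the customer

for row in orders_flat:
    customers[row["customer"]] = {"city": row["city"]}
    orders.append({
        "order_id": row["order_id"],
        "customer": row["customer"],
        "amount": row["amount"],
    })

# Retrieval joins the two tables back together through the key.
city_of_order_2 = customers[orders[1]["customer"]]["city"]
print(city_of_order_2)  # Pune
```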
Out of star schema and snowflake schema, which one has normalized dimension tables?
The snowflake schema normalizes its dimension tables.
Define metadata?
Metadata is simply defined as data about data. In other words, we can say that metadata is
the summarized data that leads us to the detailed data.
A load manager performs the operations required for the extract and load process. The size
and complexity of the load manager vary from data warehouse to data warehouse, depending
on the specific solution.
What Is Table?
A table is the basic unit of data storage in an Oracle database. The tables of a database
hold all of the user-accessible data. Table data is stored in rows and columns.
What Is A View?
A view is a virtual table. Every view has a Query attached to it. (The Query is a SELECT
statement that identifies the columns and rows of the table(s) the view uses.)
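The sketch below shows a view in action. It uses Python's built-in sqlite3 module as a stand-in database, and the employees table and bi_staff view are hypothetical names chosen for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER);
INSERT INTO employees VALUES ('Asha', 'BI', 900), ('Ravi', 'ETL', 700);
-- The view stores only the query, not a copy of the rows.
CREATE VIEW bi_staff AS SELECT name FROM employees WHERE dept = 'BI';
""")

before = conn.execute("SELECT name FROM bi_staff ORDER BY name").fetchall()

# A new row in the base table shows up in the view automatically.
conn.execute("INSERT INTO employees VALUES ('Meera', 'BI', 800)")
after = conn.execute("SELECT name FROM bi_staff ORDER BY name").fetchall()

print(before)  # [('Asha',)]
print(after)   # [('Asha',), ('Meera',)]
```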
What Is An Extent?
An Extent is a specific number of contiguous data blocks, obtained in a single allocation, and
used to store a specific type of information.
What Is An Index?
An Index is an optional structure associated with a table that gives direct access to rows;
it can be created to increase the performance of data retrieval. An index can be created on
one or more columns of a table.
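As a hedged illustration, the sketch below creates an index with Python's built-in sqlite3 module (used here as a stand-in database) and asks the engine for its query plan. The sales table and the index name idx_sales_product are assumptions for the example, and the exact plan wording differs between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (sale_id INTEGER, product_key INTEGER, units INTEGER)"
)
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [(i, i % 10, 1) for i in range(1000)],
)

# Create an index on the column used for lookups.
conn.execute("CREATE INDEX idx_sales_product ON sales (product_key)")

# Ask SQLite how it would run the query; the last column is the description.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM sales WHERE product_key = 3"
).fetchall()
plan_text = " ".join(row[-1] for row in plan)

matching = conn.execute(
    "SELECT COUNT(*) FROM sales WHERE product_key = 3"
).fetchone()[0]

print(plan_text)
print(matching)  # 100
```

The plan text should mention idx_sales_product, which indicates that the rows are located through the index rather than by scanning the whole table.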
What Is An Integrity Constraint?
An integrity constraint is a declarative way to define a business rule for a column of a table.
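For example, the business rule "a price can never be missing or negative" can be declared on the column itself. The sketch uses Python's built-in sqlite3 module as a stand-in database, and the products table is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE products (
        product_id INTEGER PRIMARY KEY,
        price REAL NOT NULL CHECK (price >= 0)  -- the declared business rule
    )
""")

conn.execute("INSERT INTO products VALUES (1, 9.99)")  # satisfies the rule

try:
    conn.execute("INSERT INTO products VALUES (2, -5)")  # violates the rule
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

row_count = conn.execute("SELECT COUNT(*) FROM products").fetchone()[0]
print(rejected, row_count)  # True 1
```

The database rejects the bad row by itself, so no application code has to repeat the check.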
What is SCD?
SCD stands for slowly changing dimension, and it applies to cases where the attributes of a
dimension record change over time.
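One common way to handle such changes is usually called Type 2: the old row is closed and a new version is added, so history is preserved. The sketch below is only an illustration of that approach; the customer_dim rows, the column names and the apply_change helper are all hypothetical.

```python
from datetime import date

# Hypothetical customer dimension; end_date None marks the current version.
customer_dim = [
    {"customer_id": 1, "city": "Pune",
     "start_date": date(2020, 1, 1), "end_date": None},
]

def apply_change(dim, customer_id, new_city, change_date):
    """Close the current row and append a new version (Type 2 style)."""
    for row in dim:
        if row["customer_id"] == customer_id and row["end_date"] is None:
            if row["city"] == new_city:
                return  # nothing changed, keep the current row
            row["end_date"] = change_date
    dim.append({"customer_id": customer_id, "city": new_city,
                "start_date": change_date, "end_date": None})

apply_change(customer_dim, 1, "Delhi", date(2024, 6, 1))

for row in customer_dim:
    print(row["city"], row["start_date"], row["end_date"])
# Pune 2020-01-01 2024-06-01
# Delhi 2024-06-01 None
```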