Exam 1: October 5, 2016
Exam 2: November 23, 2016
Quiz 2: October 26, 2016
Quiz 3: November 9, 2016
FINAL PROJECT -
Data Maintenance
Prepare a data maintenance schedule
ROI Measures
Determine methods of estimating ROI
Generate new standards
Modify the existing processes
Establish methods to identify and fix problems
Establish new performance targets
Compare your goals with the industry (important)
Type 1: Overwrite. A value is replaced. E.g., in the Salesperson dimension, the Department is changed.
In that case we simply overwrite the old value with the new one; the old value is not kept.
DR76321  John Smith  Electronics
DR76321  John Smith  Houseware

DR76321  John Smith  Houseware  Electronics
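The Type 1 overwrite described above can be sketched as follows. This is a minimal illustration, not a definitive implementation: the in-memory dictionary, the field names `name` and `department`, and the function `apply_type1_change` are all hypothetical; only the sample values (DR76321, John Smith, Electronics, Houseware) come from the notes.

```python
# Hypothetical in-memory salesperson dimension, keyed by salesperson ID.
salesperson_dim = {
    "DR76321": {"name": "John Smith", "department": "Electronics"},
}


def apply_type1_change(dim, sp_id, attribute, new_value):
    """Overwrite an attribute in place and return the value that was replaced.

    After the call the dimension itself holds no trace of the old value,
    which is the defining property of a Type 1 change.
    """
    row = dim[sp_id]
    old_value = row[attribute]
    row[attribute] = new_value
    return old_value


previous = apply_type1_change(salesperson_dim, "DR76321", "department", "Houseware")
```

The row count does not change: history is lost in exchange for simplicity, so every existing fact row for DR76321 now reports under Houseware.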
     PatientAge  #Visits  TotalBilling
1    19-25       Low      < 20,000
2    19-25       Low      < 30,000
3    19-25       Low      < 50,000
4    19-25       Med      < 60,000
5    19-25       Med      < 70,000
6    19-25       Med      < 80,000
Fact Table
MedicalRecKey  PatientKey  DateKey  ProviderKey  .. Facts..
3421  245  780  3
3421  246  780  4
     PatientAge  #Visits  TotalBilling  MedicalCondition
1    19-25       Low      < 20,000
2    19-25       Low      < 30,000
3    19-25       Low      < 50,000
4    19-25       Med      < 60,000
5    19-25       Med      < 70,000
6    19-25       Med      < 80,000
MedicalCondition values shown: Stable, Stable
Fact Table
MedicalRecKey  PatientKey  DateKey  ProviderKey  .. Facts..
3421  245  780  3
3421  246  780  4
Patient Dimension
MedicalRecKey CurrentPatientKey
3421
SPName  HistDept  CurrentRowFlag
1001  DR76321  John Smith  Electronics  DR76321  John Smith  Houseware
1032  DR76321  John Smith  Houseware    DR76321  John Smith  Houseware
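The column names HistDept and CurrentRowFlag, together with two different keys (1001, 1032) for the same salesperson DR76321, suggest that a department change can also be recorded by adding a new row instead of overwriting. The sketch below illustrates that idea under stated assumptions: the class `SalespersonRow`, the field names `sp_key` and `sp_id`, and the function `add_version` are hypothetical, and the notes do not confirm that this is exactly the scheme on the slide.

```python
from dataclasses import dataclass


@dataclass
class SalespersonRow:
    sp_key: int             # surrogate key (assumed meaning of 1001 / 1032)
    sp_id: str              # natural key, e.g. "DR76321"
    sp_name: str
    hist_dept: str          # department in effect for this row version
    current_row_flag: bool  # True only for the latest version


def add_version(rows, sp_id, new_dept, new_key):
    """Expire the current row for sp_id and append a new current row."""
    current = next(r for r in rows if r.sp_id == sp_id and r.current_row_flag)
    current.current_row_flag = False
    new_row = SalespersonRow(new_key, sp_id, current.sp_name, new_dept, True)
    rows.append(new_row)
    return new_row


rows = [SalespersonRow(1001, "DR76321", "John Smith", "Electronics", True)]
add_version(rows, "DR76321", "Houseware", new_key=1032)
```

Unlike an overwrite, the old row survives, so facts recorded against key 1001 still report under Electronics while new facts use key 1032.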
Transform
Type Conversion (ASCII to Binary; SQL Date to Normal Date; Binary to ASCII; Number to Code (0
to F and 1 to M))
Data Separation (Full Name to First Name and Last Name; Date to Year, Month and Day; CSZ to
City, State and Zip)
Standardization (ZIP+4 to ZIP; SSN with Dashes; P.O. Box to PO BOX; Phone# as (xxx) xxx-xxxx)
Functions (Date to Qtr [assuming no Date Dimension]; All Upper or All Lower; Date to Fiscal
Date; Age from Date of Birth)
Derived Measures / Dimensions (% Total; Rank; Quartile; Decile; New Flags etc.)
Format Conversions (YYYYMODA to Mo/Da/Yr)
Scrubbing (Removal of Jr.,Sr.,Dr.,MD etc.; part of standardization; Business Rules)
Add New Fields ($DATE$, $ABSREC$)
Layout (Horizontal or Vertical)
Missing Values (Unknown; Undefined; N/A etc.)
Sign Association (Charge and Quantity)
Chronology of Data (Admit and Discharge Dates; Birth Date and Service Dates; Service Date and
Current Date; Start and End Dates)
Value Association (Line Charge and Unit Charge)
One Column Value dictates the destination (TXCODE: PAY or ADJ; WAGETYPE: REG or OVER or
SICK or HOLIDAY, etc.)
Key Value Normalization / Standardization (BILLNGID and ACCOUNT_NO, e.g. MMS-PATNO)
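A few of the transformations listed above can be sketched in Python. This is an illustrative sketch only: every function name is hypothetical, a four-digit year is assumed for Mo/Da/Yr, the name splitter assumes a simple "First Last" input, and the phone formatter assumes ten-digit numbers.

```python
from datetime import date, datetime


def split_full_name(full_name):
    """Data separation: 'First Last' -> (first, last)."""
    first, _, last = full_name.strip().partition(" ")
    return first, last.strip()


def zip_plus4_to_zip(zip_code):
    """Standardization: ZIP+4 -> 5-digit ZIP."""
    return zip_code.strip()[:5]


def format_phone(raw):
    """Standardization: ten digits in any punctuation -> (xxx) xxx-xxxx."""
    digits = "".join(ch for ch in raw if ch.isdigit())
    if len(digits) != 10:
        raise ValueError(f"expected 10 digits, got {len(digits)}")
    return f"({digits[:3]}) {digits[3:6]}-{digits[6:]}"


def yyyymoda_to_mo_da_yr(value):
    """Format conversion: 'YYYYMODA' -> 'Mo/Da/Yr' (four-digit year assumed)."""
    return datetime.strptime(value, "%Y%m%d").strftime("%m/%d/%Y")


def date_to_quarter(d):
    """Function: calendar date -> calendar quarter 1..4."""
    return (d.month - 1) // 3 + 1


def age_from_dob(dob, as_of):
    """Function: completed years between date of birth and as_of."""
    had_birthday = (as_of.month, as_of.day) >= (dob.month, dob.day)
    return as_of.year - dob.year - (0 if had_birthday else 1)


def number_to_code(value):
    """Type conversion: 0 -> 'F', 1 -> 'M'; anything else -> 'Unknown'."""
    return {0: "F", 1: "M"}.get(value, "Unknown")
```

For example, `yyyymoda_to_mo_da_yr("20161005")` gives `"10/05/2016"` and `date_to_quarter(date(2016, 11, 23))` gives `4`. Mapping unexpected codes to "Unknown" follows the Missing Values item in the list; a real load would take these rules from the business requirements.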
DASHBOARD_FACT_TABLE.xlsx
Loading
DIMENSIONAL MODELLING
How do we know that the dimensions we are considering form a complete set?
Degrees of freedom.
Impact of the external factors.
DIMENSIONS are the CONSTRAINTS put on the system behavior.
The objective is to understand each constraint and how the constraints interact with each other.
DIMENSIONS
MEASURES
Sale (i.e. Selling Price, Discounts), Loss leaders (Product A vs Product B), Dispersion
Advertising / Promotion / Coupons
Loyalty (Frequent Flyer)
Financing (Credit) Lease payments
Repackaging, e.g. BJ
file:///C:/Apl/PINPT_PREMIER/DIMN_AGGR.HTM
Reasons why the host data in the current format is unusable for the query
Integration of different data elements may be required for a single analysis
End-User may need to know the internal data schema of how the data is stored
Data may need to be remapped
Security issues
Optimized for single-record access, i.e. for operational efficiency: it quickly records a
transaction
It is highly normalized
The measures needed for strategic analysis are time-consuming to compute on
demand
It is difficult for a user to differentiate the system's internal-use data from the rest
The historical data may not be available for trend analysis
Issues: