Chapter 1
Data, Information, Knowledge, Decision
Analysis
Report
Chapter 2
Normalization
OLTP Systems
Characteristics of OLTP
Chapter 3
Data Warehouse
Advantages of Data Warehouse
Goals of Data Warehouse
DWH-Training Material
Chapter 4
Characteristics of Data Warehouse
Difference between OLTP/DW
OLAP
Data Warehouse/Data Mart
Data Warehouse Strategies
Chapter 5
Dimension Modeling
Star Schema
Snowflake Schema
Dimension Table
Conformed Dimension
Degenerated Dimension
Chapter 6
Fact Table
Types of Fact
Metadata Management
Chapter 7
Grain Level
Surrogate Key
Time Dimension
Staging Area
Slowly Changing Dimensions
Chapter 8
Project Overview
Phases of Project
Data
Raw Observations
No Meaning
Information
Meaning by Relational Connection
Knowledge
- Appropriate collection of information
- Intent is to be useful and to change the business process
What is Knowledge?
Data: Raw Facts, Numbers, Readily Captured
Information: Data in context
Knowledge: Information + Experience, Strategic Value
Action: Knowledge applied to decision making
Analysis
Report: Collection of Data
Purpose: Analysis - Comparative Study of Data, Historical Data
Final: Improve Decision
Chapter 2
Normalization
Information System/OLTP Systems
Characteristics-OLTP
Characteristics | OLTP
Operation | Insert/Update
Analytical Requirements | Low
 | Small
Data Level | Detailed
Orientation | Records
Business Intelligence
Chapter 3
Data Warehouse
Data warehousing is the entire process of extracting, transforming, and loading data into the warehouse, together with access to that data by end users and applications.
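The extract, transform, load, and access steps in this definition can be sketched in a few lines. This is only a minimal illustration, not a production design: the source rows, the `sales_fact` table, and all function names are hypothetical, and an in-memory SQLite database stands in for the warehouse.

```python
import sqlite3

# Hypothetical raw rows, standing in for an extract from an operational system.
SOURCE_ROWS = [
    {"order_id": 1, "region": " north ", "amount": "120.50"},
    {"order_id": 2, "region": "South", "amount": "80.00"},
    {"order_id": 3, "region": "north", "amount": "19.50"},
]


def extract():
    """Extract: read raw rows from the source system."""
    return list(SOURCE_ROWS)


def transform(rows):
    """Transform: tidy the region text and convert amounts to numbers."""
    return [
        (r["order_id"], r["region"].strip().title(), float(r["amount"]))
        for r in rows
    ]


def load(conn, rows):
    """Load: write the cleaned rows into a warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales_fact "
        "(order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales_fact VALUES (?, ?, ?)", rows)
    conn.commit()


def total_by_region(conn):
    """Access: an end-user style query against the loaded data."""
    cur = conn.execute(
        "SELECT region, SUM(amount) FROM sales_fact "
        "GROUP BY region ORDER BY region"
    )
    return cur.fetchall()


conn = sqlite3.connect(":memory:")  # assumption: SQLite plays the warehouse
load(conn, transform(extract()))
print(total_by_region(conn))
```

Keeping the three steps as separate functions mirrors the definition: each step can be replaced independently, for example by reading from a real source system instead of the in-memory list.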
Advantages through DW
Chapter 4
Data Warehouse Characteristics
Subject-Oriented
Integrated
Non-Volatile
Time-Variant
Characteristic | OLTP | OLAP (DW)
Access | Read/Write |
Unit of Work | Short, Simple Transaction | Query
# Users | Thousands | Hundreds
DB Size | 100 MB-GB | 100 GB - Terabytes
Function | Day to Day Operations | Decision Support
DB Design | Application Oriented | Subject Oriented
Data | Current, Up to date, detailed | Historical, Summarized
OLAP
OLAP operations
Roll-up
Drill-down
Slice and dice
Pivot (rotate)
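The four operations listed above can be illustrated on a tiny in-memory cube. This is a rough sketch under stated assumptions: the cube contents, the dimension names, and the helper functions (`aggregate`, `dice`, `slice_`, `pivot`) are all hypothetical and are not taken from any OLAP product.

```python
from collections import defaultdict

# Hypothetical sales cube: each cell is (year, quarter, region, product, amount).
DIMS = ("year", "quarter", "region", "product")
CELLS = [
    (2023, "Q1", "East", "Pens", 10),
    (2023, "Q2", "East", "Pens", 15),
    (2023, "Q1", "West", "Ink", 7),
    (2024, "Q1", "East", "Ink", 5),
    (2024, "Q1", "West", "Pens", 8),
]


def aggregate(cells, keep):
    """Sum amounts, grouped by the dimensions named in keep."""
    idx = [DIMS.index(d) for d in keep]
    totals = defaultdict(int)
    for cell in cells:
        totals[tuple(cell[i] for i in idx)] += cell[-1]
    return dict(totals)


def dice(cells, **criteria):
    """Dice: keep cells whose values fall in the allowed set per dimension."""
    return [
        c for c in cells
        if all(c[DIMS.index(d)] in allowed for d, allowed in criteria.items())
    ]


def slice_(cells, dim, value):
    """Slice: fix a single dimension to a single value."""
    return dice(cells, **{dim: {value}})


def pivot(table):
    """Pivot: swap the two axes of a two-dimensional aggregate."""
    return {(b, a): v for (a, b), v in table.items()}


by_year = aggregate(CELLS, ["year"])                      # roll-up to year
by_year_quarter = aggregate(CELLS, ["year", "quarter"])   # drill-down to quarter
east_by_product = aggregate(slice_(CELLS, "region", "East"), ["product"])
product_by_region = pivot(aggregate(CELLS, ["region", "product"]))
print(by_year, by_year_quarter, east_by_product, product_by_region, sep="\n")
```

Roll-up and drill-down are the same aggregation run at a coarser or finer level of the time hierarchy, which is why one `aggregate` helper covers both.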
Characteristic | Data Warehouse | Data Mart
Scope | Enterprise | Department
Subjects | Multiple | Single
Data Source | Many | Few
Implementation time | Months to Years | Months
Data Warehousing Strategies
Diagram: Operational Systems; Data Warehouse; Data Marts (Marketing, Sales, Finance)
Bottom Up Approach
Diagram: Legacy Data, Operations Data, External data sources; Data Marts (Marketing, Sales, Finance); Data Warehouse
Data Warehouse Architecture
Dimensional Modeling
Dimension Table Characteristics
Dimension tables have the following characteristics:
- Contain textual information that represents the attributes of the business
- Contain relatively static data
- Are joined to a fact table through a foreign key reference
- Are hierarchical in nature and provide the ability to view data at varying levels of detail
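To make the foreign key join and the hierarchy concrete, here is a small sketch using Python's built-in `sqlite3` module. The `product_dim` and `sales_fact` tables, their columns, and the sample rows are hypothetical; the point is only that one fact table can be viewed at the product level or rolled up to the category level through the dimension.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Dimension: descriptive text attributes, with a category > product hierarchy.
    CREATE TABLE product_dim (
        product_key  INTEGER PRIMARY KEY,
        product_name TEXT,
        category     TEXT
    );
    -- Fact: numeric measure plus a foreign key to the dimension.
    CREATE TABLE sales_fact (
        product_key INTEGER REFERENCES product_dim(product_key),
        amount      REAL
    );
    INSERT INTO product_dim VALUES
        (1, 'Pen', 'Stationery'), (2, 'Ink', 'Stationery'), (3, 'Lamp', 'Furniture');
    INSERT INTO sales_fact VALUES (1, 10.0), (1, 5.0), (2, 7.0), (3, 40.0);
    """
)


def sales_by(level):
    """Total sales grouped by one dimension attribute (a hierarchy level)."""
    if level not in ("product_name", "category"):
        raise ValueError(f"unknown level: {level}")
    sql = (
        f"SELECT d.{level}, SUM(f.amount) FROM sales_fact f "
        "JOIN product_dim d ON d.product_key = f.product_key "
        f"GROUP BY d.{level} ORDER BY d.{level}"
    )
    return conn.execute(sql).fetchall()


print(sales_by("product_name"))  # detailed level
print(sales_by("category"))      # summarized level
```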
Star Schema
Snowflake Schema
Conformed Dimensions
Degenerated Dimension
Fact Tables
Types of Measures
Additive facts
Non-additive facts
Semi-additive facts
Fact Tables
Additive Facts
Additive facts are facts that can be summed across all of the dimensions in the fact table.
Example: Dollar value is an additive fact. To find the amount for a particular place over a particular period of time, we can add the dollar amounts and arrive at the total amount.
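The dollar-amount example can be sketched as a plain sum with optional filters on place and period. The rows, place names, and the `total_amount` helper are hypothetical and serve only to show that the sum is meaningful along every dimension.

```python
# Hypothetical fact rows: (place, month, dollar_amount).
SALES = [
    ("Chennai", "2024-01", 100.0),
    ("Chennai", "2024-02", 150.0),
    ("Pune", "2024-01", 80.0),
]


def total_amount(rows, place=None, months=None):
    """Sum dollar amounts, optionally restricted by place and/or months."""
    return sum(
        amount
        for row_place, month, amount in rows
        if (place is None or row_place == place)
        and (months is None or month in months)
    )


# One place over a period of time.
print(total_amount(SALES, place="Chennai", months={"2024-01", "2024-02"}))
# All places for one month, and the grand total: both are still valid sums.
print(total_amount(SALES, months={"2024-01"}))
print(total_amount(SALES))
```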
Semi-additive facts
Semi-additive facts are facts that can be summed up for
some of the dimensions in the fact table, but not the others.
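A commonly cited illustration, assumed here because the slide gives no example of its own, is an account balance: balances can be summed across accounts on a given date, but summing one account's balances across dates gives a meaningless number. The data and helper names below are hypothetical.

```python
# Hypothetical month-end snapshots: (account, snapshot_date, balance).
BALANCES = [
    ("A", "2024-01-31", 100.0),
    ("B", "2024-01-31", 50.0),
    ("A", "2024-02-29", 120.0),
    ("B", "2024-02-29", 30.0),
]


def total_on(rows, snapshot_date):
    """Valid: sum balances across the account dimension for one date."""
    return sum(bal for _, day, bal in rows if day == snapshot_date)


def closing_balance(rows, account):
    """Across time, take the latest snapshot instead of summing.

    ISO-formatted date strings sort chronologically, so max() on the
    date field picks the most recent row.
    """
    own = [r for r in rows if r[0] == account]
    return max(own, key=lambda r: r[1])[2]


print(total_on(BALANCES, "2024-01-31"))
print(closing_balance(BALANCES, "A"))
# Summing account A over both dates would give 220.0, which is not a balance
# the account ever held; that is what makes the fact semi-additive.
```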
Teacher Dimension: Teacher_PK
Course_Dimension: Course_PK
Student_Dimension: Student_PK
Fact table: Teacher_FK, Course_FK, Student_FK, Location_FK
Metadata
Chapter 7
Grain Level
Level at which the data has to be captured in the fact table
Example:
- Each Sales Transaction
- Insurance Claim Transaction
- Monthly Account
Surrogate Keys
Data Staging
Source -> Staging -> Target
Slowly Changing Dimensions (SCD)
Slowly changing dimensions change gradually and occasionally over time.
Example: Employees change their address, name, or marital status.
SCD: Approach and Results
Type 1
- Only current
- Results: Losing the ability to track the old history
Type 2
- Approach: Creating an additional dimension record (with a time stamp) at the time of the change with the new attribute values
- History + Current
- Results: Segmenting history very accurately between the old description and the new description
Type 3
- Describe both historical and current view
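The three types can be contrasted with a small in-memory sketch. The slide text spells out the approach only for Type 2; the Type 1 (overwrite in place) and Type 3 (extra "previous value" column) mechanics below follow the commonly used definitions and are an assumption here, as are the column names `surrogate_key`, `valid_from`, `valid_to`, and `previous_address`.

```python
def new_dimension():
    """A hypothetical employee dimension with one current row."""
    return [{
        "surrogate_key": 1, "emp_id": "E1", "address": "Old Street",
        "valid_from": "2020-01-01", "valid_to": None,
    }]


def scd_type1(rows, emp_id, new_address):
    """Type 1: overwrite in place, so only the current value survives."""
    for row in rows:
        if row["emp_id"] == emp_id:
            row["address"] = new_address


def scd_type2(rows, emp_id, new_address, changed_on):
    """Type 2: close the current row and add a new time-stamped row."""
    for row in rows:
        if row["emp_id"] == emp_id and row["valid_to"] is None:
            row["valid_to"] = changed_on
    next_key = max((r["surrogate_key"] for r in rows), default=0) + 1
    rows.append({
        "surrogate_key": next_key, "emp_id": emp_id, "address": new_address,
        "valid_from": changed_on, "valid_to": None,
    })


def scd_type3(rows, emp_id, new_address):
    """Type 3: keep the prior value in an extra column on the same row."""
    for row in rows:
        if row["emp_id"] == emp_id:
            row["previous_address"] = row["address"]
            row["address"] = new_address


dim = new_dimension()
scd_type2(dim, "E1", "New Road", "2024-06-01")
for row in dim:
    print(row)
```

In the Type 2 variant the business key `emp_id` repeats across rows, which is one reason a separate surrogate key is needed to identify each version.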
Project Manager
Business Analyst
Architect
ETL Lead
Source System Study
Data Modeler
OLAP Lead
ETL Devs/Cons
OLAP Devs/Cons
DBA
Test Lead
Tester
Phases of Project
Phase 1 - Define
Phase 2 - Analysis
Phase 3 - Design
Phase 4 - Build
Phase 5 - Test
Phase 6 - Production
Transition to Production Phase