Académique Documents
Professionnel Documents
Culture Documents
Database System Concepts - 6th Edition 20.2 ©Silberschatz, Korth and Sudarshan
Data Warehousing
Data sources often store only current data, not historical data
Corporate decision making requires a unified view of all organizational
data, including historical data
A data warehouse is a repository (archive) of information gathered
from multiple sources, stored under a unified schema, at a single site
Greatly simplifies querying, permits study of historical trends
Shifts decision support query load away from transaction
processing systems
Database System Concepts - 6th Edition 20.3 ©Silberschatz, Korth and Sudarshan
Data Warehouse vs. Operational DBMS
OLTP (on-line transaction processing)
Major task of traditional relational DBMS
Day-to-day operations: purchasing, inventory, banking, manufacturing,
payroll, registration, accounting, etc.
OLAP (on-line analytical processing)
Major task of data warehouse system
Data analysis and decision making
Distinct features (OLTP vs. OLAP):
Data contents: current, detailed vs. historical, consolidated
Database design: ER, Normalized design + application vs. star + subject
View: current, local vs. evolutionary, integrated
Access patterns: update vs. read-only but complex queries
all all
Monitor
& OLAP Server
Other Metadata
sources Integrator
Analysis
Operational Extract Query
DBs Transform Data Serve Reports
Load
Refresh
Warehouse Data mining
Data Marts
Office Day
Month
April 8, 2018 Data Mining: Concepts and 10
Database System Concepts - 6th Edition Techniques 20.10 ©Silberschatz, Korth and Sudarshan
A Sample Data Cube
Country
sum
Canada
Mexico
sum
all
0-D(apex) cuboid
time,location,supplier
3-D cuboids
time,item,location
time,item,supplier item,location,supplier
4-D(base) cuboid
time, item, location, supplier
April 8, 2018 Data Mining: Concepts and 12
Database System Concepts - 6th Edition Techniques 20.12 ©Silberschatz, Korth and Sudarshan
Conceptual Modeling of Data Warehouses
branch_key
branch location
location_key
location_key
branch_key
units_sold street
branch_name
city_key
branch_type
dollars_sold city
city_key
avg_sales city
state_or_province
Measures country
April 8, 2018 Data Mining: Concepts and 15
Database System Concepts - 6th Edition Techniques 20.15 ©Silberschatz, Korth and Sudarshan
Example of Fact Constellation
time
time_key item Shipping Fact Table
day item_key
day_of_the_week Sales Fact Table item_name time_key
month brand
quarter time_key type item_key
year supplier_type shipper_key
item_key
branch_key from_location
Database System Concepts - 6th Edition 20.20 ©Silberschatz, Korth and Sudarshan