Académique Documents
Professionnel Documents
Culture Documents
Day 1
BI concepts Data warehouse concepts Introduction to BIW Advantages of BIW over other data warehouse tools Concept of star schema architecture Introduction to Administrator workbench (All buttons in AWB)
Introduction
Success in a competitive business environment needs more than just
good information. Ability to derive meaningful, timely and readily accessible insights from the information is the need of the hour.
Insights into the business are the key to define effective strategy,
align business operations to the strategy and improve the efficiency and effectiveness of execution.
Business needs
The ability to take actions based on complete, timely, relevant
insights.
Introducing SAP BI
SAP BI s Approach
Today
SAP BI s Approach
2. Bring decision making to the business process 3. Align execution with strategy across organizations to achieve
corporate goals
2. Bring decision making to the business process 3. Align execution with strategy across organizations to achieve
corporate goals
Business benefits
Better-informed decisions with faster corrective actions. Better business performance as a result of strategy-guided actions. Faster innovation. Faster response to changing business conditions. Increased competitive advantage.
Business Intelligence
Defined as: Business Intelligence is a technology based on customer and profit oriented models that reduce operating costs and provide increased profitability by improving productivity, sales, and service and help to make decision-making capabilities at no time. Business Intelligence Models are based on multi dimensional analysis capabilities.
On-Line Analytical Processing (OLAP) and Data Mining - Analysis tools are
applied against the data warehouse to analyze and mine the data.
Key differentiators
Term Data Warehouse coined by Bill Inmon in 1990 Bill Inmon s definition
A warehouse is a Subject-oriented, Integrated, Timevariant and Non-volatile collection of data in support of managements decision making process
Page 28
Subject-Oriented
Data that gives information about a particular subject instead of about a
company's ongoing operations
Operational
Data Warehouse
Plan
Despatch
Customers
Products
Invoices
Orders
Regions
Time
Page 29
Integrated
Data that is gathered into the data warehouse from a variety of sources and
merged into a coherent whole.
Appl A - m,f Appl B - 1,0 Appl C - male,female Appl A - balance dec fixed (13,2) Appl B - balance pic 9(9)V99 Appl C - balance pic S9(7)V99 comp-3 Appl A - bal-on-hand Appl B - current-balance Appl C - cash-on-hand Appl A - date (julian) Appl B - date (yymmdd) Appl C - date (absolute)
2008 MindTree Consulting
date (julian)
Page 30
Time Variant
All data in the data warehouse is identified with a particular time period.
Operational
Data Warehouse
Current Value data time horizon : 60-90 days key may not have element of time
Snapshot data time horizon : 5-10 years key has an element of time data warehouse stores historical data
Page 31
Page 32
Decision oriented
Decision support Watching the Wheels turn Broad looking across...
View
Usage patterns
Time
Stable, predictable
Limited time frame
Variable, Unpredictable
Historical data
Data
Detailed only
2008 MindTree Consulting
Table structure
Scope of data Data
Page 34
OLTP Systems are used to run a business and are based on ER Model
The Data Warehouse helps to optimize the business and is based on OLAP (dimensional model)
Page 35
What is OLAP
Stands for OnLine Analytical Processing OLAP tools aid users in quick and easy multi dimensional analysis to
get insights into whats happening
What is OLAP
Slice and Dice Relational Model:
Record #001 #002 #003 #004 Product Film Lenses Cameras Film Region East West Central West Month Dec Jan Feb Mar Sales 240 250 690 425
Multidimensional Model:
Region
Sales
Ad Hoc View
Page 37
Originated in the mid seventies by A.C.Nielson Made popular by Ralph Kimball Dimensional Model divides the world into
Measurement : Sales, Cost, Stock, Yield
Page 38
Mode
Location
Property
Agent
Product Line
Booking
Business Unit
Product Group
Contract
Franchisee
Customer
Contact
Sales rep
Division
Site
Site key
Dimension
Fact
Bookings
Site Rate Plan Channel
Site QA Score
Lodge Score Rest Score
Time
Date key Week Month Quarter Year Weekend flag
Date
# of new bookings # of booking nights # of rooms for bookings # of guaranteed bookings
Rate Plan
Rate plan key Rate plan desc Rate plan type Brand
Hierarchy
2008 MindTree Consulting
Measure
Page 40
Dimensions - Definition
Contain descriptors of the business using which analysts view data
by.
Dimensions sets the context for asking questions about the facts in
the fact table.
SPEAKS BUSINESS LANGUAGE !!! Dimensions have multiple levels A combination of levels participate in a hierarchy
Dimension - Characteristics
The tables contains all the textual descriptors of the business. Dimensions supply the context in which a measurement was made They correspond to the entities by which you want to analyze the
business
Many columns
Fewer rows Are linked to a fact table through a foreign key reference to their
primary key
Page 42
Dimensions Examples
Franchisee Consumer Property Car Channel Channel-Travel Agent Site Rate plans Brand Business unit Entity Entity group
2008 MindTree Consulting Page 43
Fact - Definition
Each row in a fact table corresponds to a measurement. Fact tables have a few columns and lots of rows
Page 44
Are usually the largest tables Are usually appended to Can grow quickly
Page 45
Fact examples
Sum insured Amount Approved Claims ratio (derived fact) Premium Paid
Page 46
PRODUCT
All Products
DEPOT
All Depots
CUSTOMER
All Customers
PERIOD
All Periods
4 * 4 * 5 * 6 = 480 reports
Category Region Year
Region
Area
Quarter
Product
Depot
Customer
Time point
Page 47
FROM
FROM
Channel_Desc, year, month, 'TotSales' = sum(Total_Sale) Arrivals st, Channel_Dimension pd, Time_Dimension td Channel_Desc = Agent' month = 2 year in(1992,1994) st.Product_Key = pd.Product_Key st.Time_Key = td.Time_Key
Channel_Desc, year, month
Less intensive
Page 48
Example
Retail : what products does he buy ? HealthCare : Which area contribute to maximum Claims?
Stage 2: Analysis
Less focus on what happened ? More focus on why it happened ? Iterative refinement of questions ( Q&A Map ) support chain of
thought analysis and questions
Example
Why did expenses increase by 10% compared to last quarter?
2008 MindTree Consulting Page 49
Typical architecture of a DW
Page 51
Optimized for inserts and updates Very less redundancy of data by design
Page 52
Building Blocks Component 2 : ETL ETL stands for Extract Transform Load
The action of
Extracting information from one or more Source Transforming it mid stream Aggregation Business Rules Code normalization/cleansing Loading it into a central database
Page 53
Page 54
Page 55
Page 56
The glue that binds the data warehouse components An encyclopedia of the data warehouse
Page 57
Extractors
For distributing data to other applications
Page 58
To present data
Business Objects
Cognos Microstrategy OLAP services Express
To Store data
Hyperion
Brio SAS-EIS
Data cleansing
Page 59
Data Access
Data Warehouse
Data Sources
2008 MindTree Consulting
SAP BW Architecture
SAP BW Components
Info-objects DataSources
Star schema
The Star schema offers comprehensibility for software. The Star
schema is the most popular way of implementing a MultiDimensional Model in a relational database
Star schema
The key elements of a Star schema are:
Central fact table with dimension tables shooting off from it Fact tables typically store atomic and aggregate transaction information, such as
quantitative amounts of goods sold. They are called facts.
Facts are numeric values of a normally additive nature. Fact tables contain foreign keys to the most atomic dimension attribute of each
dimension table.
Foreign keys tie the fact table rows to specific rows in each of the associated
dimension tables.
The points of the star are dimension tables. Dimension tables store both attributes about the data stored in the fact table and
textual data.
Dimension tables are de-normalized. The most atomic dimension attributes in the dimensions define the granularity of
the information, i.e. the number of records in the fact table.
2008 MindTree Consulting
Textual descriptions of a characteristic are stored in a separate text table. The system runs in different languages at a time.
Dependent attributes of a characteristic can be stored in a separate table called the Master Data Table for the characteristic
2008 MindTree Consulting
pointer or translation tables called SID (Surrogate-ID) tables are used in the BW schema to link the solution-independent master tables of the BW schema to InfoCubes
2008 MindTree Consulting
Comparison
Slide 67
SID tables (pointer tables) provide the technical link to the Master Data
(attribute, text and hierarchy) tables that are outside the dimension of a star schema.
Dimension tables are built using the combination of numeric SID values of
each Characteristic in the Dimension.
Administrator Workbench
The Data Warehousing Workbench (DWB) is the central tool for
performing the tasks in the data warehousing process
Documents
Business Content Translation Metadata Repository
2008 MindTree Consulting
Modeling
Used to create and maintain (meta) objects relevant to the data staging process in SAP BW. Objects are displayed in a tree structure, in which the objects are ordered according to hierarchical criteria. To access the Modeling function area, choose transaction RSA1.
Administration
Transport Connection
Documents
BI Content
BI Content provides pre-configured information models based on metadata. It provides users in an enterprise with a selection of information they can use to fulfill their tasks. To access the BI Content function area, choose the transaction RSORBCT
Translation
In the Translation function area, you can translate short and long texts belonging to SAP BWobjects.
Metadata Repository
transaction RSOR.
Thank You