September 10, 2002
Copyrights 2002 ERP Data Mining & Knowledge Discovery webcast searchsap.com Sept 10, 2002
About the Speaker
• Founder and CTO of Information Frameworks, an author, speaker and world-
renowned expert on emerging Information Architectures, Integration and
Business Intelligence Technologies.
Agenda
What is Data Mining and Knowledge Discovery?
Data Mining and Statistics?
Data Mining - Present State
Application Domains
Business 317 73%
Life Sciences 85 20%
Other 31 7%
Source: http://www.kdnuggets.com/polls/
Data Mining Methodologies
CRISP-DM
http://www.crisp-dm.org/
Source: http://www.kdnuggets.com/polls/
Source: http://www.crisp-dm.org/
1. Business Understanding
2. Data Understanding
3. Data Preparation
4. Modeling
5. Evaluation
6. Deployment
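As a minimal sketch, the six phases listed above can be written down as an ordered enumeration, which makes it easy to track where a project stands. The class and function names here are illustrative assumptions, not part of CRISP-DM itself, and the sketch treats the phases as a simple linear sequence in the order the slide lists them.

```python
from enum import IntEnum
from typing import Optional


class CrispDmPhase(IntEnum):
    """The six CRISP-DM phases, numbered as on the slide."""
    BUSINESS_UNDERSTANDING = 1
    DATA_UNDERSTANDING = 2
    DATA_PREPARATION = 3
    MODELING = 4
    EVALUATION = 5
    DEPLOYMENT = 6


def next_phase(phase: CrispDmPhase) -> Optional[CrispDmPhase]:
    """Return the phase that follows `phase`, or None after Deployment."""
    if phase is CrispDmPhase.DEPLOYMENT:
        return None
    return CrispDmPhase(phase + 1)


# Print the phases in order as a checklist.
for phase in CrispDmPhase:
    print(f"{phase.value}. {phase.name.replace('_', ' ').title()}")
```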
Data Mining - Tools and Data Formats
Domains
Business 317 73%
Life Sciences 85 20%
Other 31 7%
Source: http://www.kdnuggets.com/polls/
Data Mining Technology
Techniques
• Visualization
– Use human pattern recognition capabilities
• Statistics
– Applying statistical techniques to predict
• Decision Trees
– Building scripts based on historic data
• Association Rules (Rule Induction)
– Reasoning from specific facts to reach a hypothesis
• Clustering
– Refers to finding and visualizing groups of facts that were not previously known
• Neural Networks
– Learning how to solve problems based on examples
• K-Nearest Neighbor
– Classification by looking at similar data
• Genetic Algorithms
– Survival of the fittest …
Usage
• Discover
• Understand
• Predict
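To make "classification by looking at similar data" concrete, here is a minimal K-Nearest Neighbor sketch in plain Python. The function name, the two numeric features, and the "stay"/"leave" labels are assumptions made up for illustration; they do not come from any of the tools discussed in this presentation.

```python
import math
from collections import Counter


def knn_classify(training, query, k=3):
    """Label `query` by majority vote among its k nearest training points.

    `training` is a list of (features, label) pairs, where features is a
    tuple of numbers with the same length as `query`.
    """
    # Sort the training points by Euclidean distance to the query.
    nearest = sorted(training, key=lambda item: math.dist(item[0], query))[:k]
    # Count the labels of the k closest points and return the most common.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical customers described by two numeric features.
training = [
    ((1.0, 1.0), "stay"),
    ((1.2, 0.8), "stay"),
    ((0.9, 1.1), "stay"),
    ((5.0, 5.0), "leave"),
    ((5.2, 4.9), "leave"),
    ((4.8, 5.1), "leave"),
]

print(knn_classify(training, (1.1, 1.0)))
print(knn_classify(training, (5.1, 5.0)))
```

The choice of k and of the distance measure both affect the result; k=3 and Euclidean distance are used here only because they are the simplest defaults.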
Data Mining Models
Traditional DM vendors
• SPSS Clementine
• SAS Enterprise Miner
• IBM Intelligent Miner
• Salford CART/MARTS
• …more
Database Vendors – DM within the Products
Data Mining Standards
Agenda
Enterprise Applications Landscape
• ERP Solutions
– Oracle
– PeopleSoft
– SAP
• ERP vendors have extended the scope of their applications far beyond traditional ERP functions to a wide array of business solutions such as:
– Customer Relationship Management
– Business Intelligence
– Enterprise Portals
• Siebel
Business intelligence offerings:
• Oracle Business Intelligence Solution
• PeopleSoft Enterprise Performance Management
• SAP Business Information Warehouse
Oracle Business Intelligence Solution
Business Processes (Pre-Built Portlets)
• Response to Lead (27)
• Lead to Quote (56)
• Quote to Order (15)
• Order to Cash (34)
• Demand to Build (40)
• Procure to Pay (28)
• Revenue to Compensation (29)
• Expiration to Renewal (33)
• Issue to Resolution (51)
• HR Family (43)
Oracle 9i DM Integration
• Oracle Marketing Online for
Campaign Management
• Oracle9iAS Personalization
• iStore
• more to come…
Source: Oracle
Oracle 9i Business Intelligence
• Oracle9iDS Warehouse Builder
• Oracle9iAS Discoverer
• Oracle9iDS Reports
• Oracle9iAS Portal
• Oracle9iAS Clickstream Intelligence
• Oracle9iAS Personalization
• Oracle9i Data Mining
• Oracle9iDS Business Intelligence Beans
PeopleSoft Business Intelligence Solution
Enterprise Performance Management (EPM)
• Customer Profitability
• Finance
• Workforce Analytics
• Supply Chain Management Process
• Workforce Rewards
• Enrollment Management
• Retail Merchandise
• Project Analysis
• Student Administration
• Balanced Scorecard
• Employee Scorecard
• Customer Scorecard
• Vendor Scorecard
Data Mining Capabilities
• CRM Prospect Analysis
• CRM Marketing Analysis
• CRM Sales Effectiveness
• CRM Service Effectiveness
Courtesy: eBusiness Advantage Inc. (www.ebizadvan.com)
No word on PeopleSoft data mining tools or technologies for predictive analytics, whether home-grown, acquired, or third-party products.
No response from PeopleSoft contacts.
SAP Business Intelligence Solution
Business Information Warehouse
• SAP CRM
– Campaign management
– Opportunity analytics
– Customer behavior modeling
• SAP Markets, Procurement
– Bidding, pattern-based offering
– Activity reporting, service analytics
Source: SAP
CRM Vendors – Data Mining Integration
• Oracle CRM
– Pre 9i Darwin
– Post 9i ODM
• RightPoint and E.piphany
• SPSS and Siebel
• SAP CRM
– Native Data Mining built in SAP BW - Database Independent
– Interface between SAP BW and IBM Intelligent Miner
• PeopleSoft CRM
– No official data mining product or vendor solution
– Awaiting PeopleSoft's response on what they offer
Agenda
SAP BW 3.0b Data Mining Implementation
Modeling a Decision Tree
Screen callouts:
• (3) The nature of the column content
• (4) Indicating the prediction column
• (5) Indicating the key column
• (7) Specifying the values in case the original values in the column are to be treated differently
Source: SAP
Modeling a Decision Tree
Specify Model Parameters
• Use a portion (%) of the data for training, or the whole data set for training
• Size of the window (such as 10%)
• The number of repeats with different samples
Source: SAP
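The parameters above describe training on a percentage of the data and repeating the run with different samples. The sketch below shows one plain reading of that idea: draw several random training samples of a given percentage. It is not SAP BW's algorithm, which the slide does not describe; the function name, parameter names, and the row count are assumptions for illustration only.

```python
import random


def draw_training_samples(rows, portion_pct=10, repeats=3, seed=0):
    """Draw `repeats` random training samples, each `portion_pct` % of `rows`.

    Each sample is drawn without replacement; different repeats are drawn
    independently, so they generally contain different rows.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    size = max(1, len(rows) * portion_pct // 100)
    return [rng.sample(rows, size) for _ in range(repeats)]


# Hypothetical data set: 1,000 row identifiers.
rows = list(range(1000))
samples = draw_training_samples(rows, portion_pct=10, repeats=3)

for number, sample in enumerate(samples, start=1):
    print(f"repeat {number}: {len(sample)} training rows")
```

Setting `portion_pct=100` corresponds to the "whole data set" option, in which case every repeat sees the same rows in a different order.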
Modeling a Decision Tree
• Model columns
• Selected source columns
Source: SAP
SAP BW Data Mining – Process Steps
Source: SAP
Viewing Decision Tree Training Results
• This decision tree predicts whether the customer has left or is still "on board"
• The chance of a customer leaving is 70.7% if the profession is "LABOURER"
• Out of a total of 705 cases, 41 cases are covered under this node
• Chart shows the distribution at the selected node
• 28/41 customers are likely to leave
• 13/41 customers are likely to stay
Source: SAP
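The node figures quoted above can be checked with a few lines of arithmetic. This is only a sketch using the counts on the slide (705 total cases, 41 in the node, 28 leaving); the function name is an assumption. Note that a plain ratio of these counts gives roughly 68.3% for leaving, slightly below the 70.7% shown in the annotation, so the tool's figure presumably is not derived from these raw counts alone.

```python
def node_summary(total_cases, node_cases, leave_cases):
    """Summarise one decision tree node from raw case counts."""
    stay_cases = node_cases - leave_cases
    return {
        "coverage": node_cases / total_cases,      # share of all cases in this node
        "leave_share": leave_cases / node_cases,   # share of node cases that leave
        "stay_share": stay_cases / node_cases,     # share of node cases that stay
    }


summary = node_summary(total_cases=705, node_cases=41, leave_cases=28)
for name, value in summary.items():
    print(f"{name}: {value:.1%}")
```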
Data Mining – Decision Trees
• Create an association model
• Define model columns
• Train the model
• Predictions using training results
• Using the data mining results against a BW query
Source: SAP
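The steps above mention an association model but not how it scores rules. As a hedged illustration, the sketch below computes the two standard measures for an association rule, support and confidence, over a handful of made-up transactions. The function names and the basket contents are assumptions; nothing here reflects how SAP BW implements association analysis.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= basket for basket in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Of the transactions containing `antecedent`, the share that also contain `consequent`."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical purchase baskets.
baskets = [
    {"printer", "paper"},
    {"printer", "paper", "toner"},
    {"printer", "toner"},
    {"paper"},
]

print(f"support(printer, paper)  = {support(baskets, {'printer', 'paper'}):.2f}")
print(f"confidence(printer -> paper) = {confidence(baskets, {'printer'}, {'paper'}):.2f}")
```

A rule is usually kept only if both measures exceed chosen thresholds; the thresholds themselves are a modeling decision.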
Data Mining – Association
Source: SAP
Data Mining – Cluster Analysis
Source: SAP
Viewing Cluster Analysis Results
Source: SAP
Viewing Cluster Analysis Results
• Results uploaded into BW
• Then BEx for further analysis
Source: SAP
SAP Data Mining
SAP BW - IBM Intelligent Miner
Agenda
ERPs and Data Mining: Good and the Bad News
• Good News
– Known Business Processes
– Few data Sources
– Improved Data Quality
– Metadata Integration
– Near real-time data mining
– Closed-loop Knowledge Discovery
– Consistent Infrastructure
• Bad News
– Complex Data Structures
– Performance
– Availability
– Very few Data Mining algorithms - Today
CRISP-DM
1. Business Understanding
2. Data Understanding
3. Data Preparation
4. Modeling
5. Evaluation
6. Deployment
Data Mining Process and ERP Data Mining
[Figure: CRISP-DM process diagram]
• Business Understanding
• Data Understanding
• Data Preparation
• Modeling
• Evaluation
• Deployment
Will reduce data mining project time by up to 50%
Source: http://www.crisp-dm.org/
Agenda
INFORMATION FRAMEWORKS
Executive and Senior IT Management Consulting
• Enterprise Information Architectures (EIA)
• Business Case Development
• Information Architecture Application Deployment Architectures implementation
• Legacy Application Migration Strategies
• ERP Application deployment strategies
• Tool/technology/Vendor assessment and selection
• Data Warehouse, Data Marts, Analytics, Information Delivery Deployment Architectures
• Business Intelligence and eBusiness Integration architectures
• Portals Strategies, Business case, Assessment, Architectures, Modeling, Planning and knowledge Transfer
Knowledge Transfer
• Seminars
• Webinars
• Keynotes
• Panel Moderator
• Publications
• Hands-on training
• Conferences
Software and Solution Vendors Consulting
• Technology/Solution Assessment
• Product Strategy
• Solution Strategy
• Product Positioning
• Competitive Analysis
• Software product architecture
• Marketing Strategy
• Product Performance and Benchmarking
• Hardware Configuration
http://infoframeworks.com
Questions
Naeem Hashmi
Chief Technology Officer
September 10, 2002
Email: nhashmi@infoframeworks.com
Web Site: http://infoframeworks.com
Tel: 603-432-4550