Business Intelligence
1st Lecture
Iraklis Varlamis
About the course
• Office: 5.1
• Email: varlamis@hua.gr
• Eclass: http://eclass.hua.gr/courses/DIT161/
Reading
Course Outline
Case studies
• 5th: Introduction to Graph/Network Mining
• 6th: Measuring networks and the random graph model
• 7th: A graph processing library
• 8th: Social Recommender systems
...
• 12th: Presentation of assignments
Grading system
• What is graded:
Definitions
Definition and concepts
Multidimensional Data Analysis
Data mining
Decision support systems
Digital Dashboards
• Dashboards:
– Provide rapid access to timely information.
– Provide direct access to management reports.
– Are very user-friendly and supported by graphics.
The management cockpit
From Data to Knowledge
Evolution of sciences
The Data Gap
Why do we need data analysts
What else is data mining
Business intelligence (BI)
Source: http://decision-quality.com/
Stages of BI creation
Data Exploration
Statistical Summary, Querying, and Reporting
What is Business Intelligence?
http://www.microstrategy.com/Solutions/5Styles/
Market interest
• Companies that develop BI software
• http://apandre.wordpress.com/market/
Scientific interest – Big Data
• Volume
– Scalable algorithms
• Variety
– Multidimensional data, e.g. DNA microarray data with a few tens of thousands of features
– Spatial, spatiotemporal data, time-series data
– Web data, multimedia
– Graphs and hypergraphs in social networks
• Velocity
– Data streams, sensor data
• Veracity
– We are not always sure about data accuracy (e.g. GPS data)
VALUE
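To make the "Velocity" point concrete, here is a minimal sketch of a one-pass (streaming) computation: it updates the mean and variance as each reading arrives and never stores the stream. The class name RunningStats, the generator sensor_stream and the readings are illustrative assumptions, not part of the course material; the update rule is Welford's online method.

```python
class RunningStats:
    """Online mean and variance (Welford's method): one pass, constant memory."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        # Fold one new reading into the running statistics.
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        # Sample variance; returns 0.0 when fewer than two readings were seen.
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0


def sensor_stream():
    # Hypothetical sensor readings, hard-coded for illustration only.
    yield from [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]


stats = RunningStats()
for reading in sensor_stream():
    stats.update(reading)

print(stats.n, stats.mean, stats.variance)
```

Because each update touches only three numbers, the same code works whether the stream holds eight readings or eight billion.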
Data mining for BI
Data mining steps
(Figure, read from bottom to top: Databases → Data Cleaning and Data Integration → Task-relevant Data → Data Mining → Pattern Evaluation)
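The steps above can be sketched as a small pipeline, one function per step. This is only an illustration under stated assumptions: the two toy "databases", all function names, the choice to integrate before cleaning, and the 0.5 support threshold are hypothetical and do not come from the slides.

```python
from collections import Counter
from itertools import combinations

# Hypothetical "databases": two sources that store purchases differently.
store_db = [
    {"customer": "a", "items": ["camera", "memory card"]},
    {"customer": "b", "items": ["camera", "memory card", "tripod"]},
    {"customer": "c", "items": None},  # dirty record
]
web_db = [
    ("d", "camera,memory card"),
    ("e", "tripod"),
    ("f", ""),  # dirty record
]


def integrate(store, web):
    """Data integration: bring both sources to one schema (customer, items)."""
    rows = [(r["customer"], r["items"]) for r in store]
    rows += [(cust, items.split(",") if items else None) for cust, items in web]
    return rows


def clean(rows):
    """Data cleaning: drop records whose item list is missing or empty."""
    return [(cust, items) for cust, items in rows if items]


def select_task_relevant(rows):
    """Task-relevant data: keep only the item sets (baskets)."""
    return [frozenset(items) for _, items in rows]


def mine_pairs(baskets):
    """Data mining: count how often each pair of items is bought together."""
    counts = Counter()
    for basket in baskets:
        counts.update(combinations(sorted(basket), 2))
    return counts


def evaluate(counts, n_baskets, min_support=0.5):
    """Pattern evaluation: keep pairs whose support reaches the threshold."""
    return {
        pair: count / n_baskets
        for pair, count in counts.items()
        if count / n_baskets >= min_support
    }


baskets = select_task_relevant(clean(integrate(store_db, web_db)))
patterns = evaluate(mine_pairs(baskets), len(baskets))
print(patterns)
```

Each function consumes only the output of the previous one, which mirrors the staircase in the figure: a weak step early on (for example, skipping cleaning) directly changes what the mining step can find.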
Main principles
Sampling
Pattern mining
Common architecture for DW and DM
(Figure: layered architecture, top to bottom)
• Layer 4, User Interface: User GUI API (mining query in, mining result out)
• Layer 3, OLAP/OLAM: OLAM Engine, OLAP Engine
• Layer 2, MDDB: MDDB, Meta Data
Discussion
Data mining tasks (1)
Data mining tasks (2)
• Cluster analysis
– Group samples into new, previously unknown groups, e.g. group homes that are for sale and study the characteristics of each group
– Aim to maximize the similarity within groups and the dissimilarity between groups
• Outlier analysis
– Exceptional samples behave completely differently from all other samples
– Is it noise, an error, or an exception?
• Trend analysis
– Trends and variations: e.g. regression analysis (find a function that describes the data, then find the data that deviate far from it)
– Mining sequential patterns: e.g. searching for cameras → searching for a memory card
– Periodicity analysis
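A minimal sketch of two of the tasks above, in plain Python. The first function groups values around a fixed number of centers (a one-dimensional k-means, used here as one possible clustering method, not necessarily the one taught in the course). The other two fit a straight line by least squares and flag points that deviate far from it, which combines the trend and outlier bullets. The home prices, sales figures, starting centers and the threshold k=2.0 are all made-up assumptions.

```python
def kmeans_1d(values, centers, iterations=10):
    """Cluster analysis: assign each value to its nearest center, then re-center."""
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # An empty group keeps its previous center.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups


def fit_line(xs, ys):
    """Trend analysis: least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def find_outliers(xs, ys, k=2.0):
    """Outlier analysis: indices whose residual exceeds k residual standard deviations."""
    slope, intercept = fit_line(xs, ys)
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
    std = (sum(r * r for r in residuals) / len(residuals)) ** 0.5
    return [i for i, r in enumerate(residuals) if abs(r) > k * std]


# Hypothetical prices of homes for sale (in thousands).
prices = [100, 110, 120, 300, 310, 320]
centers, groups = kmeans_1d(prices, centers=[100, 320])

# Hypothetical monthly sales: a steady upward trend with one odd month.
months = list(range(10))
sales = [10, 12, 14, 16, 18, 50, 22, 24, 26, 28]
outliers = find_outliers(months, sales)

print(centers, groups, outliers)
```

Whether the flagged month is noise, an error, or a genuine exception cannot be decided by the code; that judgment needs domain knowledge.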
Discussion
Market analysis