Lecture Notes
https://www.youtube.com/watch?v=KgjUsie50WQ
or
https://www.youtube.com/watch?v=hO13N2b-gXE
Lecture 5 and 6 — History, Types, and Slides
History of Data Warehouse
A data warehouse helps users understand and improve their
organization's performance. The need to warehouse data evolved as computer
systems became more complex and had to handle increasing amounts of
information. However, data warehousing is not a new idea.
1960 - Dartmouth and General Mills, in a joint research project, develop the
terms dimensions and facts.
1970 - ACNielsen and IRI introduce dimensional data marts for retail sales.
1983 - Teradata Corporation introduces a database management system
specifically designed for decision support.
Data warehousing started in the late 1980s, when IBM workers Paul Murphy
and Barry Devlin developed the Business Data Warehouse.
However, the concept itself is credited to Bill Inmon, who is considered the
father of the data warehouse. He has written on a variety of topics covering
the building, usage, and maintenance of the warehouse and the Corporate
Information Factory.
OLAP
Online Analytical Processing (OLAP) is a category of software tools that provide
analysis of data for business decisions. OLAP systems allow users to analyze
database information from multiple database systems at one time.
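As a rough illustration of analyzing data from more than one source at a time, the sketch below combines rows from two made-up source systems and sums a sales measure along chosen dimensions (a "roll-up"). The names `store_db`, `web_db`, and `roll_up`, and all the figures, are hypothetical and exist only for this example; real OLAP tools work against actual databases and offer far more operations.

```python
from collections import defaultdict

# Hypothetical fact rows pulled from two separate source systems.
store_db = [
    {"region": "East", "quarter": "Q1", "sales": 120},
    {"region": "East", "quarter": "Q2", "sales": 150},
    {"region": "West", "quarter": "Q1", "sales": 90},
]
web_db = [
    {"region": "East", "quarter": "Q1", "sales": 30},
    {"region": "West", "quarter": "Q2", "sales": 60},
]


def roll_up(rows, dimensions):
    """Sum the 'sales' measure, grouped by the given dimension names."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[d] for d in dimensions)
        totals[key] += row["sales"]
    return dict(totals)


# Analyze both sources together, at two levels of detail.
all_rows = store_db + web_db
by_region = roll_up(all_rows, ["region"])
by_region_quarter = roll_up(all_rows, ["region", "quarter"])

print(by_region)
print(by_region_quarter)
```

Grouping by fewer dimensions gives a coarser summary; adding a dimension such as the quarter drills down into the same data.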
Assignment 1
What is the difference between a data warehouse and data mining?
Some kind of explanation of Data Warehouse.
Problem: Heterogeneous Information
Different interfaces
Different data representations
Duplicated and inconsistent information
Solution
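A minimal sketch of one possible approach, assuming the integration step that these notes attribute to data warehousing: each source is mapped into a single common representation, and duplicates are collapsed on a shared key. The two source layouts, the field names, and the rule that the first source wins on duplicates are all assumptions made up for this illustration.

```python
from datetime import datetime

# Hypothetical records from two source systems describing the same customers.
# The systems use different field names, ID types, name order, and date formats.
crm_rows = [
    {"cust_id": "17", "name": "Ada Lovelace", "joined": "2021-03-05"},
    {"cust_id": "18", "name": "Alan Turing", "joined": "2020-11-30"},
]
billing_rows = [
    {"CustomerNo": 17, "FullName": "LOVELACE, ADA", "Since": "05/03/2021"},
    {"CustomerNo": 19, "FullName": "HOPPER, GRACE", "Since": "12/01/2019"},
]


def from_crm(row):
    """Map a CRM row into the common representation."""
    return {
        "id": int(row["cust_id"]),
        "name": row["name"],
        "joined": datetime.strptime(row["joined"], "%Y-%m-%d").date(),
    }


def from_billing(row):
    """Map a billing row ("LAST, FIRST", day/month/year) into the common representation."""
    last, first = [part.strip().title() for part in row["FullName"].split(",")]
    return {
        "id": row["CustomerNo"],
        "name": f"{first} {last}",
        "joined": datetime.strptime(row["Since"], "%d/%m/%Y").date(),
    }


def integrate(crm, billing):
    """Combine both sources, keeping one record per customer ID."""
    merged = {}
    for rec in [from_crm(r) for r in crm] + [from_billing(r) for r in billing]:
        merged.setdefault(rec["id"], rec)  # assumption: first source wins on duplicates
    return [merged[key] for key in sorted(merged)]


customers = integrate(crm_rows, billing_rows)
for customer in customers:
    print(customer)
```

Customer 17 appears in both sources but only once in the result, and every record ends up with the same field names and date type.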
-- END --
Lecture 7 and 8 — Data Mining
The key features of data mining are listed below:
1. Automatic discovery of patterns
2. Prediction of likely outcomes
3. Creation of actionable information
4. Focus on large data sets and databases
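To make "automatic discovery of patterns" concrete, here is a small sketch that counts which pairs of items are bought together often enough to count as a pattern. It illustrates only that one feature, not prediction. The shopping baskets, the `frequent_pairs` helper, and the `min_support` threshold are hypothetical choices for this example; practical data mining uses much larger data sets and more refined algorithms.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets; each set is one transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"eggs", "milk"},
    {"bread", "butter"},
    {"bread", "butter", "milk"},
]


def frequent_pairs(baskets, min_support):
    """Return item pairs that occur together in at least min_support baskets."""
    counts = Counter()
    for basket in baskets:
        # Sort so that each pair is always counted under the same key.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


patterns = frequent_pairs(transactions, min_support=3)
print(patterns)
```

A result such as "bread and butter are often bought together" is the kind of output that can then be turned into actionable information, for example a shelf layout decision.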
Data warehousing | Data mining
It is a process used to integrate data from multiple sources and then combine it [...] | It is the process used to extract useful patterns and relationships from a [...] patterns.
This process must take place before data [...] organizes data into a common database. | This process always takes place after data [...] compiled data to extract useful patterns.
This process is solely carried out by [...] | This process is carried out by business [...]