ARMAN SHAIKH
MMS SYSTEMS
M020
Website
Social Media
Billing
ERP
CRM
RFID
Network Switches
CHARACTERISTICS OF BIG DATA
Volume: Data Quantity
Velocity: Data Speed
Variety: Data Type
DATA IN 2013
[Figure: share of data by region. Labels: United States, China, India, Western Europe. Values as extracted, first series: 32%, 19%, 32%, 13%, 4%; second series: 23%, 35%, 21%, 15%, 6%.]
FACTS
By 2016, the cumulative size of all the world's data centers is expected to reach 16,000 acres.
HADOOP
Open-source software framework from Apache
Inspired by Google MapReduce and GFS (Google File System)
HDFS
Map/Reduce
EDITIONS OF HADOOP
Enterprise Edition
Enterprise class
Licensed
Application accelerators
Pre-built applications
Text analytics
Spreadsheet-style tool
RDBMS, warehouse connectivity
Basic Edition
Administrative tools, security
Free download
Eclipse development tools
Performance enhancements
Integrated install
Online Info Center
Big Data Univ.
Apache Hadoop
Breadth of capabilities
MAP REDUCE
Allows massive scalability across Hadoop servers
[Figure: INPUT DATA -> MAP (x3) -> SHUFFLE -> REDUCE (x2) -> RESULT]
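
The map, shuffle, and reduce stages named on this slide can be illustrated with a small word count. This is only a single-process sketch in plain Python, not Hadoop's API: the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample input are hypothetical, and a real cluster would run each map and reduce task on a separate server.

```python
from collections import defaultdict


def map_phase(chunk):
    """Map: emit a (word, 1) pair for every word in one input chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle_phase(mapped_chunks):
    """Shuffle: group emitted values by key across all mappers' output."""
    groups = defaultdict(list)
    for pairs in mapped_chunks:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(chunks):
    # One map task per input chunk; in this sketch they run sequentially.
    mapped = [map_phase(chunk) for chunk in chunks]
    return reduce_phase(shuffle_phase(mapped))


if __name__ == "__main__":
    # Hypothetical input, split into three chunks as if for three mappers.
    print(word_count(["big data big", "data speed", "big volume"]))
```

Because each map task only sees its own chunk and each reduce task only sees the values for its own keys, the work can be spread over more servers as the input grows, which is the scalability the slide refers to.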
HDFS tolerates disk failures by storing multiple copies of each data block on different servers.
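
The effect of keeping several copies of each block can be shown with a toy simulation. This is a sketch under stated assumptions, not HDFS code: the round-robin placement, the function names (place_replicas, readable_blocks), and the server names are hypothetical, and the replication factor of 3 is assumed here for illustration. HDFS's real placement policy is more involved.

```python
def place_replicas(num_blocks, servers, replication=3):
    """Assign each block to `replication` distinct servers (toy round-robin)."""
    if replication > len(servers):
        raise ValueError("need at least as many servers as replicas")
    placement = {}
    for block in range(num_blocks):
        # Consecutive offsets modulo the server count never repeat a server
        # as long as replication <= len(servers).
        placement[block] = [
            servers[(block + i) % len(servers)] for i in range(replication)
        ]
    return placement


def readable_blocks(placement, failed):
    """Return the blocks that still have at least one copy on a live server."""
    return [
        block
        for block, hosts in placement.items()
        if any(host not in failed for host in hosts)
    ]


if __name__ == "__main__":
    servers = ["s1", "s2", "s3", "s4"]  # hypothetical server names
    placement = place_replicas(5, servers)
    # Every block stays readable after any two servers fail.
    print(readable_blocks(placement, failed={"s1", "s2"}))
    # A block is lost only when all three servers holding it have failed.
    print(readable_blocks(placement, failed={"s1", "s2", "s3"}))
```

With three copies per block, any two servers can fail without losing data; a block becomes unreadable only when every server holding one of its copies is down.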
WHAT IS NOSQL?
Health care
Telecom
Traffic control
Trade analytics
Manufacturing
IBM NETEZZA
ORACLE EXADATA
FRACTAL CONCORDIA
SAS ADVANCED ANALYTICS