Vous êtes sur la page 1sur 15

Big Data Analytics with Tableau

Ben Sullins
Bensullins.com
ben@bensullins.com

Outline
Introduction to Big Data
Massively Parallel Processing (MPP) Databases
Cloud Big Data sources
Accessing Big Data with Tableau
Visualizing your Big Data with Tableau
Sharing your work

Why Big Data? - New Data Types


Business
Data
3rd Party
Apps

Web
Logs
BI &
Analytics

Sensor
Data

Videos
Images

Why Big Data? Massive Content

Why Big Data? Variety of Data


Data Volume Growth

Why Big Data? Storage is Cheap!


Hard Drive Costs per GB since 1980

Why Big Data? Reduced Cycle Time

Requirements

RDBMS
Data Store

NOSQL
Data Store

Development

App
Design

App Design

Data
Model

Testing

App and
DB Dev

App Dev

Deployment

Deploy

Deploy

Why Big Data?


Facebook User Growth post 500M mark

Big Data
What it IS

What it IS NOT

Unstructured

Transactional

Petabytes+

Simple or Easy

Evolution of RDBMS

Structured DW

Many Platforms

One Platform

Difficult for Analytics

Easy or Fast for Analytics

Big Data Scalability


Availability

SQL Server,
MySQL
PostgreSQL
Oracle

Cassandra
CouchDB

Consistency &
Availability

Availability &
Partitioning

You can have only 2,


never 3

Consistency

Consistency &
Partitioning
Hbase
MongoDB

Partitioning

Big Data Storage Key Value Pairs

Simple

More Advanced

Key

Value

Key

Value

Color

Blue

FacebookUser12345_Color

Blue

Libation

Beer

FacebookUser12345_Libation

Beer

Favorite
Animal

Shark

TwitterUser1234_FavoriteAnim
al

Shark

FacebookUser1234_ProfilePhot
o

/img/profile.jpg

Platforms

Azure Table Storage, Redis, MemcacheDB, HamsterDB,


DynamoDB

Big Data Storage Column Family Stores


Key
Data

Platforms

FacebookUser1234

Favorite color

Blue

Hometown

Phoenix

Current Location

San Diego

Favorite Animal

Shark

Hbase, CassandraDB, Amazon SimpleDB

Big Data Storage Document Stores

Data

JSON
Document

original

Data

copy
copy

Data

Data

Platforms

HDFS, ElasticSearch, CouchDB

Big Data Storage Graph Databases


Node

Edges

Node

ID: 2
Name: Ben
Age: 32

ID: 1
Name: Stella
Age: 28

ID: 3
Type: Group
Name: Geeks

Node

properties

Big Data Platforms Platform Vendors

Vous aimerez peut-être aussi