Académique Documents
Professionnel Documents
Culture Documents
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
BigDataLite Demonstration VM
Demo / Training VM downloadable from OTN
Contains Cloudera Hadoop 4.5 + Oracle Big Data Connectors
Similar to setup on Oracle BDA
Contains OBIEE enabling technologies:
Apache Hive (SQL access over Hadoop)
Apache HDFS (file storage)
Oracle Direct Connector for HDFS
Oracle R Advanced Analytics for Hadoop
Great way to get started with Hadoop
Requires 8GB RAM, modern laptop etc
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
Hive Driver
(Compile
Optimize, Execute)
Metastore
Managed Tables
External Tables
/user/hive/warehouse/
/user/oracle/
/user/movies/data/
E : info@rittmanmead.com
W : www.rittmanmead.com
Map
Task
Map
Task
SELECT a, sum(b)
FROM myTable
WHERE a<100
Map
Task
GROUP BY a
Reduce
Task
Reduce
Task
Result
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
1
3
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
Aggregates
Data Warehouse
Detail-level
Data
E : info@rittmanmead.com
W : www.rittmanmead.com
Exalytics
In conjunction with a well-tuned data warehouse, Exalytics adds an in-memory analysis layer
Based around Oracle TimesTen for Exalytics, Oracles In-Memory Database
Aggregates are recommended based on query patterns, automatically created in TimesTen
Summary Advisor makes recommendations, which adapt as queries change
Meant to be plug-and-play - no need for
expensive data warehouse tuning
TimesTen
BI Server
So can we use this for speeding-up Hadoop/Hive queries?
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
BI Server
Presentation Svr
Cloudera Impala
ODBC Driver
Impala
Impala
Hadoop
Hadoop
HDFS etc
Hadoop
HDFS etc
Impala
Hadoop
HDFS etc
Impala
E : info@rittmanmead.com
W : www.rittmanmead.com
HDFS etc
Impala
Hadoop
HDFS etc
Multi-Node
Hadoop Cluster
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com
E : info@rittmanmead.com
W : www.rittmanmead.com