




Massively scalable, open infrastructure
to store and manage big data
Industry-leading security, performance
and the most comprehensive big data
tool set on the market all bundled in an
easy to deploy appliance.
Big Data Connectors delivers load rates
of up to 15TB per hour between Big
Data Appliance and Oracle Exadata
Cloudera's comprehensive software
suite including Cloudera Distribution
including Apache Hadoop (CDH 4.x and
5.x) and Apache Spark
Oracle Enterprise Manager combined
with Cloudera Manager simplifies
management of the entire Big Data Appliance
Advanced analytics with Oracle R
directly interacting with data stored in
Handle low-latency unstructured
workloads with the pre-installed and
configured Oracle NoSQL Database
Community Edition
InfiniBand connectivity between nodes
and across appliances as well as to
Oracle Exadata and Oracle Exalytics
Flexible configuration choices for
optimizing both floor space and growth
path for Hadoop and Oracle NoSQL

Oracle Big Data Appliance X4-2 is a comprehensive Big Data platform,
engineered for secure data processing with a low overall total cost of
ownership. It is optimized for both batch and real-time processing utilizing
Cloudera's Distribution for Apache Hadoop (versions 4.x and 5.x) supporting
YARN and MR2, Oracle NoSQL Database, Apache Spark, Cloudera Impala and
Cloudera Search to satisfy diverse computing requirements. Built using
industry-standard hardware from Sun, Big Data Appliance X4-2 delivers the
perfect balance between compute power, I/O bandwidth and memory footprint
offering 33% more storage capacity than the previous generation appliance.
Big Data Appliance X4-2 provides a highly optimized platform with integrated
management capabilities that allows you to derive value quickly with lower risk.
Comprehensive Big Data Platform
Oracle Big Data Appliance is an open, multi-purpose big data platform. It is optimized to run
a diverse set of workloads including batch processing jobs as well as interactive
applications. Apache Hadoop's MapReduce framework (including YARN and MR2) powers
the batch capabilities, processing massive volumes of data with linear scalability. There are
several options for interactive applications, each with its own properties. Oracle
NoSQL Database is a distributed key-value database. It is designed to be highly available and
extremely scalable with predictable levels of throughput and latency. Cloudera Impala
provides real-time SQL queries over data stored in HDFS enabling business intelligence
tools to access data in Hadoop without requiring MapReduce processing. Apache Spark
enables processing of large data sets in memory. Finally, Cloudera Search offers full-text
interactive search over data stored in HDFS with results delivered using a faceted navigation

In addition to providing the full Cloudera software platform, Big Data Appliance utilizes
Oracle Big Data Connectors to simplify data integration and analytics. Big Data Connectors
provide high speed access to data in Hadoop from Oracle Exadata and Oracle Database with
data transfer rates on the order of 15 TB/hour. Big Data Connectors also enable integrated,
highly scalable analytics to run on Big Data Appliance providing native access to Hadoop
data and parallel processing using Oracle R Distribution. Finally, Oracle XQuery for Hadoop
is a new capability that enables standard XQuery operations to process and transform
documents in various formats (JSON, XML, Avro and others), executing in parallel across the
Hadoop cluster.
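
To put the quoted transfer rate in perspective, the arithmetic can be sketched as follows. This is a rough illustration only: it assumes decimal units (1 TB = 1,000 GB) and treats 15 TB/hour as a sustained rate, which real loads may not reach. The function names and the 100 TB example data set are hypothetical and do not come from this data sheet.

```python
# Back-of-the-envelope arithmetic for the quoted ~15 TB/hour load rate.
# Assumptions (not from the data sheet): decimal units, a sustained rate,
# and a hypothetical 100 TB data set.

QUOTED_RATE_TB_PER_HOUR = 15.0


def rate_in_gb_per_second(tb_per_hour: float) -> float:
    """Convert TB/hour to GB/second using decimal units (1 TB = 1000 GB)."""
    return tb_per_hour * 1000.0 / 3600.0


def hours_to_load(dataset_tb: float,
                  tb_per_hour: float = QUOTED_RATE_TB_PER_HOUR) -> float:
    """Estimate the hours needed to move dataset_tb at a sustained rate."""
    if tb_per_hour <= 0:
        raise ValueError("rate must be positive")
    return dataset_tb / tb_per_hour


if __name__ == "__main__":
    print(f"{rate_in_gb_per_second(QUOTED_RATE_TB_PER_HOUR):.2f} GB/s")
    print(f"{hours_to_load(100.0):.2f} hours for a 100 TB data set")
```

At the quoted rate this works out to a little over 4 GB/s, so a 100 TB data set would take between six and seven hours under these assumptions.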

The big data domain is marked by continuous innovation; Big Data Appliance embraces these
innovations by providing an open environment without compromising tight integration and
enterprise-level support. Organizations are free to deploy external software to support new
functionality such as graph analytics, natural language processing and fraud detection to
meet the needs of the application. Support for non-Oracle components is delivered by their
respective support channels and not by Oracle.

Big Data Appliance X4-2 Software
Integrated Software
Oracle Linux 6.4 with Unbreakable Enterprise Kernel
Optimized, Complete and Secure Big
Data Solution
Most comprehensive big data tool set
integrated in a single appliance
Integrated with Oracle Exadata to
analyze all your data
Risk-free installation and rapid time to
Simplified operations, updates and
patch management through a single
command utility of the entire stack (OS,
Java, Oracle NoSQL Database and the
Cloudera stack)
Single Management Console integrating
Big Data Appliance hardware and
software monitoring through Oracle
Enterprise Manager
Single-vendor support for your entire big
data solution covering both hardware
and software

Oracle Java JDK 7
Cloudera Software 4.x and 5.x
Cloudera's Distribution including Apache Hadoop (CDH) with support for YARN* and MR2*
Cloudera Impala
HBase (as well as support for Accumulo)
Cloudera Search
Apache Spark*
Cloudera Manager including:
Cloudera Back-up and Disaster Recovery (BDR)
Cloudera Navigator
Oracle R Distribution
Oracle NoSQL Database Community Edition**
Oracle Big Data Appliance Enterprise Manager Plug-In
Optional Software (separately licensed)
Oracle Big Data Connectors
Oracle SQL Connector for Hadoop
Oracle Loader for Hadoop
Oracle XQuery for Hadoop
Oracle R Advanced Analytics for Hadoop
Oracle Data Integrator Application Adapter for Hadoop
Oracle Audit Vault and Database Firewall for Hadoop Auditing
Oracle Data Integrator
Oracle NoSQL Database Enterprise Edition
* Apache Spark, YARN and MR2 are features included with and supported on CDH 5.x
** Support for Oracle NoSQL Database Community Edition is not a part of Big Data Appliance. It is a separately
purchased component.
Lower TCO than Do-it-Yourself Hadoop
Oracle Big Data Appliance lowers the total cost of ownership of a big data platform when
compared to a DIY system. Not only are the costs of an initial deployment lower with Big
Data Appliance, but more significantly, so are the ongoing costs of maintenance, optimization
and system growth.

Big Data Appliance provides unique pricing to dramatically reduce the three- to four-year TCO
when compared to a DIY big data platform. Big Data Appliance bundles the hardware
(servers, high-speed networking, power distribution units and peripherals), OS support and
subscription costs for the Cloudera software into a single price for the life of the system. A
single support license covers both the hardware and the integrated software.

Organizations do not want to spend valuable intellectual capital assembling and tuning an
optimized Hadoop/NoSQL infrastructure, especially when these resources can be applied to
delivering high value business solutions. Big Data Appliance delivers a pre-configured, highly
tuned environment out-of-box for Apache Hadoop and Oracle NoSQL Database. This
optimized environment enables companies to focus their resources on developing compelling
business applications lowering the risk for the solution. Additionally, the pre-tuned
environment avoids extensive ramp-up time for new applications due to performance and
production issues.
Simplified Operations
Oracle Enterprise Manager provides a single entry point for managing the entire system, both
hardware and software, providing continuity across other Oracle products in the
organization. To provide deep management capabilities for Hadoop, Enterprise Manager
enables a context-aware integration with Cloudera Manager.

Oracle Big Data Appliance brings a low
risk, highly scalable big data platform to
the enterprise.

The following are related products
available from Oracle:
Oracle Exadata
Oracle Big Data Connectors
Oracle NoSQL Database
Oracle Exalytics
Oracle Business Intelligence Enterprise
Oracle Endeca Information Discovery
Oracle Data Integrator
Oracle Enterprise Manager

The following services are available from
Oracle Support Services:
Advanced Customer Services
Product Support Services
Consulting Services
Oracle University Courses

Big Data Appliance simplifies day-to-day operations by providing a simple one-command
installation, update, patch and expansion utility, Mammoth, which enables rapid
deployment of updates (typically quarterly) to the frequently evolving Hadoop stack without
incurring significant downtime. Mammoth also enables Oracle-tested, seamless upgrades
between Hadoop versions and automated service management to ensure the best balance
between Hadoop Master Nodes and Data Nodes.

Big Data Appliance is supported by Oracle, giving organizations a single point of support for
their hardware, all integrated software (including all Cloudera software) and any additional
Oracle software installed.
Comprehensive Security
Securing data is critical to Big Data solutions in the enterprise; Big Data Appliance provides
strong authentication, authorization and auditing of data in Hadoop out of the box.
Strong authentication is provided using Kerberos. This ensures that all users are who they
claim to be and that rogue services are not added to the system.

Big Data Appliance leverages Apache Sentry (an open-source project of which Oracle is a
founding member) to authorize SQL access via tools like Hive and Impala. By delivering and
developing Sentry, Oracle delivers Big Data Appliance with the highest data security levels
currently available for Hadoop.

Both encryption of data-at-rest and network encryption are capabilities included with Oracle
Big Data Appliance and supported by Oracle. Network encryption prevents network sniffing
from capturing protected data. Encryption of data-at-rest ensures that data on removed
physical disks is unreadable without the passphrase or trusted platform module, which is used
to encrypt the data. Both encryption features are transparent to the applications running over
HDFS and can be applied upon install or at a later time.

To ensure security and data access compliance, Big Data Appliance integrates with Oracle
Audit Vault and Database Firewall. An Oracle Audit Vault agent is pre-installed on Big Data
Appliance to track and audit data access on the Hadoop system. By leveraging Oracle Audit
Vault and Database Firewall, all auditing across the organization is consolidated into a single
audit repository ensuring a comprehensive view across all data.
Flexible Configurations
Big Data Appliance is designed to expand as your data and requirements grow. Initial big data
implementations may start with Big Data Appliance Starter Rack. This six-server rack comes
fully equipped with a complete set of switches and power distribution units (PDUs) required
for a full rack. This allows the appliance to easily and efficiently expand in six-node hardware
increments to larger configurations using the Oracle Big Data Appliance In-Rack Expansion.

In addition to upgrading within a rack, multiple racks can be connected using the
integrated InfiniBand fabric to form even larger configurations; up to 18 racks can be
connected in a non-blocking manner by connecting InfiniBand cables without the need for any
external switches. Larger non-blocking configurations are supported with additional external
InfiniBand switches; larger blocking network configurations can be supported without
additional switches.

Big Data Appliance is multitenant; it can be configured as a single cluster or as a set of
clusters. This provides the flexibility customers need when deploying development, test and
production clusters.


Big Data Appliance X4-2 Hardware
Compute/storage nodes:
Full Rack: 18 x compute/storage
Starter Rack: 6 x compute/storage
In-Rack Expansion: 6 x compute/storage

Per Node:
2 x Eight-Core Intel Xeon E5-2650 V2 Processors
64 GB Memory (individual nodes can be expanded to 512 GB)
Disk Controller HBA with 512 MB battery-backed write cache
12 x 4 TB 7,200 RPM High Capacity SAS Disks
2 x QDR (40 Gb/s) Ports
4 x 10 Gb Ethernet Ports
1 x ILOM Ethernet Port

InfiniBand leaf switches:
2 x 32 Port QDR InfiniBand Switch
32 x InfiniBand ports
8 x 10 Gb Ethernet ports
In-Rack Expansion: leverages the leaf switches from the Starter Rack

InfiniBand spine switch:
1 x 36 Port QDR InfiniBand Switch
36 x InfiniBand Ports
In-Rack Expansion: leverages the spine switch from the Starter Rack

Additional Hardware Components included:
Ethernet Administration Switch
2 x Redundant Power Distribution Units
42U rack packaging
In-Rack Expansion: leverages the administration switch, PDUs and base rack from the Starter Rack

Spares Kit Included:
2 x 4 TB High Capacity SAS disk
InfiniBand cables
In-Rack Expansion: leverages the spares kit from the Starter Rack
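
The per-node figures above can be rolled up into per-configuration totals. The sketch below is illustrative arithmetic only: it reports raw disk capacity (12 x 4 TB per node), not usable HDFS capacity, which is lower once replication and system overhead are accounted for. The names are hypothetical, and the 12-node entry assumes a Starter Rack plus one In-Rack Expansion.

```python
# Roll up the per-node hardware figures into per-configuration totals.
# Raw capacity only: usable HDFS capacity is lower and is not modeled here.

CORES_PER_NODE = 2 * 8        # 2 x eight-core processors
MEMORY_GB_PER_NODE = 64       # base memory, before any memory expansion
RAW_TB_PER_NODE = 12 * 4      # 12 x 4 TB disks

# Node counts per configuration; the 12-node entry assumes a Starter Rack
# plus a single In-Rack Expansion.
NODES_PER_CONFIGURATION = {
    "Starter Rack": 6,
    "Starter Rack + one In-Rack Expansion": 12,
    "Full Rack": 18,
}


def configuration_totals(nodes: int) -> dict:
    """Return total cores, base memory (GB) and raw disk (TB) for a node count."""
    if nodes < 1:
        raise ValueError("node count must be at least 1")
    return {
        "cores": nodes * CORES_PER_NODE,
        "memory_gb": nodes * MEMORY_GB_PER_NODE,
        "raw_disk_tb": nodes * RAW_TB_PER_NODE,
    }


if __name__ == "__main__":
    for name, nodes in NODES_PER_CONFIGURATION.items():
        print(name, configuration_totals(nodes))
```

Under these assumptions a Full Rack totals 288 cores, 1,152 GB of base memory and 864 TB of raw disk.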

Big Data Appliance X4-2 Expansions
In-Rack Expansion:
Field upgrade leveraging either a single (6 nodes) or two (2 x 6 nodes) In-Rack
Expansions. Expansion supports multiple generations of hardware.

Additional hardware included with each In-Rack Expansion:
6 x Compute node with direct attached storage as shown earlier
InfiniBand and Ethernet cables to connect all of the components

Multi-Rack Connection:
Up to 18 racks can be connected without requiring additional InfiniBand switches.
InfiniBand cables to connect 3 racks are included in the rack Spares Kits.
Additional optical InfiniBand cables are required when connecting 4 or more racks.
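
The multi-rack cabling rules above can be summarized as a small lookup. This is a minimal sketch, assuming that "cables to connect 3 racks" means any configuration of up to three racks is covered by the Spares Kits; the function and key names are hypothetical and are not part of any Oracle tool.

```python
# Summarize the multi-rack connection rules stated in this data sheet.
# Assumption: the Spares Kit cables cover any configuration of up to 3 racks.

SPARES_KIT_MAX_RACKS = 3     # cables for this many racks ship in the Spares Kits
NON_BLOCKING_MAX_RACKS = 18  # limit without additional InfiniBand switches


def multi_rack_requirements(racks: int) -> dict:
    """Return what a configuration of `racks` connected racks needs."""
    if racks < 1:
        raise ValueError("rack count must be at least 1")
    return {
        # Cables from the rack Spares Kits are enough for small configurations.
        "spares_kit_cables_sufficient": racks <= SPARES_KIT_MAX_RACKS,
        # Four or more racks need additional optical InfiniBand cables.
        "additional_optical_cables": racks > SPARES_KIT_MAX_RACKS,
        # Beyond 18 racks, staying non-blocking needs external switches.
        "external_switches_for_non_blocking": racks > NON_BLOCKING_MAX_RACKS,
    }


if __name__ == "__main__":
    for count in (2, 4, 18, 20):
        print(count, multi_rack_requirements(count))
```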

Memory Expansions
Expand the memory in any individual node or any number of nodes from 64GB per node
to 512GB per node.

Big Data Appliance X4-2 Environmental Specifications
Physical Dimensions
Height: 42U, 78.66 in (1998 mm)
Width: 23.62 in (600 mm)
Depth: 47.24 in (1200 mm)

Weight
Starter Rack: 1,037 lbs
Starter Rack + In-Rack Expansion: 1,400 lbs
Full Rack: 1,800 lbs

Power
Starter Rack: Maximum 4.2 kW, Typical 3.0 kW
Starter Rack + In-Rack Expansion: Maximum 7.7 kW, Typical 5.4 kW
Full Rack: Maximum 10.0 kW, Typical 7.0 kW

Cooling
Starter Rack: Maximum 14,052 BTU/hour, Typical 9,836 BTU/hour
Starter Rack + In-Rack Expansion: Maximum 26,411 BTU/hour, Typical 18,487 BTU/hour
Full Rack: Maximum 34,142 BTU/hour, Typical 23,940 BTU/hour

Airflow
Starter Rack: Maximum 676 CFM, Typical 473 CFM
Starter Rack + In-Rack Expansion: Maximum 1,223 CFM, Typical 856 CFM
Full Rack: Maximum 1,573 CFM, Typical 1,103 CFM
Further Environmental Specifications
Operating temperature/humidity: 5 °C to 32 °C (41 °F to 89.6 °F), 10% to 90% relative
humidity, non-condensing
Altitude Operating: Up to 3,048 m, max. ambient temperature is de-rated by 1 °C per
300 m above 900 m
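
The altitude de-rating rule can be worked through numerically. This is a minimal sketch, assuming the de-rating is applied linearly (a fraction of a degree per fraction of 300 m) rather than in whole 300 m steps, which the data sheet does not specify. The function name is hypothetical.

```python
# Maximum ambient operating temperature as a function of altitude, using the
# figures above: 32 C upper limit, de-rated by 1 C per 300 m above 900 m,
# up to a maximum operating altitude of 3,048 m.
# Assumption: linear (not stepped) de-rating.

MAX_AMBIENT_C = 32.0
DERATING_START_M = 900.0
DERATING_STEP_M = 300.0
MAX_ALTITUDE_M = 3048.0


def max_ambient_temperature_c(altitude_m: float) -> float:
    """Return the de-rated maximum ambient temperature in degrees Celsius."""
    if altitude_m > MAX_ALTITUDE_M:
        raise ValueError("altitude exceeds the 3,048 m operating limit")
    # No de-rating applies at or below 900 m.
    excess_m = max(0.0, altitude_m - DERATING_START_M)
    return MAX_AMBIENT_C - excess_m / DERATING_STEP_M


if __name__ == "__main__":
    for altitude in (0, 900, 1500, 3048):
        limit = max_ambient_temperature_c(altitude)
        print(f"{altitude} m: {limit:.2f} C")
```

For example, at 1,500 m the limit under this reading is 30 °C, and at the 3,048 m ceiling it is just under 25 °C.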

Safety: UL 60950-1 2nd Ed, EN60950-1:2006 2nd Ed, CB Scheme with all country
RFI/EMI: FCC CFR 47 Part 15 Subpart B Class A, EN 55022:2006+A1:2007 Class A,
EN 61000-3-11:2000, EN 61000-3-12:2005, ETSI EN 300 386 V1.4.1 (2008)
Immunity: EN 55024:1998+A1:2001+A2:2003

Contact Us
For more information about Oracle Big Data Appliance, visit oracle.com or call +1.800.ORACLE1 to speak to an Oracle representative.

Copyright 2013, Oracle and/or its affiliates. All rights reserved.
This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject
to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose.
We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not
be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Cloudera, Cloudera CDH, and Cloudera
Manager, Cloudera Navigator and Cloudera BDR are registered and unregistered trademarks of Cloudera, Inc.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of
SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered
trademark licensed through X/Open Company, Ltd. 0611


Safety: UL/cUL, CE, BSMI, GOST R, S-Mark, CSA C22.2 No. 60950-1-07 2nd Ed, CCC
Other: Complies with WEEE Directive (2002/96/EC) and RoHS Directive (2002/95/EC)
Typical power usage varies by application workload

Airflow must be front to back

In some cases, as applicable, regulatory and certification compliance were obtained at
the component level

Big Data Appliance Support Services
Hardware Warranty: 1 year with a 4 hour web/phone response during normal
business hours (Mon-Fri 8AM-5PM), with 2 business day on-site
response/Parts Exchange
Oracle Premier Support for Systems: Oracle Linux and integrated software
support and 24x7 with 2 hour on-site hardware service response (subject to
proximity to service center)
Oracle Premier Support for Operating Systems
Oracle Customer Data and Device Retention
System Installation Services
Software Configuration Services
System Expansion Support Services including hardware installation and
software configuration
Quarterly on-site patch deployment service
Oracle Automatic Service Request (ASR)