Vous êtes sur la page 1sur 3

Hadoop

Hadoop is an open-source software framework for storing data and running applications on clusters of
commodity hardware. It provides massive storage for any kind of data, enormous processing power
and the ability to handle virtually limitless concurrent tasks or jobs.
Hadoop History
In 2005, computer scientist Doug Reed Cutting, the creator of Apache Lucene, created Hadoop.
Hadoop was named by Doug after his sons stuffed toy elephant and is now being managed by Apache
Software Foundation.
Why is Hadoop important?

Ability to store and process huge amounts of any kind of data, quickly. Data
volume and varieties is constantly increasing especially from social media and the Internet of
Things(IoT)

Computing power. Hadoop processes big data fast by distributed computing model.
Processing power increase with usage of more number of computing nodes

Fault tolerance. In case of any hardware failure, data and application processing are
protected. Due to distributed computing model, even if one node goes down jobs are
automatically redirected to other nodes. Multiple copies of all data are stored automatically.

Flexibility. You dont have to preprocess data before storing it unlike conventional
database. You can store as much data as you want like text, images and videos and decide
how to use it later.

Low cost. The open-source framework is free and uses commodity hardware to store
large quantities of data.

Scalability. System can be easily grown to handle more data simply by adding nodes

SMAC 3.0

Trends Relationship with work


Social Who we work with
Mobile How we get to work
Analytics What we work on, the
meaning of work
Cloud Where we do the work
Social Media:
Futuristic CRMs will be able to produce trends for all those connected to you via advanced
preference analytics algorithms. These devices will be able to clearly demarcate the difference
between data of interest and redundant data. They will also increase the coverage of the
individuals for whom advertising products can be garnished based on the customers preferences.

Mobility
The next big step is the collaboration of organizations with cloud service technologies to create
custom solutions for employees to work better in the mobility space. Organizations adopting
enterprise mobility would focus on the creation of unified platform, enabling users to work on their mobile
devices irrespective of the type of device, the platform on which the device runs, and the type of
connectivity available.

Analytics
The next level of analytics awaited is prescriptive analytics. Understanding customer
requirements even before they have made a decision is the key element in this level. The aim is to
get
insight into the lives of customers, understand their lifestyles, interests and behaviors, and provide
the
appropriate products or services they need. Prescriptive analytics automatically synthesizes big
data,
multiple disciplines of mathematical sciences and computational sciences, and business rules, to
make
predictions. It then suggests decision options to take advantage of the predictions.

Cloud computing
Cloud computing is an approach to computing infrastructure that links large pools of resources to
be
shared simultaneously by multiple users.
Cloud consumption is performed by four deployment models:
1. Public cloud
2. Private cloud
3. Community cloud
4. Hybrid cloud

Reference
EY report SMAC 3.0 digital is here
http://www.sas.com/en_us/insights/big-data/hadoop.html

Vous aimerez peut-être aussi