
Hello everyone, and welcome to Hadoop Fundamentals: What is

Hadoop? My name is Warren Pettit.


In this video we will explain what Hadoop is and what Big Data is.
We will define some Hadoop-related open source projects and give
some examples of Hadoop in action.
Imagine this scenario: You have 1GB of data that you need to process.
The data is stored in a relational database on your desktop computer and
this desktop computer has no problem handling this load.
Then your company starts growing very quickly, and that data grows to
10GB.
And then 100GB.
And you start to reach the limits of your current desktop computer.
So you scale up by investing in a larger computer, and you are then OK
for a few more months.
When your data grows to 10TB, and then 100TB, you are quickly
approaching the limits of that computer.

Moreover, you are now asked to feed your application with unstructured
data coming from sources like Facebook, Twitter, RFID readers,
sensors, and so on.
Your management wants to derive information from both the relational
data and the unstructured data and wants this information as soon as
possible.
What should you do? Hadoop may be the answer!
What is Hadoop?
Hadoop is an open source project of the Apache Foundation.
It is a framework written in Java, originally developed by Doug Cutting,
who named it after his son's toy elephant.
Hadoop uses Google's MapReduce and Google File System
technologies as its foundation.
It is optimized to handle massive quantities of data, which can be
structured, unstructured, or semi-structured, using commodity hardware,
that is, relatively inexpensive computers.
This massively parallel processing is done with great performance.
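The MapReduce model mentioned above can be illustrated in miniature with plain Python. This is a toy, single-process sketch of the idea (a map step emits key-value pairs, a reduce step aggregates them per key), not the actual Hadoop API, which runs these phases in parallel across many machines:

```python
from collections import defaultdict

def map_phase(documents):
    # Map: emit a (word, 1) pair for every word in every document.
    for doc in documents:
        for word in doc.split():
            yield (word.lower(), 1)

def reduce_phase(pairs):
    # Shuffle/reduce: group the pairs by key and sum the counts.
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

docs = ["Hadoop handles Big Data", "Big Data needs Hadoop"]
word_counts = reduce_phase(map_phase(docs))
# e.g. word_counts["hadoop"] == 2, word_counts["data"] == 2
```

In real Hadoop, many mapper tasks run this map logic on separate blocks of the input at once, and the framework shuffles the intermediate pairs to reducer tasks; that distribution is what gives the performance on massive datasets.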

Hadoop replicates its data across multiple computers, so that if one goes
down, the data is processed on one of the replicated computers.
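The replication idea can also be sketched in a few lines of Python. The class and method names below are hypothetical, chosen only for this illustration (this is not HDFS code): each block is written to several nodes, and a read simply falls over to a surviving replica when a node is down:

```python
class TinyCluster:
    """Toy model of replicated block storage (not the HDFS API)."""

    def __init__(self, nodes, replication=3):
        self.nodes = {n: {} for n in nodes}  # node name -> {block_id: data}
        self.replication = replication
        self.down = set()

    def write(self, block_id, data):
        # Store the block on the first `replication` healthy nodes.
        targets = [n for n in self.nodes if n not in self.down]
        for n in targets[:self.replication]:
            self.nodes[n][block_id] = data

    def read(self, block_id):
        # Serve the block from any healthy node holding a replica.
        for n, blocks in self.nodes.items():
            if n not in self.down and block_id in blocks:
                return blocks[block_id]
        raise IOError("block lost: no healthy replica")

cluster = TinyCluster(["node1", "node2", "node3", "node4"])
cluster.write("blk_001", b"customer records")
cluster.down.add("node1")       # simulate a node failure
data = cluster.read("blk_001")  # still served, by a surviving replica
```

HDFS defaults to a replication factor of three, so a single failed machine never makes a block unreadable; the sketch shows only the failover behavior, not the block placement policy real HDFS uses.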
It is a batch operation handling massive quantities of data, so the
response time is not immediate.
Hadoop is not suitable for OnLine Transaction Processing workloads,
where structured data is randomly accessed, as in a relational
database.
Also, Hadoop is not suitable for OnLine Analytical Processing or
Decision Support System workloads, where structured data is accessed
sequentially, as in a relational database, to generate
reports that provide business intelligence.
As of Hadoop version 2.2, updates are not possible, but appends are
possible.
Hadoop is used for Big Data. It complements OnLine Transaction
Processing and OnLine Analytical Processing.
It is NOT a replacement for a relational database system.
So, what is Big Data?

With all the devices available today to collect data, such as RFID
readers, microphones, cameras, sensors, and so on, we are seeing an
explosion in data being collected worldwide.
Big Data is a term used to describe large collections of data (also known
as datasets) that may be unstructured and that grow so large and so quickly
that they are difficult to manage with regular database or statistical tools.
Other interesting statistics providing examples of this data explosion are:
There are more than 2 billion internet users in the world today,
and in 2014 there will be 7.3 billion active cell phones.
Twitter processes 7TB of data every day, and 500TB of data is
processed by Facebook every day.
Interestingly, approximately 80% of this data is unstructured.
With this massive quantity of data, businesses need fast, reliable, deeper
data insight.
Therefore, Big Data solutions based on Hadoop and other analytics
software are becoming more and more relevant.
