Hadoop Certification Exam Simulator + Study Material (Revision Notes)
The TaskTracker is responsible for instantiating and monitoring individual map and
reduce tasks.
The JobTracker is responsible for assigning individual tasks to each node.
Each Hadoop daemon runs in its own JVM; daemons do not share a JVM.
Technically a single node can run all of the daemons, but this is not advised outside of
development.
The JobTracker, NameNode, and Secondary NameNode daemons are considered master
daemons.
The DataNode and TaskTracker are slave daemons.
Input/Output
The Reducer's input key and value types must match the Mapper's output key and value types.
The Context object is used for emitting key-value pairs.
By default a hash partitioning algorithm decides which partition a key-value pair is
sent to, based on the key's hash value.
A MapReduce job can run without setting a Mapper or Reducer (the identity classes are used); the output then contains the byte offset of each line (which is not a line number) as the key and the line itself as the value.
The default input format for a MapReduce job is TextInputFormat, which produces a
LongWritable key (the byte offset of the line) and a Text value (the line contents). The default output format, TextOutputFormat, writes key and value separated by a tab.
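The byte-offset behaviour can be illustrated in plain Java. This is a standalone sketch of what TextInputFormat computes for ASCII input with `\n` line endings, not Hadoop code:

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class OffsetDemo {
    // Compute the (byteOffset, line) pairs that TextInputFormat would
    // produce for a chunk of text, assuming '\n' line endings and
    // one byte per character (plain ASCII input).
    static Map<Long, String> offsets(String text) {
        Map<Long, String> pairs = new LinkedHashMap<>();
        long offset = 0;
        for (String line : text.split("\n")) { // split drops trailing empties
            pairs.put(offset, line);
            offset += line.length() + 1;       // +1 for the newline byte
        }
        return pairs;
    }

    public static void main(String[] args) {
        // Keys jump by line length + 1, so they are offsets, not line numbers.
        System.out.println(offsets("foo\nlonger line\nbar\n"));
        // {0=foo, 4=longer line, 16=bar}
    }
}
```

Note how the second key is 4, not 2: the key is where the line starts in the file, which is why it cannot be used as a line number.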
Each partition is processed by one reduce task, so the number of partitions equals the
number of reducers.
Example of calculating the partition for a key:
o (key.hashCode() & Integer.MAX_VALUE) % numberOfReducers
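The formula can be checked in plain Java. This mirrors the rule applied by the default hash partitioner (a sketch of the calculation, not the Hadoop class itself):

```java
public class PartitionDemo {
    // Default hash-partitioning rule: mask off the sign bit so the
    // result of hashCode() is non-negative, then take the remainder
    // modulo the number of reducers.
    static int partitionFor(Object key, int numReducers) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReducers;
    }

    public static void main(String[] args) {
        // The same key always lands in the same partition, so every
        // value for that key reaches the same reducer.
        System.out.println(partitionFor("hadoop", 4));
        System.out.println(partitionFor("hadoop", 4)); // same as above
    }
}
```

The `& Integer.MAX_VALUE` step matters because `hashCode()` can be negative, and a negative remainder would index a non-existent partition.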
HDFS
HDFS (via the NameNode, which tracks DataNode heartbeats) detects any failed DataNode
and immediately schedules re-replication of blocks that have fallen below their replication factor.
With a replication factor of 3 and multiple racks, HDFS writes one copy of the block on the
local (writer's) node, a second copy on a node in a different rack, and a third copy on a
different node in that same remote rack.
MapReduce
Hive
Data Join
Map-join: a map-side join is performed in the map phase, entirely in memory. The smaller
dataset is distributed to every map task (for example via the distributed cache) and loaded
into memory, where each input record is joined by lookup. Because no shuffle or reduce
phase is needed, this gives very fast join performance.
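The idea can be sketched in plain Java: the small table sits in an in-memory map (as the distributed cache would make it available to each map task) and every record of the large input is joined by a lookup, with no sort or shuffle. This is a conceptual sketch, not Hadoop's map-join implementation:

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class MapJoinDemo {
    // Join each record of the big input against the small in-memory
    // table. rec = {userId, event}; small maps userId -> name.
    static List<String> mapSideJoin(Map<String, String> small,
                                    List<String[]> bigRecords) {
        List<String> joined = new ArrayList<>();
        for (String[] rec : bigRecords) {
            String name = small.get(rec[0]); // in-memory lookup, no shuffle
            if (name != null) {
                joined.add(name + "," + rec[1]);
            }
        }
        return joined;
    }

    public static void main(String[] args) {
        Map<String, String> users = new HashMap<>();
        users.put("u1", "alice");
        users.put("u2", "bob");
        List<String[]> events = List.of(
            new String[]{"u1", "login"},
            new String[]{"u3", "click"},   // no match in the small table
            new String[]{"u2", "logout"});
        System.out.println(mapSideJoin(users, events));
        // [alice,login, bob,logout]
    }
}
```

The constraint this illustrates is the one that makes map-side joins fast but limited: the smaller dataset must fit in each map task's memory.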
HBase
Counters are a useful channel for gathering statistics about a job, either for quality
control or for application-level statistics.
TaskCounter: updated as a task progresses.
JobCounter: updated as the job progresses.
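In a job, application-level counters are typically defined as an enum and incremented from the Mapper or Reducer; the bookkeeping idea is sketched below in self-contained plain Java (a conceptual sketch of quality-control counting, not the Hadoop Counter API):

```java
import java.util.EnumMap;

public class CounterDemo {
    // User-defined counters a job might keep for quality control.
    enum RecordQuality { GOOD, MALFORMED }

    // Tally record quality, as a mapper incrementing a counter
    // for each input record would.
    static EnumMap<RecordQuality, Long> tally(String[] records) {
        EnumMap<RecordQuality, Long> counters =
            new EnumMap<>(RecordQuality.class);
        for (String rec : records) {
            RecordQuality q = rec.contains(",") ? RecordQuality.GOOD
                                                : RecordQuality.MALFORMED;
            counters.merge(q, 1L, Long::sum); // the "increment by 1" step
        }
        return counters;
    }

    public static void main(String[] args) {
        String[] records = {"a,1", "b,2", "broken", "c,3"};
        System.out.println(tally(records)); // {GOOD=3, MALFORMED=1}
    }
}
```

In a real MapReduce job the framework aggregates these per-task increments into job-wide totals, which is what makes counters a cheap way to audit input quality across the whole dataset.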