Académique Documents
Professionnel Documents
Culture Documents
In this module, you will understand what is Big Data and Apache Hadoop, How
Hadoop solves the Big Data problems, Hadoop Cluster Architecture, Introduction to
MapReduce framework, Hadoop Data Loading techniques, and Role of a Hadoop
Cluster Administrator.
Learning Objectives–
After this module, you will understand Multiple Hadoop Server roles such as Name
Node and Data Node, and MapReduce data processing. You will also understand the
Hadoop 2.x Cluster setup and configuration, Setting up Hadoop Clients using Hadoop
2.x, and important Hadoop configuration files and parameters.
Learning Objectives – In this module, you will understand Planning and Managing a
Hadoop Cluster, Hadoop Cluster Monitoring and Troubleshooting, Analyzing logs, and
Auditing. You will also understand Scheduling and Executing MapReduce Jobs, and
different Schedulers.
Topics:
Planning the Hadoop Cluster.
Cluster Sizing.
Hardware and Software considerations.
Managing and Scheduling Jobs.
Types of schedulers in Hadoop – FIFO, FAIR SCHEDULER
Setup Queues and Pools for Jobs.
Configuring the schedulers and run MapReduce jobs.
Cluster Monitoring and Troubleshooting.
Copyright @ 2019 Learntek. All Rights Reserved. 7
Value Ads (as per latest industry standards)
Email : info@learntek.org