
Apache Hadoop, the open-source data management software that helps organizations analyze massive volumes of structured and unstructured data, is a very hot topic across the tech industry. Employed by such big-name websites as eBay, Facebook, and Yahoo, Hadoop is being tagged by many as one of the most desired tech skills for 2012 and the coming years, along with Cloud Computing.

Hadoop Training
Chennai: Weekends [June 22] / Online [July 10] / Fast Track (5 days) [July 8]
Bangalore: 5-day Fast Track training starts July 27
What will participants learn?
Attendees will learn the following topics through lectures and hands-on exercises:
    Understand Big Data and the Hadoop ecosystem
    Hadoop Distributed File System (HDFS)
    Use the MapReduce API and write common algorithms
    Best practices for developing and debugging MapReduce programs
    Advanced MapReduce concepts and algorithms
    Hadoop best practices, tips, and techniques
    Managing and monitoring a Hadoop cluster
    Importing and exporting data using Sqoop
    Leverage Hive and Pig for analysis

Intended Audience: Architects and developers who wish to build and manage a Hadoop stack, or to write, build, and maintain Apache Hadoop jobs.

Course Prerequisites: Participants should have a basic understanding of Java and Linux.

Course Content:

What is Big Data & Why Hadoop?
    Big Data characteristics, challenges with traditional systems

Hadoop Overview & its Ecosystem
    Anatomy of a Hadoop cluster, installing and configuring Hadoop
    Hands-on exercise

HDFS (Hadoop Distributed File System)
    HDFS architecture, NameNodes, DataNodes, and Secondary NameNode
    Hands-on exercise

MapReduce Anatomy
    How MapReduce works
    The Mapper & Reducer, data types, input & output formats

Developing MapReduce Programs
    Setting up the Eclipse development environment, creating MapReduce projects, debugging and unit testing
    Developing a MapReduce algorithm on a real-world scenario
    Hands-on exercises

Advanced MapReduce Concepts
    Combiner, Partitioner, Counter, compression, setup and teardown, speculative execution, zero reducer, and distributed cache

Advanced MapReduce Algorithms
    Sorting, searching, multiple inputs, chaining multiple jobs
    Joins, handling binary & unstructured data

Advanced Tips & Techniques
    Determining the optimal number of reducers, skipping bad records
    Partitioning into multiple output files & passing parameters to tasks
    Hadoop cluster sizing and capacity planning

Monitoring & Management of Hadoop
    Managing HDFS with tools like fsck and dfsadmin
    Using the HDFS & JobTracker web UI
    Routine administration procedures
    Hands-on exercises

Sqoop
    Importing data from and exporting data to an RDBMS
    Hands-on exercises: import and export

Hive
    Hive basics, internal & external tables, partitioning, buckets
    Writing queries: joins, union, dynamic partitioning, sampling
    Hands-on exercise: structured data analysis

Pig
    Pig basics, loading data files
    Writing queries: SPLIT, FILTER, JOIN, GROUP, SAMPLE, ILLUSTRATE, etc.
    Hands-on exercise: semi-structured data analysis

Setting up a Hadoop Cluster (access to a 50-node Hadoop cluster)
    Hands-on session

Hadoop Best Practices

Real-Time Case Studies

We provide Hadoop (Big Data) online and classroom training with single-node and multi-node lab environments. Interested people can contact BigDataAcademy.IN.

Duration: 45 hours (1 month)
