Hadoop Admin Responsibilities:

Responsible for implementation and ongoing administration of Hadoop infrastructure.
Aligning with the systems engineering team to propose and deploy new hardware and software environments required for Hadoop and to expand existing environments.
Working with data delivery teams to set up new Hadoop users. This job includes setting up Linux users, setting up Kerberos principals and testing HDFS, Hive, Pig and MapReduce access for the new users.
Cluster maintenance as well as creation and removal of nodes using tools such as Ganglia, Nagios, Cloudera Manager Enterprise and Dell OpenManage.
Performance tuning of Hadoop clusters and Hadoop MapReduce routines.
Monitoring Hadoop cluster job performance and carrying out capacity planning.
Monitoring Hadoop cluster connectivity and security.
Managing and reviewing Hadoop log files.
File system management and monitoring.
HDFS support and maintenance.
Working closely with the infrastructure, network, database, application and business intelligence teams to ensure high data quality and availability.
Collaborating with application teams to install operating system and Hadoop updates, patches and version upgrades when required.
Point of contact for vendor escalation.
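
The user setup task in the list above (Linux user, Kerberos principal, HDFS access test) can be sketched as a small command plan. This is only an illustration under stated assumptions: the realm EXAMPLE.COM, the /user/<name> home directory layout and the function name onboarding_commands are hypothetical, and nothing is executed, so an administrator can review the commands before running them.

```python
def onboarding_commands(user, realm="EXAMPLE.COM", hdfs_home_root="/user"):
    """Return the shell commands (as argument lists) for onboarding one user.

    Nothing is executed here. The realm and the HDFS home layout are
    assumptions for illustration; adjust them to the actual cluster.
    """
    home = f"{hdfs_home_root}/{user}"
    return [
        # 1. Create the Linux account with a home directory.
        ["useradd", "-m", user],
        # 2. Create the Kerberos principal (kadmin prompts for a password).
        ["kadmin", "-q", f"addprinc {user}@{realm}"],
        # 3. Create the HDFS home directory and hand it over to the user.
        ["hdfs", "dfs", "-mkdir", "-p", home],
        ["hdfs", "dfs", "-chown", user, home],
        # 4. Smoke test: list the directory (run as the new user after kinit).
        ["hdfs", "dfs", "-ls", home],
    ]


if __name__ == "__main__":
    # Print the plan for a hypothetical user named "alice".
    for cmd in onboarding_commands("alice"):
        print(" ".join(cmd))
```

Each entry could be passed to subprocess.run(cmd, check=True) on a host with the right privileges. Hive, Pig and MapReduce access checks would follow the same pattern but are left out here.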
DBA Responsibilities Performed by a Hadoop Administrator:
Data modelling, design & implementation based on recognized standards.
Software installation and configuration.
Database backup and recovery.
Database connectivity and security.
Performance monitoring and tuning.
Disk space management.
Software patches and upgrades.
Automate manual tasks.
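
As one example of automating a manual task from the list above, disk space management can start with a simple threshold check. The sketch below is a minimal illustration, not a prescribed tool: the 80 percent limit and the function names are assumptions, and it inspects local mount points only (HDFS capacity is reported separately, for example by hdfs dfsadmin -report).

```python
import shutil


def percent_used(total, used):
    """Return used space as a percentage of total (0.0 when total is 0)."""
    return 100.0 * used / total if total else 0.0


def paths_over_limit(paths, limit_pct=80.0, usage_fn=shutil.disk_usage):
    """Return (path, percent) pairs for paths whose usage exceeds limit_pct.

    usage_fn must return an object with .total and .used attributes, as
    shutil.disk_usage does. It is a parameter so the check can be tested
    without touching a real file system.
    """
    flagged = []
    for path in paths:
        usage = usage_fn(path)
        pct = percent_used(usage.total, usage.used)
        if pct > limit_pct:
            flagged.append((path, round(pct, 1)))
    return flagged


if __name__ == "__main__":
    # Check the root file system against the assumed 80 percent limit.
    print(paths_over_limit(["/"], limit_pct=80.0))
```

Run from cron or another scheduler, a check like this can feed an alerting tool instead of relying on someone to look at disk usage by hand.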
DWH Development Responsibilities Performed by a Hadoop Administrator:
A DWH admin's job responsibilities include developing, testing and monitoring batch jobs for the following tasks:
Ensure referential integrity.
Perform primary key execution.
Accomplish data restatements.
Load large data volumes in a timely manner.
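
To make the integrity checks in the list above concrete, here is a minimal sketch of what such a batch job might verify. It works on in-memory rows for illustration. The table and column names are hypothetical, and reading the primary key task as a uniqueness check is an assumption, not something the list spells out.

```python
def find_orphans(child_rows, parent_rows, foreign_key, primary_key):
    """Return child rows whose foreign key matches no parent primary key."""
    parent_keys = {row[primary_key] for row in parent_rows}
    return [row for row in child_rows if row[foreign_key] not in parent_keys]


def find_duplicate_keys(rows, primary_key):
    """Return the sorted primary key values that occur more than once."""
    seen, dupes = set(), set()
    for row in rows:
        key = row[primary_key]
        if key in seen:
            dupes.add(key)
        else:
            seen.add(key)
    return sorted(dupes)


if __name__ == "__main__":
    # Hypothetical sample data: one duplicate customer id, one orphaned order.
    customers = [{"id": 1}, {"id": 2}, {"id": 2}]
    orders = [
        {"order_id": 10, "customer_id": 1},
        {"order_id": 11, "customer_id": 3},
    ]
    print(find_orphans(orders, customers, "customer_id", "id"))
    print(find_duplicate_keys(customers, "id"))
```

On real data volumes the same checks would normally be expressed as queries in the warehouse itself. The functions here only show the logic.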
Now that you know about the job responsibilities of a Hadoop administrator, let's take a look at the skills required to be one.
Skills Required to Become a Hadoop Administrator:
General operational expertise such as good troubleshooting skills and an understanding of system capacity, bottlenecks, and the basics of memory, CPU, OS, storage and networks.
Hadoop skills like HBase, Hive, Pig and Mahout.
The most essential requirements: the ability to deploy a Hadoop cluster, add and remove nodes, keep track of jobs, monitor critical parts of the cluster, configure NameNode high availability, schedule and configure the cluster, and take backups.
Good knowledge of Linux as Hadoop runs on Linux.
Familiarity with open source configuration management and deployment tools such as Puppet or Chef and Linux scripting.
Knowledge of troubleshooting core Java applications is a plus.
