Académique Documents
Professionnel Documents
Culture Documents
Study Guide
1
CONTENTS
About MapR Study Guides .................................................................................................................. 3
Datasets ............................................................................................................................................ 20
After the Exam - Sharing Your Results ................................................ Error! Bookmark not defined.
2
About MapR Study Guides
MapR certification study guides are intended to help you prepare for certification by
providing additional study resources, sample questions, and details about how to take
the exam. The study guide by itself is not enough to prepare you for the exam. Youll
need training, practice, and experience. The study guide will point you in the right
direction and help you get ready.
If you use all the resources in this guide, and spend 6-12 months on your own using the
software, experimenting with the tools, and practicing the role you are certifying for, you
should be well prepared to attempt the exams.
The MapR Certified Cluster Administrator 5.1 credential is designed for Engineers and
Administrators who set up and manage clusters using MapR. The certification tests
ones ability to use MapR tools and basic Linux commands to design, manage, and
troubleshoot clusters.
3
Whats on the
Exam?
4
Section 1 Whats on the Exam?
The MapR Certified Cluster Administrator 5.1 exam is comprised of 6 exam topic sections
and 15 objectives. There are 60-80 questions on the exam. MapR exams are frequently
updated and therefore the number of exam questions can change.
MapR tests new questions on the exam in an unscored manner. This means that you may
see test questions on the exam that are not used for scoring your exam. You will not know
which items are scored and which are unscored. Unscored items are being tested for
inclusion in future versions of the exam. Unscored items have no influence on your results.
MapR exams are Pass or Fail based on cut score. Your exam results will be displayed at the
end of your exam and they will be included in your end of exam results email.
Exam Objectives
1. Installation 15%
2.2 Configure Logging options including default/local, centralized & YARN log aggregation
2.3 Locate and review log files and describe the behavior of the fair scheduler
2.4 Given a typical use case, set-up a node using an appropriate node topology
5
3. Volumes, Snapshots, and Mirrors 20%
3.3 Demonstrate how to create and schedule snapshots from MCS and CLI, restore
from a snapshot, and describe how snapshots are expired
4.1 Demonstrate how to get files into and out of the cluster using a variety of commands
like Copy Table and importtsv
4.2 Configure client access using Direct Access NFS, MapR Client, and MapR POSIX Client
5.1 Demonstrate how to diagnose disk failures, add disks, and replace a failed disks
5.2 Demonstrate add nodes, replaced failed nodes, and use maintenance windows to
maintain nodes
6.1 Demonstrate how to use common Linux commands to manage Cluster services
6.2 Demonstrate how to diagnose common cluster problems and troubleshooting steps to
correct the problem
6
Sample Questions
The following questions represent the kinds of questions you will see on the exam.
The answers to these sample questions can be found in the answer key following the
sample questions.
A. one
B. two
C. three
D. four
Q2. You create a new volume, with a replication factor of 3. You write 100 GB of data
to the volume. You take a snapshot of the volume, then you delete a 20 GB file from
your volume and take another snapshot. How much space are your snapshots
counting against your quota?
A. 0 GB
B. 20 GB
C. 80 GB
D. 100 GB
E. 120 GB
Q4. What two (2) things does the POSIX client enable you to do that you wouldnt be
able to do if you hadnt installed it?
7
Q5. You have a volume configured with the default replication factors. If the number
of replicas of that volume's data falls to 2 because a disk failed, when will the data
begin to re-replicate?
A. Almost immediately
B. After 30 minutes
C. After 1 hour
D. After 4 hours
Q6. All of your volumes are in the /data topology. Which topology could NOT be used
to safely decommission a node?
A. /decommissioned
B. /data/decommissioned
C. /decommissioned/data
D. /data-decommissioned
Q7. A cluster has the node topology illustrated here. You create a volume and assign
a volume topology of /data/rack3. To which of the nodes could that volume's data be
written?
A. Nodes 1 through 24
B. Nodes 17 through 24
C. Nodes 17 through 19
D. Any of them
8
Q8. With a Converged Community Edition Cluster, which three (3) of the following
services can be configured on multiple nodes?
A. ZooKeeper
B. JobHistoryServer
C. NodeManager
D. WebServer
E. NFS
F. CLDB
Q10. Eddie cannot connect to his MCS. Which Linux command can he use to verify
the port is open and listening?
A. ifconfig
B. route
C. netstat
D. iptables
9
Sample Question Answer Key
A. one
B. two
C. *three
D. four
Q2. You create a new volume, with a replication factor of 3. You write 100 GB of data
to the volume. You take a snapshot of the volume then you delete a 20 GB file from
your volume and take another snapshot. How much space are your snapshots
counting against your quota?
A. *0 GB
B. 20 GB
C. 80 GB
D. 100 GB
E. 120 GB
Q4. What two (2) things does the POSIX client enable you to do that you wouldnt be
able to do if you hadnt installed it?
10
Q5. You have a volume configured with the default replication factors. If the number
of replicas of that volume's data falls to 2 because a disk failed, when will the data
begin to re-replicate?
A. *Almost immediately
B. After 30 minutes
C. After 1 hour
D. After 4 hours
Q6. All of your volumes are in the /data topology. Which topology could NOT be used
to safely decommission a node?
A. /decommissioned
B. */data/decommissioned
C. /decommissioned/data
D. /data-decommissioned
Q7. A cluster has the node topology illustrated here. You create a volume and assign
a volume topology of /data/rack3. To which of the nodes could that volume's data be
written?
A. Nodes 1 through 24
B. *Nodes 17 through 24
C. Nodes 17 through 19
D. Any of them
11
Q8. With a Converged Community Edition Cluster, which three (3) of the following
services can be configured on multiple nodes?
A. * ZooKeeper
B. JobHistoryServer
C. * NodeManager
D. * WebServer
E. NFS
F. CLDB
Q9. You are creating a new cluster, and will load 200 TB of non-compressible data
into the cluster. 20 TB of the data is considered critical, and will have a replication
factor of 5. The rest of the data will use the default replication factor. About how
much total cluster storage space should you start with?
A. 640 TB
B. *800 TB
C. 900 TB
D. 1280 TB
Q10. Eddie cannot connect to his MCS. Which Linux command can he use to verify
the port is open and listening?
A. ifconfig
B. route
C. *netstat
D. iptables
12
2
Preparing for
the Certification
13
Section 2- Preparing for the Certification
MapR provides several ways to prepare for the certification
including classroom training, self-paced online training, videos,
webinars, blogs, and ebooks.
14
ADM 2000 Cluster Administration
Duration: 3 days
Cost: $2400
Course Description:
This instructor-led course teaches you real-world cluster administrator concepts and
practices, including planning, installation and configuration, load balancing and tuning
diagnosing deployment issues, performance, setting up a Hadoop cluster with direct
access NFS, snapshots, monitoring cluster health, resolving hardware issues and
troubleshooting job errors.
Syllabus
Day 1
Get Started
Lesson 1: Prepare for Installation
Lesson 2: Install a MapR Cluster
Day 2
Lesson 3: Verify and Test the Cluster
Lesson 4: Users, Groups and System Settings
Lesson 5: Configure Topology
Lesson 6: Configure Volumes
Lesson 7: Job Logs and Scheduling
Day 3
Lesson 8: Access Cluster Data
Lesson 9: Snapshots
Lesson 10: Mirrors
Lesson 11: Monitor and Manage the Cluster
Lesson 12: Disk and Node Maintenance
Lesson 13: Troubleshooting
15
On-Demand Training
16
ADM 202 - Data Access and Protection
Course Description:
This course covers how cluster data is accessed and protected. It takes an in-depth look
at the power of MapR snapshots and mirror volumes, and using Access Control
Expressions to control access to a volumes data.
Lesson 9 - Snapshots
Lesson 10 - Mirrors
17
ADM 210 Upgrade a MapR Cluster
Course Description:
This course takes you through the process of upgrading a MapR cluster, beginning with
what to include in a cluster upgrade plan, how to perform pre-upgrade testing, and then
to upgrading MapR and patching core software, ecosystem components, and MapR
clients.
18
Videos & Webinars
In addition to the classroom and self-paced training courses, we recommend these
videos, webinars, and tutorials
4. MapR Snapshots
https://www.mapr.com/resources/mapr-snapshots
8. Disaster Recovery
https://www.mapr.com/resources/disaster-recovery
19
Tutorials, blogs, and other resources
We recommend these resources to help you prepare for the MapR Certified Cluster
Administrator exam.
Datasets
These are some datasets that we recommend for experimenting with.
3. Kaggle
This site includes a collection of datasets used in machine learning competitions run
by Kaggle. Areas include classification, regression, ranking, recommender systems,
and image analysis. These datasets can be found under the Competitions section at
http://www.kaggle.com/competitions
4. KDnuggets
This site has a detailed list of public datasets, including some of those mentioned
earlier. The list is available at http://www.kdnuggets.com/datasets/index.html
5. SF Open Data
SF OpenData is the central clearinghouse for data published by the City and County
of San Francisco and is part of the broader open data program.
https://data.sfgov.org/data
20
3
Taking the Exam
21
Section 3 - Taking the Exam
MapR Certification exams are delivered online using a service from PSI. A professional
proctor will guide you into the test session. Once you are logged in for your test session,
and your webcam and desktop are shared, your proctor will launch your exam.
This method allows you to take our exams anytime, and anywhere, but you will need a
quiet environment where you will remain uninterrupted for up to two hours. You will also
need a reliable Internet connection for the entire test session.
22
Reserve a Test Session
MapR exams are delivered on a platform called PSI. When you are ready to schedule
your exam, go back to your profile in MapR Academy, click on your exam, and click the
Continue to Innovative Exams link to proceed to scheduling. Single Sign-On is used to
log you into the PSI application.
23
3) Choose a time slot at least 24 hours in advance
24
4) Once confirmed, your reservation will be in your My Exams tab of Innovative Exams
To cancel an exam reservation, log in to your MapR Academy Profile, find your exam
registration and click through to log in to the PSI/Innovative exams service. Go to My
Exams, select the exam registration and Cancel. This cancels your exam registration
only. MapR Exams are non-refundable. A cancellation confirmation email will be sent to
the examinee following the cancellation.
25
Test System Compatibility
We recommend that you check your system compatibility several days before your exam
to make sure you are ready to go. Go to
https://www.examslocal.com/ScheduleExam/Home/CompatibilityCheck
26
27
Day of the Exam
Make sure your Internet connection is strong and stable
Make sure you are in a quiet, well-lit room without distractions
Clear the room - you must be alone when taking your exam
No breaks are allowed during the exam; use the bathroom before you log in
Clear your desk of any materials, notebooks, and mobile devices
Silence your mobile and remove it from your desk
Configure your computer for a single display; multiple displays are not allowed
Close out of all other applications except for Chrome
Sign in 30 minutes in advance of your testing time so that you can communicate with
your proctor, and get completely set up well in advance of your test time.
You will be required to share your desktop and your webcam prior to the exam start.
YOUR EXAM SESSION WILL BE RECORDED. If the Proctor senses any misconduct, your
exam will be paused and you will be notified by the proctor of your misconduct.
If your misconduct is not corrected, the Proctor will shut down your exam, resulting in a
Fail.
Examples of misconduct and/or misuse of the exam include, but are not limited to, the
following:
28
After the Exam - Sharing Your Results
When you pass a MapR Certification exam, you will receive a confirmation email from
certification@mapr.com with the details of your success. This will include the title of
your certification and details on how you can download your digital certificate, and share
your certification on social media.
Your certification will be updated in your MapR Academy profile. From your profile you
can view your certificate and share it on LinkedIn.
29
Your certificate is available as a PDF. You can download and print your certificate from
your profile in learn.mapr.com.
Your credential contains a unique Certificate Number and a URL. You can share your
credential with anyone who needs to verify your certification.
If you happen to fail the exam, you will automatically qualify for a discounted exam
retake. Retakes are $100 USD and can be purchased by following the link in your exam
results email following the exam session.
Exam Retakes
If you fail an exam, you are eligible to purchase immediately and retake the exam in 14
days. Once you have passed the exam, you may not take that version (e.g., v.4.0) of the
exam again, but you may take any newer version of the exam (e.g., v.4.1).
30