Académique Documents
Professionnel Documents
Culture Documents
How should you use the pre-built extractors for a new project?
Correct.
Question: 2 of 60
Which feature of Text Analytics should you use to process Japanese or Chinese language text?
Correct.
A. Standard tokenizer
B. Multilingual tokenizer
Question: 3 of 60
What defines a relation in an AQL extractor?
A. a schema
B. a row
C. a view
D. a column
Question: 4 of 60
What advantage does the Text Analytics Web UI give you?
Question: 5 of 60
Which Text Analytics runtime component is used for languages such as Spanish and English by
breaking a stream of text into phrases or words?
A. Standard tokenizer
C. Multilingual tokenizer
D. Other extractors
Question: 6 of 60
Which AQL candidate rule combines tuples from two views with the same schema?
Correct.
A. Sequence
B. Union
C. Blocks
D. Select
Question: 7 of 60
What does a computer need to understand unstructured data?
A. attribute types
B. extractors
C. usage
D. context
Question: 8 of 60
What should you do in Text Analytics to fix an extractor that produces unwanted results?
Question: 9 of 60
How is a sequence created in Canvas?
Correct.
Correct.
A. RTF
B. CSV
C. JSON
D. TXT
Question: 11 of 60
Which basic feature rule of AQL helps find an exact match to a single word or phrase?
Correct.
A. Literals
B. Part of Speech
C. Splits
D. Dictionary
Question: 12 of 60
Which type of HBase column is mapped to multiple SQL columns?
A. Exclusive
B. Composite
C. Dense
D. Double
Question: 13 of 60
Which underlying data representation and access method does Big SQL use?
A. SMALLINT
B. Hive
C. MAP
D. TINYINT
Question: 14 of 60
Which Big SQL file format is human readable and supported by most tools, but is the least efficient
file format?
A. Avro
B. Sequence
C. Parquet
D. Delimited
Question: 15 of 60
Which type of key does HBase require in each row in an HBase table?
A. Duplicate
B. Primary
C. Foreign
D. Unique
Question: 16 of 60
What are the two types of Spark operations? (Choose two.)
(Please select ALL that apply)
Correct.
A. Actions
B. Transformations
C. Vectors
D. Sequences
E. DataFrames
Question: 17 of 60
What privilege is required to execute an EXPLAIN statement with INSERT privileges in Big SQL?
A. SQLADM authority
B. SYSMON authority
C. SECADM authority
D. SYSCTRL authority
Question: 18 of 60
What is used in a Big SQL file system to organize tables?
Correct.
A. DSM
B. JSqsh
C. schemas
D. partitions
Question: 19 of 60
Why is the SYSPROC.SYSINSTALLOBJECT procedure used with Big SQL?
Question: 20 of 60
What is required to run an EXPLAIN statement in Big SQL?
A. proper authorization
B. a rule
Question: 21 of 60
You need to create multiple Big SQL tables with columns defined as CHAR. What needs to be set to
enable CHAR columns?
Correct.
C. SET HADOOPCOMPATIBLITY_MODE=True
D. SET SYSHADOOP.COMPATIBILITY_MODE=1
Question: 22 of 60
You need to populate a Big SQL table to test an operation. Which INSERT statement is
recommended for testing, only because it does not support parallel reads or writes?
Correct.
Question: 23 of 60
Which Big SQL datatype should be avoided because it causes significant performance degradation?
A. CHAR
B. UNION
C. VARCHAR
D. STRING
Question: 24 of 60
What is missing from the following statement when querying a remote table? CREATE _______ FOR
remotetable1 ...
Correct.
A. NICKNAME
B. TABLE
C. VIEW
D. INDEX
Question: 25 of 60
You need to set up the command-line interface JSqsh to connect to a bigsql database. What is the
recommended method to set up the connection?
Question: 26 of 60
You have a very large Hadoop file system. You need to work on the data without migrating the data
out or changing the data format. Which IBM tool should you use?
Correct.
A. Big SQL
B. Pig
C. MapReduce
Question: 27 of 60
Which core component of the Hadoop framework is highly scalable and a common tool?
Correct.
A. Hive
B. Pig
C. Sqoop
D. MapReduce
Question: 28 of 60
Which action is performed prior to the Map step of a MapReduce v1 processing cycle?
Incorrect.The correct answer is :The job is broken into individual task pieces and
distributed.ExplanationDW613_Course_Guide_V2 9-7
Question: 29 of 60
What is the default replication factor for HDFS on a production cluster?
Correct.
A. 10
B. 5
C. 3
D. 1
Question: 30 of 60
Which command must be run after compiling a Java program so it can run on the Hadoop cluster?
Correct.
A. rm hadoop.class
C. jar tf name.jar
D. hadoop classpath
Question: 31 of 60
What is a feature of an Avro file?
Correct.
Question: 32 of 60
What happens if a task fails during a Hadoop job execution?
Correct.
Correct.
Question: 34 of 60
What is the default data type in Big R?
A. complex
B. character
C. integer
D. numeric
Question: 35 of 60
Which command is used to launch an interactive Python shell for Spark?
Correct.
A. spark-shell
B. hadoop pyshell
C. pyspark
D. python -spark
Question: 36 of 60
What is the primary core abstraction of Apache Spark?
C. Spark Streaming
D. GraphX
Question: 37 of 60
What is a feature of Apache ZooKeeper?
Question: 38 of 60
Which command is used to launch an interactive Apache Spark shell?
Correct.
A. hadoop spark
B. spark
C. spark-shell
D. scala --spark
Question: 39 of 60
Which action is performed during the Reduce step of a MapReduce v1 processing cycle?
Correct.
Question: 40 of 60
What are two major business advantages of using BigSheets? (Choose two.)
(Please select ALL that apply)
Incorrect.The correct answer is :built-in data readers for multiple formatsspreadsheet-like querying
and discovery interfaceExplanationDW613_Course_Guide_V1 3-24
Question: 41 of 60
A Hadoop file listing is performed and one of the output lines is: -rw-r--r-- 5 biadmin biadmin 871233
2015-09-12 09:33 data.txt What does the 5 in the output represent?
Incorrect.The correct answer is :replication factorExplanationDW613_Course_Guide_V2 8-54
A. replication factor
C. permissions
D. data size
Question: 42 of 60
When creating a new table in Big SQL, what additional keyword is used in the CREATE TABLE
statement to create the table in HDFS?
Correct.
A. dfs
B. cloud
C. replicated
D. hadoop
Question: 43 of 60
What is the ApplicationMaster in YARN responsible for? (Choose two.)
(Please select ALL that apply)
Correct.
A. command line
B. web browser
C. mobile app
Question: 45 of 60
What command is used to start a Flume agent?
A. flume-agent
B. flume-start
C. flume-ng
D. flume-src
Question: 46 of 60
Which integration API does Apache Ambari support?
Correct.
A. SOAP
B. RMI
C. REST
D. RPC
Question: 47 of 60
What does the MLlib component of Apache Spark support?
Correct.
A. stream processing
C. graph computation
Question: 48 of 60
What are two benefits of using the IBM Big SQL processing engine? (Choose two.)
(Please select ALL that apply)
Correct.
Question: 49 of 60
What does the programmatic implementation of a Map function do?
Correct.
Correct.
Question: 51 of 60
What command will load the BigR package in R?
Correct.
A. dir(pattern="bigr")
B. source("bigr")
C. bigr.connect
D. library(bigr)
Question: 52 of 60
Which command must be run first to become the HDFS user?
Correct.
A. pwd
B. hadoop fs
C. su - hdfs
D. hdfs
Question: 53 of 60
What type of NoSQL datastore does HBase fall into?
A. key-value
B. document
C. graph
D. column
Question: 54 of 60
Which two tasks can an Apache Ambari admin do that a regular Apache Ambari user cannot do?
(Choose two.)
(Please select ALL that apply)
Correct.
B. modify configurations
Question: 55 of 60
What does the federation feature of Big SQL allow?
Correct.
A. rewriting statements for better execution performance
Question: 56 of 60
Which statement is true regarding Reduce tasks in MapReduce?
Correct.
A. They only run on nodes that didn't generate data during the Map step.
B. They run only on nodes that generated data during the Map step.
Question: 57 of 60
How does Sqoop decide how to split data across mappers?
Question: 58 of 60
What is the JSqsh tool used for?
Correct.
Question: 59 of 60
What does the HCatalog component of Hive provide?
Correct.
Question: 60 of 60
What command is used to retrieve multiple rows out of an HBase table?
A. get
B. pull
C. scan
D. select
Question: 1 of 60
Which statement will create a table with parquet files?
Correct.
Question: 2 of 60
In the ZooKeeper environment, what does atomicity guarantee?
Question: 3 of 60
Which two components make up a Hadoop node? (Choose two.)
(Please select ALL that apply)
Correct.
A. disk
B. memory
C. network
D. CPU
Question: 4 of 60
Which component connects sinks and sources in Flume?
Correct.
A. HDFS
B. channels
C. ElasticSearch
D. interceptors
Question: 5 of 60
How does Apache Ambari use the Ganglia component?
Correct.
Question: 6 of 60
Why does YARN scale better than Hadoop v1 for multiple jobs? (Choose two.)
(Please select ALL that apply)
Incorrect.The correct answer is :There is one Application Master per job.Job tracking and resource
management are split.ExplanationDW613_Course_Guide_V2 9-44
Question: 7 of 60
What is a key factor in determining how to implement file compression with HDFS?
Correct.
Question: 8 of 60
An organization is developing a proof-of-concept for a big data system. Which phase of the big data
adoption cycle is the company currently in?
Correct.
A. Execute
B. Explore
C. Engage
D. Educate
Question: 9 of 60
Which component is required for Flume to work?
A. Interceptor
B. Data source
C. Syslog
D. RDBMS
Question: 10 of 60
What is a limitation of Apache Spark?
Correct.
A. It does not have universal tools.
Question: 11 of 60
Assuming the same data is stored in multiple data formats, which format will provide faster query
execution and require the least amount of IO operations to process?
Correct.
A. XML
B. flat file
C. JSON
D. Parquet
Question: 12 of 60
Which programming language is Apache Spark primarily written in?
A. Scala
B. Java
C. C++
D. Python 2
Question: 13 of 60
What is the default install location for the IBM Open Data Platform on Linux?
B. /usr/local/iop
C. /opt/ibm/iop
D. /var/iop
Question: 14 of 60
What does the bucketing feature of Hive do?
Correct.
Question: 15 of 60
What command will list files located on the HDFS in R?
A. list()
B. ls()
C. bigr.dir()
D. bigr.listfs()
Question: 16 of 60
Which open source component is a big data processing framework?
Correct.
A. IBM Big SQL
B. Apache Ambari
C. IBM BigSheets
D. Apache Spark
Question: 17 of 60
Data collected within your organization has a short period of time when it is relevant. Which
characteristic of a big data system does this represent?
Correct.
A. Velocity
B. Variety
C. Validation
D. Volume
Question: 18 of 60
What is a feature of an Avro file?
Correct.
Question: 19 of 60
What does the MLlib component of Apache Spark support?
Correct.
A. SQL and HiveQL
B. graph computation
D. stream processing
Question: 20 of 60
Which integration API does Apache Ambari support?
Correct.
A. REST
B. RPC
C. SOAP
D. RMI
Question: 21 of 60
What is the ApplicationMaster in YARN responsible for? (Choose two.)
(Please select ALL that apply)
Correct.
Question: 22 of 60
Which data inconsistency may appear while using ZooKeeper?
Correct.
Question: 23 of 60
What does the HCatalog component of Hive provide?
Correct.
Question: 24 of 60
What command is used to start a Flume agent?
Correct.
A. flume-ng
B. flume-start
C. flume-agent
D. flume-src
Question: 25 of 60
What are two benefits of using the IBM Big SQL processing engine? (Choose two.)
(Please select ALL that apply)
Correct.
A. It provides access to Hadoop data using SQL.
Question: 26 of 60
What are two major business advantages of using BigSheets? (Choose two.)
(Please select ALL that apply)
Incorrect.The correct answer is :built-in data readers for multiple formatsspreadsheet-like querying
and discovery interfaceExplanationDW613_Course_Guide_V1 3-24
Question: 27 of 60
What command will load the BigR package in R?
Correct.
A. source("bigr")
B. dir(pattern="bigr")
C. bigr.connect
D. library(bigr)
Question: 28 of 60
Which action is performed prior to the Map step of a MapReduce v1 processing cycle?
Correct.
Question: 29 of 60
Which action is performed during the Reduce step of a MapReduce v1 processing cycle?
Correct.
Question: 30 of 60
Which software is at the core of the IBM BigInsights platform?
Correct.
Question: 31 of 60
How does an end-user interact with the IBM BigSheets tool?
Correct.
B. mobile app
C. command line
D. web browser
Question: 32 of 60
What does the federation feature of Big SQL allow?
Correct.
Question: 33 of 60
Which command is used to launch an interactive Apache Spark shell?
Correct.
A. spark
B. hadoop spark
C. scala --spark
D. spark-shell
Question: 34 of 60
Which kind of HBase row key maps to multiple SQL columns?
A. Composite
B. Dense
C. Primary
D. Unique
Question: 35 of 60
How can you reduce the memory usage of the ANALYZE command in Big SQL?
Correct.
Question: 36 of 60
Which statement best describes Spark?
Correct.
Question: 37 of 60
Which two commands are used to load data into an existing Big SQL table from HDFS? (Choose
two.)
(Please select ALL that apply)
Correct.
A. Table
B. Select
C. Create
D. Load
E. Insert
Question: 38 of 60
Which feature in a Big SQL federation is a library to access a particular type of data source?
Correct.
A. server
B. view
C. wrapper
D. table
Question: 39 of 60
How will the following column mapping command be encoded? cf_data:full_names mapped by
(last_name, First_name) separator ','
Correct.
A. Character
B. String
C. Binary
D. Hex
Question: 40 of 60
Which statement is used to set the correct compatible collation with Big SQL?
A. SEQUENCE
B. PUSHDOWN
C. CREATE WRAPPER
D. CREATE SERVER
Question: 41 of 60
Which command should you use to set the default schema in a Big SQL table and also create the
schema if it does not exist?
A. default
B. use
C. format
D. create
Question: 42 of 60
Which core component of the Hadoop framework is highly scalable and a common tool?
Correct.
A. MapReduce
B. Sqoop
C. Pig
D. Hive
Question: 43 of 60
Which Big SQL file format is human readable and supported by most tools, but is the least efficient
file format?
Correct.
A. Delimited
B. Sequence
C. Avro
D. Parquet
Question: 44 of 60
You need to create multiple Big SQL tables with columns defined as CHAR. What needs to be set to
enable CHAR columns?
Correct.
C. SET SYSHADOOP.COMPATIBILITY_MODE=1
D. SET HADOOPCOMPATIBLITY_MODE=True
Question: 45 of 60
Which underlying data representation and access method does Big SQL use?
Correct.
A. SMALLINT
B. Hive
C. MAP
D. TINYINT
Question: 46 of 60
Which Big SQL datatype should be avoided because it causes significant performance degradation?
Correct.
A. UNION
B. VARCHAR
C. STRING
D. CHAR
Question: 47 of 60
Which type of HBase column is mapped to multiple SQL columns?
Correct.
A. Composite
B. Dense
C. Double
D. Exclusive
Question: 48 of 60
What is missing from the following statement when querying a remote table? CREATE _______ FOR
remotetable1 ...
Correct.
A. TABLE
B. INDEX
C. VIEW
D. NICKNAME
Question: 49 of 60
You have a very large Hadoop file system. You need to work on the data without migrating the data
out or changing the data format. Which IBM tool should you use?
Correct.
A. MapReduce
B. Pig
C. Big SQL
Question: 50 of 60
How can you fix duplicate results generated by an extractor from the same text because the text
matches more than one dictionary entry?
Question: 51 of 60
Where should you build extractors in the Information Extraction Web Tool?
Correct.
A. Documents
B. Regular expression
C. Property pane
D. Canvas
Question: 52 of 60
Which feature of Text Analytics allows you to rollback your extractors when necessary?
Correct.
A. Snapshots
B. Standard tokenizer
C. Multilingual tokenizer
D. Scalar functions
Question: 53 of 60
What are extractors transformed into when they are executed?
Correct.
Question: 54 of 60
In which text analytics phase are extractors developed and tested?
Correct.
A. Production
B. Analysis
C. Performance Tuning
D. Rule Development
Question: 55 of 60
How is a sequence created in Canvas?
Correct.
Question: 56 of 60
Which Text Analytics runtime component is used for languages such as Spanish and English by
breaking a stream of text into phrases or words?
Correct.
A. Standard tokenizer
B. Other extractors
C. Multilingual tokenizer
Question: 57 of 60
Which AQL candidate rule combines tuples from two views with the same schema?
Correct.
A. Sequence
B. Select
C. Blocks
D. Union
Question: 58 of 60
What defines a relation in an AQL extractor?
Correct.
A. a row
B. a column
C. a schema
D. a view
Question: 59 of 60
Which basic feature rule of AQL helps find an exact match to a single word or phrase?
Correct.
A. Part of Speech
B. Literals
C. Splits
D. Dictionary
Question: 60 of 60
What should you do in Text Analytics to fix an extractor that produces unwanted results?
Correct.