
CCA-500 Exam Dumps - Cloudera Certified Administrator for Apache Hadoop (CCAH)

Question # 4

Which command does Hadoop offer to discover missing or corrupt HDFS data?

A.

hdfs fs -du

B.

hdfs fsck

C.

dskchk

D.

The map-only checksum

E.

Hadoop does not provide any tools to discover missing or corrupt data; there is no need because three replicas are kept for each data block
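
For readers who want to experiment with an HDFS file system check, the following is a minimal sketch that only builds the command line and does not run it. The helper name build_fsck_command and its parameters are hypothetical; the flags shown (-files, -blocks, -locations, -list-corruptfileblocks) are standard hdfs fsck options.

```python
def build_fsck_command(path="/", list_corrupt_only=False):
    """Build an argv list for an HDFS file system check (not executed here).

    Hypothetical helper for illustration; pass the result to
    subprocess.run() on a machine with a configured Hadoop client.
    """
    cmd = ["hdfs", "fsck", path]
    if list_corrupt_only:
        # Report only the files that own corrupt blocks.
        cmd.append("-list-corruptfileblocks")
    else:
        # Per-file report with block IDs and the DataNodes holding each replica.
        cmd += ["-files", "-blocks", "-locations"]
    return cmd
```

Calling build_fsck_command("/user") yields the arguments for a detailed report on /user, while list_corrupt_only=True narrows the report to files with corrupt blocks.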

Question # 5

For each YARN job, the Hadoop framework generates task log files. Where are Hadoop task log files stored?

A.

Cached by the NodeManager managing the job containers, then written to a log directory on the NameNode

B.

Cached in the YARN container running the task, then copied into HDFS on job completion

C.

In HDFS, in the directory of the user who generates the job

D.

On the local disk of the slave node running the task

Question # 6

Which two features does Kerberos security add to a Hadoop cluster? (Choose two)

A.

User authentication on all remote procedure calls (RPCs)

B.

Encryption for data during transfer between the Mappers and Reducers

C.

Encryption for data on disk (“at rest”)

D.

Authentication for user access to the cluster against a central server

E.

Root access to the cluster for users hdfs and mapred but non-root access for clients

Question # 7

Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?

A.

SampleJar.jar is sent to the ApplicationMaster, which allocates a container for SampleJar.jar

B.

SampleJar.jar is placed in a temporary directory in HDFS

C.

SampleJar.jar is sent directly to the ResourceManager

D.

SampleJar.jar is serialized into an XML file which is submitted to the ApplicationMaster

Question # 8

Your Hadoop cluster contains nodes in three racks. You have not configured the dfs.hosts property in the NameNode’s configuration file. What results?

A.

The NameNode will update the dfs.hosts property to include machines running the DataNode daemon on the next NameNode reboot or with the command dfsadmin -refreshNodes

B.

No new nodes can be added to the cluster until you specify them in the dfs.hosts file

C.

Any machine running the DataNode daemon can immediately join the cluster

D.

Presented with a blank dfs.hosts property, the NameNode will permit DataNodes specified in mapred.hosts to join the cluster
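
The role of the dfs.hosts include list can be illustrated with a deliberately simplified model. This sketch is not the NameNode's actual code: the function name may_register and the host names are hypothetical, and the exclude list (dfs.hosts.exclude) and decommissioning are left out. It assumes only the documented behaviour that an empty include list places no restriction on which hosts may connect.

```python
def may_register(host, include_hosts):
    """Simplified model of an include-list check for a DataNode.

    include_hosts stands in for the contents of the file named by
    dfs.hosts; an empty collection models the property being unset.
    """
    if not include_hosts:
        # No include list configured: no host is filtered out.
        return True
    # Include list configured: only listed hosts pass the check.
    return host in include_hosts
```

With an empty include list, may_register("dn1.example.com", []) returns True; once the list names specific hosts, any host not on it is rejected by this model.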

Question # 9

You observed that the number of spilled records from Map tasks far exceeds the number of map output records. Your child heap size is 1GB and your io.sort.mb value is set to 1000MB. How would you tune your io.sort.mb value to achieve the maximum memory-to-disk I/O ratio?

A.

For a 1GB child heap size an io.sort.mb of 128 MB will always maximize memory to disk I/O

B.

Increase the io.sort.mb to 1GB

C.

Decrease the io.sort.mb value to 0

D.

Tune the io.sort.mb value until you observe that the number of spilled records equals (or is as close as possible to) the number of map output records.
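
The two quantities this question compares can be turned into simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, the counter values in the usage note are made up, and the heap is taken as 1 GB = 1024 MB.

```python
def spill_ratio(spilled_records, map_output_records):
    """Spilled records per map output record.

    A value near 1.0 means each record was written to disk about once;
    larger values suggest records were spilled and re-merged repeatedly.
    """
    if map_output_records <= 0:
        raise ValueError("map_output_records must be positive")
    return spilled_records / map_output_records


def sort_buffer_heap_fraction(io_sort_mb, child_heap_mb):
    """Fraction of the child JVM heap claimed by the sort buffer."""
    if child_heap_mb <= 0:
        raise ValueError("child_heap_mb must be positive")
    return io_sort_mb / child_heap_mb
```

For example, spill_ratio(3000, 1000) gives 3.0, and sort_buffer_heap_fraction(1000, 1024) gives roughly 0.977, which shows that the configuration in the question leaves almost no heap for anything other than the sort buffer.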
