CCA-500 Exam Dumps - Cloudera Certified Administrator for Apache Hadoop (CCAH)

Go to page:

Question # 4

Which command does Hadoop offer to discover missing or corrupt HDFS data?

Hdfs fs â€“du

Hdfs fsck

Dskchk

The map-only checksum

Hadoop does not provide any tools to discover missing or corrupt data; there is not need because three replicas are kept for each data block

Full Access

Question # 5

For each YARN job, the Hadoop framework generates task log file. Where are Hadoop task log files stored?

Cached by the NodeManager managing the job containers, then written to a log directory on the NameNode

Cached in the YARN container running the task, then copied into HDFS on job completion

In HDFS, in the directory of the user who generates the job

On the local disk of the slave mode running the task

Full Access

Question # 6

Which two features does Kerberos security add to a Hadoop cluster? (Choose two)

User authentication on all remote procedure calls (RPCs)

Encryption for data during transfer between the Mappers and Reducers

Encryption for data on disk (â€œat restâ€)

Authentication for user access to the cluster against a central server

Root access to the cluster for users hdfs and mapred but non-root access for clients

Full Access

Question # 7

Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?

SampleJar.Jar is sent to the ApplicationMaster which allocates a container for SampleJar.Jar

Sample.jar is placed in a temporary directory in HDFS

SampleJar.jar is sent directly to the ResourceManager

SampleJar.jar is serialized into an XML file which is submitted to the ApplicatoionMaster

Full Access

Question # 8

Your Hadoop cluster contains nodes in three racks. You have not configured the dfs.hosts property in the NameNodeâ€™s configuration file. What results?

The NameNode will update the dfs.hosts property to include machines running the DataNode daemon on the next NameNode reboot or with the command dfsadmin â€“refreshNodes

No new nodes can be added to the cluster until you specify them in the dfs.hosts file

Any machine running the DataNode daemon can immediately join the cluster

Presented with a blank dfs.hosts property, the NameNode will permit DataNodes specified in mapred.hosts to join the cluster

Full Access

Go to page: