Cloudera Certified Administrator for Apache Hadoop
Exam 100% Verified
Your Hadoop cluster has 25 nodes with a total of 100 TB (4 TB per node) of raw disk
space allocated to HDFS storage. Assuming the default configuration of Hadoop, how
much data will you be able to store?
A) About 10 TB
B) About 33 TB
C) About 25 TB
D) About 100 TB - ANSWER
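The arithmetic behind this question can be sketched as follows. This is a minimal illustration, assuming the stock HDFS replication factor of 3 (`dfs.replication`) and ignoring space reserved for non-DFS use or intermediate job data. The function name `usable_hdfs_capacity_tb` is hypothetical and is not part of any Hadoop API.

```python
def usable_hdfs_capacity_tb(raw_tb: float, replication: int = 3) -> float:
    """Estimate how much user data fits in HDFS for a given raw capacity.

    Assumption: every block is stored `replication` times, so usable
    capacity is roughly raw capacity divided by the replication factor.
    """
    if replication < 1:
        raise ValueError("replication factor must be at least 1")
    return raw_tb / replication


nodes = 25
raw_per_node_tb = 4
raw_total_tb = nodes * raw_per_node_tb  # 100 TB of raw disk

print(round(usable_hdfs_capacity_tb(raw_total_tb), 1))  # 33.3
```

Under these assumptions, 100 TB of raw disk holds roughly 33 TB of data.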
The most important consideration for slave nodes in a Hadoop cluster running
production jobs that require short turnaround times is:
A) The ratio between the amount of memory and the total storage capacity
B) The ratio between the number of processor cores and number of disk drives
C) The ratio between the number of processor cores and total storage capacity
D) The ratio between the number of processor cores and the amount of memory
E) The ratio between the amount of memory and the number of disk drives - ANSWER
Which of the following are three distcp features you can use on a Hadoop cluster?
A) You can use distcp to copy data between directories in the same cluster
B) You can use distcp to copy HBase table files
C) You can use distcp to execute an internal MapReduce job to copy files
D) You can use distcp to copy physical blocks from source to the target destination in
your cluster
E) Use distcp to copy files only between two clusters or more. You cannot use distcp to
copy data between directories inside the same cluster - ANSWER
You have a cluster with 32 slave nodes and 3 master nodes running MapReduce v1
(MRv1).
You run the following command: $ hadoop fsck /
Which four cluster conditions will running this command report?
A) Blocks replicated improperly or that don't meet your cluster upgrade policy (for
example, too many blocks replicated on the same node)
B) Under-replicated blocks
C) Current state of the filesystem returned from scanning individual blocks on each
datanode
D) Number of datanodes
E) Number of dead datanodes
F) Configured capacity of your cluster
G) Current state of the filesystem as it is known to the namenode
H) Location of every block - ANSWER
Your Hadoop cluster consists of nodes in three racks. You have not populated the
dfs.hosts property in the configuration file of the NameNode. What happens?
A) The NameNode will update the dfs.hosts property with those machines running the
DataNode daemon at the next reboot of the NameNode or with the invocation of the
command dfsadmin -refreshNodes.
B) No new nodes can be added to the cluster until you specify them in the dfs.hosts file
C) Any machine running the DataNode daemon can immediately join the cluster
D) Presented with an empty dfs.hosts property, the NameNode will allow DataNodes
specified in mapred.hosts to join the cluster - ANSWER
You suspect that your NameNode is incorrectly configured, and is swapping memory to
disk. Which Linux commands help you to identify whether swapping is occurring?
A) vmstat
B) free
C) swapinfo
D) top
E) df
F) memcat - ANSWER
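As a rough illustration of how swapping shows up in practice, the sketch below scans `vmstat` output for nonzero values in the `si` (swap in) and `so` (swap out) columns. The two sample outputs are hand-written for illustration, not captured from a real NameNode, and the helper name `swapping_detected` is hypothetical.

```python
def swapping_detected(vmstat_output: str) -> bool:
    """Return True if any vmstat sample row shows swap-in or swap-out activity."""
    rows = [line.split() for line in vmstat_output.strip().splitlines() if line.strip()]
    # Locate the column-name row (the one that contains both "si" and "so").
    header_idx = next(i for i, cols in enumerate(rows) if "si" in cols and "so" in cols)
    header = rows[header_idx]
    si, so = header.index("si"), header.index("so")
    for cols in rows[header_idx + 1:]:
        # Skip banner lines and repeated headers that are not numeric samples.
        if len(cols) != len(header) or not cols[si].isdigit() or not cols[so].isdigit():
            continue
        if int(cols[si]) > 0 or int(cols[so]) > 0:
            return True
    return False


# Illustrative samples only (assumed layout of `vmstat 1` on Linux).
SAMPLE_IDLE = """
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
 1  0      0 812340  10240 402112    0    0     5    10  100  200  1  1 98  0  0
"""

SAMPLE_SWAPPING = """
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
 2  1 524288  10212   1024  20480  120  340    50    90  300  600 10  5 70 15  0
"""

print(swapping_detected(SAMPLE_IDLE))      # False
print(swapping_detected(SAMPLE_SWAPPING))  # True
```

A nonzero `swpd` value alone only means swap space has been used at some point. Sustained nonzero `si`/`so` values are what indicate active swapping.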
Which daemon performs checkpoint operations of the namespace state in a
cluster configured with HDFS High Availability (HA) using Quorum-based storage?
A) Standby NameNode
B) Secondary NameNode
C) CheckpointNode
D) NameNode
E) NodeManager - ANSWER
Which two features does Kerberos security add to a Hadoop cluster?
A) Encryption for data on disk ("at rest") is provided.
B) User access authentication against a central server is provided.
C) All RPCs must be authenticated as a user.