CLOUDERA CERTIFIED ADMINISTRATOR
FOR APACHE HADOOP EXAM QUESTIONS
AND ANSWERS UPDATED 2024/2025 A
COMPLETE SOLUTION ALL ANSWERS
CORRECT DETAILED BEST GRADED A+ FOR
SUCCESS
Your Hadoop cluster has 25 nodes with a total of 100 TB (4 TB per node) of raw
disk space allocated to HDFS storage. Assuming Hadoop's default configuration,
how much data will you be able to store?
A) Approximately 10 TB
B) Approximately 33 TB
C) Approximately 25 TB
D) Approximately 100 TB - CORRECT ANSWERS
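The arithmetic behind this question can be sketched as follows. This is a rough estimate only: it assumes the default HDFS replication factor of 3 (dfs.replication) and ignores space reserved for non-HDFS use and intermediate job data. The function name usable_capacity_tb is hypothetical.

```python
def usable_capacity_tb(raw_tb: float, replication: int = 3) -> float:
    """Approximate user data that fits in raw_tb of disk.

    Assumes every block is stored `replication` times (HDFS default: 3)
    and ignores any reserved or non-DFS space.
    """
    if replication < 1:
        raise ValueError("replication must be at least 1")
    return raw_tb / replication


nodes = 25
per_node_tb = 4
raw_tb = nodes * per_node_tb  # 100 TB of raw disk
print(f"raw: {raw_tb} TB, usable: {usable_capacity_tb(raw_tb):.1f} TB")
# prints "raw: 100 TB, usable: 33.3 TB"
```

Under these assumptions, roughly one third of the raw capacity is available for data.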
The most important consideration for slave nodes in a Hadoop cluster running
production jobs that require short turnaround times is:
A) The ratio between the amount of memory and the total storage capacity
B) The ratio between the number of processor cores and number of disk drives
C) The ratio between the number of processor cores and total storage capacity
D) The ratio between the number of processor cores and the amount of
memory
E) The ratio between the amount of memory and the number of disk drives -
CORRECT ANSWERS
Which three distcp features can you utilize on a Hadoop cluster?
A) Use distcp to copy data between directories inside the same cluster
B) Use distcp to copy HBase table files
C) Use distcp to run an internal MapReduce job to copy files
D) Use distcp to copy physical blocks from the source to the target destination
in your cluster
E) Use distcp to copy files only between two or more clusters. You cannot use
distcp to copy data between directories inside the same cluster - CORRECT
ANSWERS
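As a small illustration of how distcp is invoked, the sketch below builds the command line for a copy. distcp takes a source and a target URI, runs as a MapReduce job, and copies at the file level; the two URIs may point at the same cluster or at different ones. The helper name distcp_command, the host names nn1 and nn2, the port, and the paths are all hypothetical.

```python
import shlex
from typing import List


def distcp_command(source: str, target: str) -> List[str]:
    """Build the argv for a basic `hadoop distcp <source> <target>` call."""
    return ["hadoop", "distcp", source, target]


# Hypothetical URIs: a copy inside one cluster and a copy between two clusters.
same_cluster = distcp_command("hdfs://nn1:8020/data/in", "hdfs://nn1:8020/backup/in")
cross_cluster = distcp_command("hdfs://nn1:8020/data/in", "hdfs://nn2:8020/data/in")

print(shlex.join(same_cluster))
print(shlex.join(cross_cluster))
```

The resulting list can be passed to subprocess.run on a host where the hadoop client is installed.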
You have a cluster with 32 slave nodes and 3 master nodes running
MapReduce v1 (MRv1).
You execute the command: $ hadoop fsck /
Which four cluster conditions will running this command report?
A) Blocks replicated improperly or that don't satisfy your cluster's block
placement policy (e.g., too many blocks replicated on the same node)
B) Under-replicated blocks
C) The current state of the file system returned from scanning individual blocks
on each DataNode
D) Number of DataNodes
E) Number of dead DataNodes
F) Configured capacity of your cluster
G) The current state of the file system according to the NameNode
H) The location for every block - CORRECT ANSWERS
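For reference, fsck reads its information from the NameNode rather than scanning blocks on each DataNode, and per-block detail is only printed when extra options are given. The sketch below builds the command line with the -files, -blocks and -locations options, which nest in that order in the fsck usage text. The helper name fsck_command is hypothetical.

```python
from typing import List


def fsck_command(path: str = "/", files: bool = False,
                 blocks: bool = False, locations: bool = False) -> List[str]:
    """Build the argv for `hadoop fsck <path> [-files [-blocks [-locations]]]`."""
    # Each option is only meaningful together with the one before it,
    # so asking for a deeper level switches on the shallower ones.
    if locations:
        blocks = True
    if blocks:
        files = True
    argv = ["hadoop", "fsck", path]
    if files:
        argv.append("-files")
    if blocks:
        argv.append("-blocks")
    if locations:
        argv.append("-locations")
    return argv


print(" ".join(fsck_command("/")))
# prints "hadoop fsck /"
print(" ".join(fsck_command("/", locations=True)))
# prints "hadoop fsck / -files -blocks -locations"
```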
Your Hadoop cluster contains nodes in three racks. You have not configured the
dfs.hosts property in the NameNode's configuration file. What happens?
A) The NameNode will update the dfs.hosts property to include machines
running the DataNode daemon on the next NameNode reboot or with the
command dfsadmin -refreshNodes
B) No new nodes can be added to the cluster until you specify them in the
dfs.hosts file
C) Any machine running the DataNode daemon can immediately join the
cluster
D) Presented with a blank dfs.hosts property, the NameNode will permit
DataNodes specified in mapred.hosts to join the cluster - CORRECT ANSWERS
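The include-list behaviour this question turns on can be modelled in a few lines. This is a simplified sketch, not the NameNode's actual code: it assumes that an unset or empty dfs.hosts include list places no restriction on which DataNodes may register, and it deliberately leaves out the exclude list (dfs.hosts.exclude) and decommissioning. The function name and host names are hypothetical.

```python
from typing import Set


def may_register(host: str, include_hosts: Set[str]) -> bool:
    """Simplified model of the dfs.hosts include list.

    An empty include list means no restriction; otherwise the host
    must be listed explicitly.
    """
    return not include_hosts or host in include_hosts


# dfs.hosts not configured: the include list is empty.
print(may_register("dn7.example.com", set()))
# prints "True"

# dfs.hosts configured with a single (hypothetical) host.
print(may_register("dn7.example.com", {"dn1.example.com"}))
# prints "False"
```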
You suspect that your NameNode is incorrectly configured, and is swapping
memory to disk. Which Linux commands help you to identify whether
swapping is occurring?
A) vmstat
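As an illustration of reading vmstat for swap activity, the sketch below looks at the si (swapped in) and so (swapped out) columns; sustained non-zero values suggest the host is swapping. The sample text is made up for the example, and the helper names are hypothetical. Columns are located by header name, since the exact column layout can differ between vmstat versions.

```python
from typing import List, Tuple

# Illustrative sample only, not output captured from a real host.
SAMPLE = """\
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa st
 1  0  51200  81234  10240 204800   12   48   120   300  500  900 10  5 80  5  0
"""


def swap_activity(vmstat_output: str) -> List[Tuple[int, int]]:
    """Return (si, so) pairs for every data row in vmstat output."""
    rows = [line.split() for line in vmstat_output.strip().splitlines()]
    # The column-name row is the one that contains both "si" and "so".
    header_idx = next(i for i, cols in enumerate(rows)
                      if "si" in cols and "so" in cols)
    header = rows[header_idx]
    si, so = header.index("si"), header.index("so")
    return [(int(r[si]), int(r[so]))
            for r in rows[header_idx + 1:] if len(r) == len(header)]


def is_swapping(vmstat_output: str) -> bool:
    """True if any sample shows pages swapped in or out."""
    return any(si > 0 or so > 0 for si, so in swap_activity(vmstat_output))


print(swap_activity(SAMPLE))
# prints "[(12, 48)]"
print(is_swapping(SAMPLE))
# prints "True"
```

On a live host the input could come from running vmstat with an interval and count, for example through subprocess.run.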