EDTE EXAM REVIEW QUESTIONS WITH
100% CORRECT ANSWERS!!
What does not belong to the Spark library & engine?
A.Spark core
B.MLlib
C.GraphX
D.Streaming
E.Spark SQL
F.Mesos
F
The figure below shows a Spark action. What are the correct words for (A) and (B), in
(A) - (B) form?
A.Stage - Task
B.Map - Reduce
C.Map - Task
D.Stage - Reduce
A
What is the best description for the figure below?
A.Spark Standalone
B.Spark Twins
C.Spark Dual-Pair
D.Spark Mesos
E.Spark Flamingo
A
Among the following descriptions of the relationship between Spark and Hadoop, which one is
incorrect?
A.Hadoop and Spark systems can work together
B.Many Spark systems are connected to a Hadoop HDFS through YARN
C.Hadoop is faster than Spark because MapReduce saves all of its processed data in its
physical storage medium (commonly HDDs) after each operation to be fault tolerant
(resilient from crashes)
D.In Spark, Hadoop's MapReduce functionality is replaced with Spark's RDD, DAG,
Transformations, and Actions
C
Among the following descriptions of RDDs, which one is incorrect?
A.RDD stands for Resilient Distributed Dataset
B.RDDs can recover from node failures automatically
C.Transformations are lazy during DAG (Directed Acyclic Graph) setup
D.RDD divides each dataset into logical partitions
E.RDDs can process Scala and Python objects but not Java objects
E
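The lazy-transformation, partitioning, and recovery ideas in the options above can be sketched in plain Python. This is a toy model only: `ToyRDD`, `parallelize`, `traced_double`, and the round-robin partitioning are hypothetical names and choices made for illustration, not the Spark or PySpark API. It assumes that a transformation only records a step in the lineage, and that work happens when an action (`collect`) is called.

```python
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical; NOT the PySpark API)."""

    def __init__(self, partitions, lineage=()):
        self.partitions = partitions      # logical partitions of the source data
        self.lineage = tuple(lineage)     # recorded transformations, not yet run

    @classmethod
    def parallelize(cls, data, num_partitions):
        data = list(data)
        # Round-robin split into logical partitions (an arbitrary choice here).
        return cls([data[i::num_partitions] for i in range(num_partitions)])

    def map(self, fn):
        # Transformation: only extends the lineage, computes nothing.
        return ToyRDD(self.partitions, self.lineage + (("map", fn),))

    def filter(self, pred):
        # Transformation: only extends the lineage, computes nothing.
        return ToyRDD(self.partitions, self.lineage + (("filter", pred),))

    def compute_partition(self, index):
        # Replays the lineage on one source partition. Calling this again
        # for the same index models recomputing a lost partition.
        items = list(self.partitions[index])
        for kind, fn in self.lineage:
            if kind == "map":
                items = [fn(x) for x in items]
            else:
                items = [x for x in items if fn(x)]
        return items

    def collect(self):
        # Action: triggers the actual computation over every partition.
        out = []
        for index in range(len(self.partitions)):
            out.extend(self.compute_partition(index))
        return out


calls = []

def traced_double(x):
    calls.append(x)   # records when the function really runs
    return x * 2

rdd = ToyRDD.parallelize(range(6), num_partitions=2)
doubled = rdd.map(traced_double).filter(lambda v: v % 4 == 0)
calls_before_action = len(calls)   # still 0: nothing has executed yet
result = doubled.collect()         # the action runs the recorded lineage
```

In this sketch `traced_double` is not called until `collect()` runs, which mirrors option C. Rebuilding one partition from the source data plus the lineage is the idea behind option B.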
Among the following descriptions of the Spark DAG (Directed Acyclic Graph), which one is
incorrect?
A.The computation sequence of Transformations is represented using a lineage graph