Sail E0 Webinar

MCQs

Total Questions: 10
Question 1. Point out the wrong statement:
  1.    The Mapper outputs are sorted and then partitioned per Reducer
  2.    The total number of partitions is the same as the number of reduce tasks for the job
  3.    The intermediate, sorted outputs are always stored in a simple (key-len, key, value-len, value) format
  4.    None of the mentioned
Answer: Option D. -> None of the mentioned


All intermediate values associated with a given output key are subsequently grouped by the framework, and passed to the Reducer(s) to determine the final output.
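
For illustration, here is a minimal sketch of a custom Partitioner against the classic org.apache.hadoop.mapred API; the class name WordPartitioner and the Text/IntWritable types are assumptions, and the hash-mod scheme simply mirrors the default HashPartitioner.

    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapred.JobConf;
    import org.apache.hadoop.mapred.Partitioner;

    // Illustrative partitioner: routes each intermediate (key, value) pair to one
    // of the reduce tasks; the number of partitions equals the number of reducers.
    public class WordPartitioner implements Partitioner<Text, IntWritable> {

        @Override
        public void configure(JobConf job) {
            // No per-job configuration needed for this sketch.
        }

        @Override
        public int getPartition(Text key, IntWritable value, int numPartitions) {
            // Same hash-mod scheme as the default HashPartitioner.
            return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
        }
    }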


Question 2. Point out the wrong statement:
  1.    It is legal to set the number of reduce-tasks to zero if no reduction is desired
  2.    The outputs of the map-tasks go directly to the FileSystem
  3.    The MapReduce framework does not sort the map-outputs before writing them out to the FileSystem
  4.    None of the mentioned
Answer: Option D. -> None of the mentioned


In this case (zero reduce tasks), the outputs of the map-tasks go directly to the FileSystem, into the output path set by setOutputPath(Path), and the framework does not sort them before writing them out.
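
A minimal sketch of such a map-only job with the classic API; the class name MapOnlyJob is an assumption, and the stock IdentityMapper is used only so the example stands on its own.

    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapred.FileInputFormat;
    import org.apache.hadoop.mapred.FileOutputFormat;
    import org.apache.hadoop.mapred.JobClient;
    import org.apache.hadoop.mapred.JobConf;
    import org.apache.hadoop.mapred.lib.IdentityMapper;

    public class MapOnlyJob {
        public static void main(String[] args) throws Exception {
            // With zero reduce tasks, the (unsorted) map outputs are written
            // directly to the path given to setOutputPath(Path).
            JobConf conf = new JobConf(MapOnlyJob.class);
            conf.setJobName("map-only-example");
            conf.setMapperClass(IdentityMapper.class);
            conf.setNumReduceTasks(0);   // no reduction desired

            FileInputFormat.setInputPaths(conf, new Path(args[0]));
            FileOutputFormat.setOutputPath(conf, new Path(args[1]));

            JobClient.runJob(conf);
        }
    }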


Question 3. Applications can use the ____________ to report progress and set application-level status messages.
  1.    Partitioner
  2.    OutputSplit
  3.    Reporter
  4.    All of the mentioned
Answer: Option C. -> Reporter


Applications can also use the Reporter to update Counters, or simply to indicate that they are alive.
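
A minimal sketch of a Mapper that uses the Reporter this way, written against the classic org.apache.hadoop.mapred API; the class name ReportingMapper and the RECORDS_SEEN counter are hypothetical.

    import java.io.IOException;

    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapred.MapReduceBase;
    import org.apache.hadoop.mapred.Mapper;
    import org.apache.hadoop.mapred.OutputCollector;
    import org.apache.hadoop.mapred.Reporter;

    public class ReportingMapper extends MapReduceBase
            implements Mapper<LongWritable, Text, Text, IntWritable> {

        // Hypothetical application-level counter.
        enum MyCounters { RECORDS_SEEN }

        @Override
        public void map(LongWritable key, Text value,
                        OutputCollector<Text, IntWritable> output, Reporter reporter)
                throws IOException {
            reporter.setStatus("processing record at offset " + key.get());
            reporter.incrCounter(MyCounters.RECORDS_SEEN, 1);
            reporter.progress();   // tell the framework the task is still alive

            output.collect(value, new IntWritable(1));
        }
    }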


Question 4. __________ is the primary interface for a user to describe a MapReduce job to the Hadoop framework for execution.
  1.    JobConfig
  2.    JobConf
  3.    JobConfiguration
  4.    All of the mentioned
Answer: Option B. -> JobConf


JobConf is typically used to specify the Mapper, combiner (if any), Partitioner, Reducer, InputFormat, OutputFormat and OutputCommitter implementations.
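
A minimal sketch of describing a job through JobConf; the stock IdentityMapper/IdentityReducer and HashPartitioner classes are used purely so the example stands on its own, and the class name JobConfExample is an assumption.

    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapred.FileInputFormat;
    import org.apache.hadoop.mapred.FileOutputFormat;
    import org.apache.hadoop.mapred.JobClient;
    import org.apache.hadoop.mapred.JobConf;
    import org.apache.hadoop.mapred.TextInputFormat;
    import org.apache.hadoop.mapred.TextOutputFormat;
    import org.apache.hadoop.mapred.lib.HashPartitioner;
    import org.apache.hadoop.mapred.lib.IdentityMapper;
    import org.apache.hadoop.mapred.lib.IdentityReducer;

    public class JobConfExample {
        public static void main(String[] args) throws Exception {
            JobConf conf = new JobConf(JobConfExample.class);
            conf.setJobName("jobconf-example");

            // JobConf describes the job: mapper, combiner, partitioner, reducer,
            // input/output formats and output key/value types.
            conf.setMapperClass(IdentityMapper.class);
            conf.setCombinerClass(IdentityReducer.class);
            conf.setPartitionerClass(HashPartitioner.class);
            conf.setReducerClass(IdentityReducer.class);

            conf.setInputFormat(TextInputFormat.class);
            conf.setOutputFormat(TextOutputFormat.class);
            conf.setOutputKeyClass(LongWritable.class);
            conf.setOutputValueClass(Text.class);

            FileInputFormat.setInputPaths(conf, new Path(args[0]));
            FileOutputFormat.setOutputPath(conf, new Path(args[1]));

            JobClient.runJob(conf);
        }
    }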


Question 5. The right level of parallelism for maps seems to be around _________ maps per node.
  1.    1-10
  2.    10-100
  3.    100-150
  4.    150-200
Answer: Option B. -> 10-100


Task setup takes a while, so it is best if the maps take at least a minute to execute.
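
As a small sketch of how this is expressed in code: the actual number of maps is driven by the number of input splits, and JobConf.setNumMapTasks(int) only passes a hint to the framework; the value 100 below is arbitrary.

    import org.apache.hadoop.mapred.JobConf;

    public class MapParallelismHint {
        public static void main(String[] args) {
            JobConf conf = new JobConf();
            // Only a hint: the real map count comes from the input splits,
            // but roughly 10-100 maps per node is the usual target.
            conf.setNumMapTasks(100);
        }
    }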


Question 6. The ___________ executes the Mapper/Reducer task as a child process in a separate JVM.
  1.    JobTracker
  2.    TaskTracker
  3.    TaskScheduler
  4.    None of the mentioned
Answer: Option B. -> TaskTracker


The child-task inherits the environment of the parent TaskTracker.
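
The child JVM that the TaskTracker launches can be tuned through the job configuration; a minimal sketch, assuming the classic mapred.child.java.opts property and an arbitrary heap size.

    import org.apache.hadoop.mapred.JobConf;

    public class ChildJvmOpts {
        public static void main(String[] args) {
            JobConf conf = new JobConf();
            // Options passed to the separate child JVM the TaskTracker spawns
            // for each Mapper/Reducer task.
            conf.set("mapred.child.java.opts", "-Xmx512m");
        }
    }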


Question 7. During the execution of a streaming job, the names of the _______ parameters are transformed.
  1.    vmap
  2.    mapvim
  3.    mapreduce
  4.    mapred
Answer: Option D. -> mapred


The dots (.) in the parameter names are replaced with underscores (_); to read the values in a streaming job's mapper/reducer, use the parameter names with the underscores (for example, mapred.job.id becomes mapred_job_id).
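
A minimal sketch of a streaming mapper reading one such transformed parameter; it is written in Java only to match the other examples (any executable that reads stdin works), and the word-count-style output is illustrative.

    import java.io.BufferedReader;
    import java.io.InputStreamReader;

    // Streaming mapper sketch: reads lines from stdin, emits key<TAB>value on stdout,
    // and reads a transformed job parameter from the environment
    // (mapred.job.id is exposed to streaming tasks as mapred_job_id).
    public class StreamingEnvMapper {
        public static void main(String[] args) throws Exception {
            String jobId = System.getenv("mapred_job_id");
            System.err.println("running inside job " + jobId);   // goes to the task's stderr log

            BufferedReader in = new BufferedReader(new InputStreamReader(System.in));
            String line;
            while ((line = in.readLine()) != null) {
                System.out.println(line + "\t1");
            }
        }
    }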


Question 8. The ________ class provides the getValue() method to read the values from its instance.
  1.    Get
  2.    Result
  3.    Put
  4.    Value
Answer: Option B. -> Result


Fetch the result by passing your Get instance to the get() method of the HTable class. This method returns a Result object, which holds the requested row.
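
A minimal sketch with the classic HBase client API; the table name, row key, column family, and qualifier below are placeholders.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;
    import org.apache.hadoop.hbase.client.Get;
    import org.apache.hadoop.hbase.client.HTable;
    import org.apache.hadoop.hbase.client.Result;
    import org.apache.hadoop.hbase.util.Bytes;

    public class GetValueExample {
        public static void main(String[] args) throws Exception {
            Configuration conf = HBaseConfiguration.create();
            HTable table = new HTable(conf, "employee");       // placeholder table name

            Get get = new Get(Bytes.toBytes("row1"));          // placeholder row key
            Result result = table.get(get);                    // HTable.get returns a Result

            // Read one cell value from the Result instance.
            byte[] value = result.getValue(Bytes.toBytes("personal"), Bytes.toBytes("name"));
            System.out.println(Bytes.toString(value));

            table.close();
        }
    }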


Question 9. The standard output (stdout) and error (stderr) streams of the task are read by the TaskTracker and logged to:
  1.    ${HADOOP_LOG_DIR}/user
  2.    ${HADOOP_LOG_DIR}/userlogs
  3.    ${HADOOP_LOG_DIR}/logs
  4.    None of the mentioned
Answer: Option B. -> ${HADOOP_LOG_DIR}/userlogs


These per-task logs are written under ${HADOOP_LOG_DIR}/userlogs; in addition, the child-jvm always has its current working directory added to the java.library.path and LD_LIBRARY_PATH.


Question 10. The __________ class adds HBase configuration files to its object.
  1.    Configuration
  2.    Collector
  3.    Component
  4.    None of the mentioned
Answer: Option A. -> Configuration


You can create a configuration object using the create() method of the HBaseConfiguration class.
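
A minimal sketch of creating such a configuration object; printing hbase.zookeeper.quorum at the end is just an example.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;

    public class HBaseConfigExample {
        public static void main(String[] args) {
            // create() returns a Configuration with hbase-default.xml and
            // hbase-site.xml (if found on the classpath) already added.
            Configuration conf = HBaseConfiguration.create();
            System.out.println(conf.get("hbase.zookeeper.quorum"));
        }
    }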

