Sail E0 Webinar

MCQs

Total Questions : 16 | Page 1 of 2 pages
Question 1. PostingsFormat now uses a __________ API when writing postings, just like doc values.
  1.    push
  2.    pull
  3.    read
  4.    All of the mentioned
 Discuss Question
Answer: Option B. -> pull


This is powerful because you can do things in your postings format that require making more than one pass through the postings such as iterating over all postings.


Question 2. Which of the following is a more compact binary format ?
  1.    TCompactProtocol
  2.    TDenseProtocol
  3.    TBinaryProtocol
  4.    TSimpleJSONProtocol
 Discuss Question
Answer: Option A. -> TCompactProtocol


TCompactProtocol is typically more efficient to process as well.


Question 3. The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.
  1.    ShngleFil
  2.    ShingleFilter
  3.    SingleFilter
  4.    Collfilter
 Discuss Question
Answer: Option B. -> ShingleFilter


The tools that the collocation identification algorithm are embedded within either consume tokenized text as input or provide the ability to specify an implementation of the Lucene Analyzer class perform tokenization in order to form ngrams.


Question 4. Which of the following language is not supported by Spark ?
  1.    Java
  2.    Pascal
  3.    Scala
  4.    Python
 Discuss Question
Answer: Option B. -> Pascal


The Spark engine runs in a variety of environments, from cloud services to Hadoop or Mesos clusters.


Question 5. ______________ leverages Spark Core's fast scheduling capability to perform streaming analytics.
  1.    MLlib
  2.    Spark Streaming
  3.    GraphX
  4.    RDDs
 Discuss Question
Answer: Option B. -> Spark Streaming


Spark Streaming ingests data in mini-batches and performs RDD transformations on those mini-batches of data.


Question 6. Crunch was designed for developers who understand __________ and want to use MapReduce effectively.
  1.    Java
  2.    Python
  3.    Scala
  4.    Javascript
 Discuss Question
Answer: Option A. -> Java


Crunch is often used in conjunction with Hive and Pig.


Question 7. Distributed Mode are mapped in the __________ file.
  1.    groomservers
  2.    grervers
  3.    grsvers
  4.    groom
 Discuss Question
Answer: Option A. -> groomservers


Distributed Mode is used when you have multiple machines.


Question 8. Drill integrates with BI tools using a standard __________ connector.
  1.    JDBC
  2.    ODBC
  3.    ODBC-JDBC
  4.    All of the mentioned
 Discuss Question
Answer: Option B. -> ODBC


Drill conforms to the stringent ANSI SQL standards ensuring compatibility with existing BI environments as well as Hive deployments.


Question 9. A number of ____________ source adapters give you the granular control to grab a specific file.
  1.    multimedia file
  2.    text file
  3.    image file
  4.    None of the mentioned
 Discuss Question
Answer: Option B. -> text file


A number of predefined source adapters are built into Flume.


Question 10. DoFns provide direct access to the __________ object that is used within a given Map or Reduce task via the getContext method.
  1.    TaskInputContext
  2.    TaskInputOutputContext
  3.    TaskOutputContext
  4.    All of the mentioned
 Discuss Question
Answer: Option B. -> TaskInputOutputContext


There are also a number of helper methods for working with the objects associated with the TaskInputOutputContext


Latest Videos

Latest Test Papers