Sail E0 Webinar
Question
The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.
Options:
A .  ShngleFil
B .  ShingleFilter
C .  SingleFilter
D .  Collfilter
Answer: Option B


The tools that the collocation identification algorithm are embedded within either consume tokenized text as input or provide the ability to specify an implementation of the Lucene Analyzer class perform tokenization in order to form ngrams.



Was this answer helpful ?
Next Question

Submit Solution

Your email address will not be published. Required fields are marked *

Latest Videos

Latest Test Papers