C# (CSharp) Lucene.Net.Analysis.Hi Namespace

Classes

Name Description
HindiNormalizationFilter A TokenFilter that applies HindiNormalizer to normalize the orthography.

In some cases the normalization may cause unrelated terms to conflate, so to prevent terms from being normalized use an instance of SetKeywordMarkerFilter or a custom TokenFilter that sets the KeywordAttribute before this TokenStream.

HindiNormalizationFilterFactory Factory for HindiNormalizationFilter.
 <fieldType name="text_hinormal" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.HindiNormalizationFilterFactory"/> </analyzer> </fieldType>
HindiStemFilter A TokenFilter that applies HindiStemmer to stem Hindi words.
HindiStemFilterFactory Factory for HindiStemFilter.
 <fieldType name="text_histem" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.HindiStemFilterFactory"/> </analyzer> </fieldType>
HindiStemmer Light Stemmer for Hindi.

Implements the algorithm specified in: A Lightweight Stemmer for Hindi Ananthakrishnan Ramanathan and Durgesh D Rao. http://computing.open.ac.uk/Sites/EACLSouthAsia/Papers/p6-Ramanathan.pdf

TestHindiAnalyzer Tests the HindiAnalyzer
TestHindiNormalizer Test HindiNormalizer
TestHindiNormalizer.AnalyzerAnonymousInnerClassHelper
TestHindiStemmer Test HindiStemmer
TestHindiStemmer.AnalyzerAnonymousInnerClassHelper