C# (CSharp) Lucene.Net.Analysis.De Namespace

Classes

Name Description
GermanAnalyzer Analyzer for the German language. Supports an external list of stopwords (words that will not be indexed at all) and an external list of exclusions (words that will not be stemmed, but will still be indexed). A default set of stopwords is used unless an alternative list is specified; the exclusion list is empty by default.
GermanAnalyzer.DefaultSetHolder
GermanDIN2Stemmer A stemmer for the German language that uses the DIN-5007-2 "Phone Book" rules for handling umlaut characters.
GermanLightStemFilter A TokenFilter that applies GermanLightStemmer to stem German words.

To prevent terms from being stemmed, place an instance of SetKeywordMarkerFilter, or a custom TokenFilter that sets the KeywordAttribute, before this TokenStream.

GermanLightStemFilterFactory Factory for GermanLightStemFilter.
 <fieldType name="text_delgtstem" class="solr.TextField" positionIncrementGap="100">
   <analyzer>
     <tokenizer class="solr.StandardTokenizerFactory"/>
     <filter class="solr.LowerCaseFilterFactory"/>
     <filter class="solr.GermanLightStemFilterFactory"/>
   </analyzer>
 </fieldType>
GermanLightStemmer Light Stemmer for German.

This stemmer implements the "UniNE" algorithm described in "Light Stemming Approaches for the French, Portuguese, German and Hungarian Languages" by Jacques Savoy.

GermanMinimalStemFilter A TokenFilter that applies GermanMinimalStemmer to stem German words.

To prevent terms from being stemmed, place an instance of SetKeywordMarkerFilter, or a custom TokenFilter that sets the KeywordAttribute, before this TokenStream.

GermanMinimalStemFilterFactory Factory for GermanMinimalStemFilter.
 <fieldType name="text_deminstem" class="solr.TextField" positionIncrementGap="100">
   <analyzer>
     <tokenizer class="solr.StandardTokenizerFactory"/>
     <filter class="solr.LowerCaseFilterFactory"/>
     <filter class="solr.GermanMinimalStemFilterFactory"/>
   </analyzer>
 </fieldType>
GermanMinimalStemmer Minimal Stemmer for German.

This stemmer implements the algorithm described in "Morphologie et recherche d'information" by Jacques Savoy.

GermanNormalizationFilterFactory Factory for GermanNormalizationFilter.
 <fieldType name="text_denorm" class="solr.TextField" positionIncrementGap="100">
   <analyzer>
     <tokenizer class="solr.StandardTokenizerFactory"/>
     <filter class="solr.LowerCaseFilterFactory"/>
     <filter class="solr.GermanNormalizationFilterFactory"/>
   </analyzer>
 </fieldType>
GermanStemmer A stemmer for German words.

The algorithm is based on the report "A Fast and Simple Stemming Algorithm for German Words" by Jörg Caumanns (joerg.caumanns at isst.fhg.de).

TestGermanAnalyzer
TestGermanLightStemFilter Simple tests for GermanLightStemFilter.
TestGermanLightStemFilter.AnalyzerAnonymousInnerClassHelper
TestGermanLightStemFilter.AnalyzerAnonymousInnerClassHelper2
TestGermanLightStemFilter.AnalyzerAnonymousInnerClassHelper3
TestGermanLightStemFilterFactory Simple tests to ensure the German light stem factory is working.
TestGermanMinimalStemFilter Simple tests for GermanMinimalStemFilter.
TestGermanMinimalStemFilter.AnalyzerAnonymousInnerClassHelper
TestGermanMinimalStemFilter.AnalyzerAnonymousInnerClassHelper2
TestGermanMinimalStemFilter.AnalyzerAnonymousInnerClassHelper3
TestGermanMinimalStemFilterFactory Simple tests to ensure the German minimal stem factory is working.
TestGermanNormalizationFilter Tests for GermanNormalizationFilter.
TestGermanNormalizationFilter.AnalyzerAnonymousInnerClassHelper
TestGermanNormalizationFilter.AnalyzerAnonymousInnerClassHelper2
TestGermanNormalizationFilterFactory Simple tests to ensure the German normalization factory is working.
TestGermanStemFilter Tests the German stemmer. The stemming algorithm is known to work less than perfectly, as it doesn't use any word lists with exceptions. We also check some of the cases where the algorithm is wrong.
TestGermanStemFilter.AnalyzerAnonymousInnerClassHelper
TestGermanStemFilter.AnalyzerAnonymousInnerClassHelper2
TestGermanStemFilter.AnalyzerAnonymousInnerClassHelper3
TestGermanStemFilterFactory Simple tests to ensure the German stem filter factory is working.
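
Example: the GermanAnalyzer entry above describes two external lists, one of stopwords and one of stem exclusions. The following is a minimal sketch of how those lists might be supplied, assuming the Lucene.NET 4.8.x API (the LuceneVersion.LUCENE_48 constant, the three-argument GermanAnalyzer constructor, and Analyzer.GetTokenStream). The class name GermanAnalyzerSketch, the helper Analyze, the field name "body", and the sample words are hypothetical and chosen only for illustration; other versions of the port may use different names.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.De;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class GermanAnalyzerSketch
{
    // Assumption: Lucene.NET 4.8.x is referenced (NuGet packages
    // Lucene.Net and Lucene.Net.Analysis.Common).
    private const LuceneVersion MatchVersion = LuceneVersion.LUCENE_48;

    // Hypothetical helper: runs the analyzer over the text and collects
    // the emitted terms in order.
    public static List<string> Analyze(Analyzer analyzer, string text)
    {
        var terms = new List<string>();
        using (TokenStream stream = analyzer.GetTokenStream("body", new StringReader(text)))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();
            stream.Reset();
            while (stream.IncrementToken())
            {
                terms.Add(term.ToString());
            }
            stream.End();
        }
        return terms;
    }

    public static void Main()
    {
        // Custom stopword list: these words are not indexed at all.
        var stopwords = new CharArraySet(MatchVersion, new[] { "der", "die", "das" }, true);

        // Exclusion list: these words are indexed but not stemmed.
        var exclusions = new CharArraySet(MatchVersion, new[] { "fischen" }, true);

        using (var analyzer = new GermanAnalyzer(MatchVersion, stopwords, exclusions))
        {
            // "die" is dropped as a stopword; "fischen" is kept unstemmed.
            Console.WriteLine(string.Join(" ", Analyze(analyzer, "Die Kinder fischen")));
        }
    }
}
```

Passing only the version to the GermanAnalyzer constructor should fall back to the default stopword set and an empty exclusion list, which matches the defaults described in the GermanAnalyzer entry.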