C# (CSharp) Lucene.Net.Analysis.Cz Namespace

Classes

Name Description
CzechAnalyzer Analyzer for Czech language. Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified, the exclusion list is empty by default. @author Lukas Zapletal [[email protected]] @version $Id: CzechAnalyzer.java,v 1.2 2003/01/22 20:54:47 ehatcher Exp $
CzechAnalyzer.DefaultSetHolder
CzechAnalyzer.SavedStreams
CzechStemFilterFactory Factory for CzechStemFilter.
 <fieldType name="text_czstem" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.CzechStemFilterFactory"/> </analyzer> </fieldType>
CzechStemmer Light Stemmer for Czech.

Implements the algorithm described in: Indexing and stemming approaches for the Czech language http://portal.acm.org/citation.cfm?id=1598600

TestCzechAnalyzer Test the CzechAnalyzer Before Lucene 3.1, CzechAnalyzer was a StandardAnalyzer with a custom stopword list. As of 3.1 it also includes a stemmer.
TestCzechStemFilterFactory Simple tests to ensure the Czech stem filter factory is working.
TestCzechStemmer Test the Czech Stemmer. Note: its algorithmic, so some stems are nonsense
TestCzechStemmer.AnalyzerAnonymousInnerClassHelper