Name | Description |
---|---|
CzechAnalyzer | Analyzer for Czech language. Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified, the exclusion list is empty by default. @author Lukas Zapletal [[email protected]] @version $Id: CzechAnalyzer.java,v 1.2 2003/01/22 20:54:47 ehatcher Exp $ |
CzechAnalyzer.DefaultSetHolder | |
CzechAnalyzer.SavedStreams | |
CzechStemFilterFactory | Factory for CzechStemFilter. <fieldType name="text_czstem" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.CzechStemFilterFactory"/> </analyzer> </fieldType> |
CzechStemmer | Light Stemmer for Czech. Implements the algorithm described in: Indexing and stemming approaches for the Czech language http://portal.acm.org/citation.cfm?id=1598600 |
TestCzechAnalyzer | Test the CzechAnalyzer Before Lucene 3.1, CzechAnalyzer was a StandardAnalyzer with a custom stopword list. As of 3.1 it also includes a stemmer. |
TestCzechStemFilterFactory | Simple tests to ensure the Czech stem filter factory is working. |
TestCzechStemmer | Test the Czech Stemmer. Note: its algorithmic, so some stems are nonsense |
TestCzechStemmer.AnalyzerAnonymousInnerClassHelper |