DutchAnalyzer |
Analyzer for the Dutch language. Supports an external list of stopwords (words that will not be indexed at all), an external list of exclusions (words that will be indexed but not stemmed), and an external list of word-stem pairs that override the stemming algorithm (dictionary stemming). A default set of stopwords is used unless an alternative list is specified; the exclusion list is empty by default. The German Analyzer was used as the starting point for this analyzer. The stemming algorithm implemented can be found at $Id: DutchAnalyzer.java,v 1.1 2004/03/09 14:55:08 otis Exp $ |
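
The interplay of the three lists described above can be sketched in plain Java. This is an illustrative sketch only, not the DutchAnalyzer source: the class `DutchAnalysisSketch`, the methods `analyze` and `toyStem`, and the rule that an exclusion wins over a dictionary entry are all assumptions made for the example, and `toyStem` is a deliberately crude placeholder rather than the real Dutch stemming algorithm.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch; none of these names come from the analyzer itself.
public class DutchAnalysisSketch {

    // Placeholder stemmer: strips a trailing "en" from longer words.
    // This is NOT the real Dutch stemming algorithm.
    public static String toyStem(String word) {
        if (word.length() > 4 && word.endsWith("en")) {
            return word.substring(0, word.length() - 2);
        }
        return word;
    }

    // Applies the three lists in the order assumed for this sketch:
    // 1. stopwords are dropped and never indexed,
    // 2. exclusions are indexed unchanged (no stemming),
    // 3. dictionary entries override the algorithmic stemmer,
    // 4. everything else goes through the stemmer.
    public static List<String> analyze(String text,
                                       Set<String> stopwords,
                                       Set<String> exclusions,
                                       Map<String, String> stemDictionary) {
        List<String> terms = new ArrayList<>();
        // Lowercase, then split on any run of non-letter characters.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (token.isEmpty() || stopwords.contains(token)) {
                continue; // stopword: not indexed at all
            }
            if (exclusions.contains(token)) {
                terms.add(token); // excluded from stemming, still indexed
            } else if (stemDictionary.containsKey(token)) {
                terms.add(stemDictionary.get(token)); // dictionary stemming
            } else {
                terms.add(toyStem(token)); // algorithmic stemming
            }
        }
        return terms;
    }
}
```

With stopwords `de` and `en`, the exclusion `boeken`, and the dictionary pair `fietsen` → `fiets`, calling `analyze` on "De fietsen en de boeken" drops both stopwords, maps `fietsen` through the dictionary, and keeps `boeken` unstemmed.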