C# Class Lucene.Net.Analysis.Nl.DutchAnalyzer

Analyzer for the Dutch language. Supports an external list of stopwords (words that will not be indexed at all), an external list of exclusions (words that will be indexed but not stemmed), and an external list of word-stem pairs that override the algorithm (dictionary stemming). A default set of stopwords is used unless an alternative list is specified; the exclusion list is empty by default. The German analyzer served as the starting point for this analyzer. The stemming algorithm implemented can be found at [reference missing]. Version: $Id: DutchAnalyzer.java,v 1.1 2004/03/09 14:55:08 otis Exp $
Inheritance: Analyzer
Project: synhershko/lucene.net

Public Properties

Property Type Description
DUTCH_STOP_WORDS String[]

Public Methods

Method Description
DutchAnalyzer ( Lucene.Net.Util.Version matchVersion )
DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, FileInfo stopwords )
DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, HashSet stopwords )
DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, ISet stopwords )
DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, ISet stopwords, ISet stemExclusionTable )
ReusableTokenStream ( String fieldName, TextReader reader ) : TokenStream
SetStemDictionary ( FileInfo stemdictFile ) : void
SetStemExclusionTable ( ) : void
SetStemExclusionTable ( FileInfo exclusionlist ) : void
SetStemExclusionTable ( ISet exclusionlist ) : void
TokenStream ( String fieldName, TextReader reader ) : TokenStream
getDefaultStopSet ( ) : ISet

Method Details

DutchAnalyzer() public method

public DutchAnalyzer ( Lucene.Net.Util.Version matchVersion )
matchVersion Lucene.Net.Util.Version

DutchAnalyzer() public method

public DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, FileInfo stopwords )
matchVersion Lucene.Net.Util.Version
stopwords System.IO.FileInfo

DutchAnalyzer() public method

public DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, HashSet stopwords )
matchVersion Lucene.Net.Util.Version
stopwords HashSet

DutchAnalyzer() public method

public DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, ISet stopwords )
matchVersion Lucene.Net.Util.Version
stopwords ISet

DutchAnalyzer() public method

public DutchAnalyzer ( Lucene.Net.Util.Version matchVersion, ISet stopwords, ISet stemExclusionTable )
matchVersion Lucene.Net.Util.Version
stopwords ISet
stemExclusionTable ISet
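
The constructor overloads above could be used roughly as follows. This is a minimal sketch, not taken from the project: it assumes the bare `ISet` parameters are `ISet<string>`, that `Version.LUCENE_30` is an available compatibility constant, and the word lists are purely illustrative.

```csharp
using System;
using System.Collections.Generic;
using Lucene.Net.Analysis.Nl;
using Version = Lucene.Net.Util.Version;

class DutchAnalyzerSetup
{
    static void Main()
    {
        // Illustrative stop words: terms in this set are not indexed at all.
        ISet<string> stopwords = new HashSet<string> { "de", "het", "een" };

        // Illustrative exclusions: terms in this set are indexed but not stemmed.
        ISet<string> stemExclusions = new HashSet<string> { "fiets" };

        // Assumption: the ISet parameters accept ISet<string>.
        var analyzer = new DutchAnalyzer(Version.LUCENE_30, stopwords, stemExclusions);

        Console.WriteLine(analyzer.GetType().FullName);
    }
}
```

Passing both sets at construction time avoids a later call to SetStemExclusionTable on an analyzer that may already be in use.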

ReusableTokenStream() public method

public ReusableTokenStream ( String fieldName, TextReader reader ) : TokenStream
fieldName String
reader System.IO.TextReader
return TokenStream

SetStemDictionary() public method

public SetStemDictionary ( FileInfo stemdictFile ) : void
stemdictFile System.IO.FileInfo
return void

SetStemExclusionTable() public method

public SetStemExclusionTable ( ) : void
return void

SetStemExclusionTable() public method

public SetStemExclusionTable ( FileInfo exclusionlist ) : void
exclusionlist System.IO.FileInfo
return void

SetStemExclusionTable() public method

public SetStemExclusionTable ( ISet exclusionlist ) : void
exclusionlist ISet
return void

TokenStream() public method

public TokenStream ( String fieldName, TextReader reader ) : TokenStream
fieldName String
reader System.IO.TextReader
return TokenStream
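
A token stream returned by this method might be consumed as sketched below. The example assumes the Lucene.Net 3.0.x attribute API (`ITermAttribute` with a `Term` property) and the `Version.LUCENE_30` constant; the field name "body" and the sample sentence are illustrative only.

```csharp
using System;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Nl;
using Lucene.Net.Analysis.Tokenattributes;
using Version = Lucene.Net.Util.Version;

class DutchTokenDemo
{
    static void Main()
    {
        var analyzer = new DutchAnalyzer(Version.LUCENE_30);

        // "body" is a hypothetical field name.
        TokenStream stream = analyzer.TokenStream(
            "body", new StringReader("De fietsen staan in de schuur"));

        // The term attribute exposes the text of the current token.
        ITermAttribute term = stream.AddAttribute<ITermAttribute>();

        // Each call advances to the next token that survived stop-word
        // filtering; the printed terms are the stemmed forms.
        while (stream.IncrementToken())
        {
            Console.WriteLine(term.Term);
        }
    }
}
```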

getDefaultStopSet() public static method

public static getDefaultStopSet ( ) : ISet
return ISet
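
One plausible use of the default stop set is as a base for a custom list. The sketch below assumes the returned `ISet` is an `ISet<string>` and copies it before adding entries, in case the returned set is read-only; the added word and the `Version.LUCENE_30` constant are assumptions for illustration.

```csharp
using System;
using System.Collections.Generic;
using Lucene.Net.Analysis.Nl;
using Version = Lucene.Net.Util.Version;

class DutchStopSetDemo
{
    static void Main()
    {
        // Assumption: the default stop set is exposed as ISet<string>.
        ISet<string> defaults = DutchAnalyzer.getDefaultStopSet();
        Console.WriteLine("Default stop words: " + defaults.Count);

        // Copy the defaults, then extend the copy with a domain-specific word.
        var custom = new HashSet<string>(defaults) { "voorbeeld" };

        var analyzer = new DutchAnalyzer(Version.LUCENE_30, custom);
        Console.WriteLine(analyzer.GetType().FullName);
    }
}
```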

Property Details

DUTCH_STOP_WORDS public static property

public static String[] DUTCH_STOP_WORDS
return String[]