C# Class Lucene.Net.Analysis.Compound.HyphenationCompoundWordTokenFilter

A TokenFilter that decomposes compound words found in many Germanic languages.

For example, "Donaudampfschiff" is split into Donau, dampf, and schiff, so that a search for "schiff" alone still finds "Donaudampfschiff". The filter uses a hyphenation grammar and a word dictionary to achieve this.
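The idea of combining break points with a dictionary can be illustrated with a small standalone sketch. This is a simplified illustration, not the library's implementation: the break points are hard-coded, hypothetical values standing in for what a hyphenation grammar might produce, and the class and method names are made up for this example.

```csharp
using System;
using System.Collections.Generic;

public static class DecompositionSketch
{
    // Returns every dictionary word that lies between two candidate break points.
    // breakPoints must be sorted and should include 0 and word.Length.
    public static List<string> Decompose(string word, int[] breakPoints, ISet<string> dictionary)
    {
        var parts = new List<string>();
        for (int i = 0; i < breakPoints.Length; i++)
        {
            for (int j = i + 1; j < breakPoints.Length; j++)
            {
                int start = breakPoints[i];
                int length = breakPoints[j] - start;
                string candidate = word.Substring(start, length);
                if (dictionary.Contains(candidate))
                {
                    parts.Add(candidate);
                }
            }
        }
        return parts;
    }

    public static void Main()
    {
        // Illustrative dictionary; matching ignores case.
        var dictionary = new HashSet<string>(StringComparer.OrdinalIgnoreCase)
        {
            "Donau", "dampf", "schiff"
        };

        // Hypothetical break points (assumed, not computed from a real grammar).
        int[] breakPoints = { 0, 2, 5, 10, 16 };

        List<string> parts = Decompose("Donaudampfschiff", breakPoints, dictionary);
        Console.WriteLine(string.Join(", ", parts)); // prints "Donau, dampf, schiff"
    }
}
```

The spurious break point at index 2 produces candidates such as "Do" and "nau", which are discarded because they are not in the dictionary. This is the reason for pairing a hyphenation grammar with a word list.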

You must specify the required LuceneVersion compatibility when creating a CompoundWordTokenFilterBase:

As of 3.1, CompoundWordTokenFilterBase correctly handles Unicode 4.0 supplementary characters in strings and char arrays provided as compound word dictionaries.

Inheritance: CompoundWordTokenFilterBase


Public Methods

Method	Description
GetHyphenationTree ( FileInfo hyphenationFile ) : HyphenationTree	Create a hyphenator tree
GetHyphenationTree ( FileInfo hyphenationFile, Encoding encoding ) : HyphenationTree	Create a hyphenator tree
GetHyphenationTree ( Stream hyphenationSource ) : HyphenationTree	Create a hyphenator tree
GetHyphenationTree ( Stream hyphenationSource, Encoding encoding ) : HyphenationTree	Create a hyphenator tree
GetHyphenationTree ( string hyphenationFilename ) : HyphenationTree	Create a hyphenator tree
GetHyphenationTree ( string hyphenationFilename, Encoding encoding ) : HyphenationTree	Create a hyphenator tree
HyphenationCompoundWordTokenFilter ( LuceneVersion matchVersion, TokenStream input, HyphenationTree hyphenator ) : Lucene.Net.Analysis.Compound.Hyphenation	Creates a HyphenationCompoundWordTokenFilter with no dictionary. Calls HyphenationCompoundWordTokenFilter(matchVersion, input, hyphenator, DEFAULT_MIN_WORD_SIZE, DEFAULT_MIN_SUBWORD_SIZE, DEFAULT_MAX_SUBWORD_SIZE).
HyphenationCompoundWordTokenFilter ( LuceneVersion matchVersion, TokenStream input, HyphenationTree hyphenator, CharArraySet dictionary ) : Lucene.Net.Analysis.Compound.Hyphenation	Creates a new HyphenationCompoundWordTokenFilter instance.
HyphenationCompoundWordTokenFilter ( LuceneVersion matchVersion, TokenStream input, HyphenationTree hyphenator, CharArraySet dictionary, int minWordSize, int minSubwordSize, int maxSubwordSize, bool onlyLongestMatch ) : Lucene.Net.Analysis.Compound.Hyphenation	Creates a new HyphenationCompoundWordTokenFilter instance.
HyphenationCompoundWordTokenFilter ( LuceneVersion matchVersion, TokenStream input, HyphenationTree hyphenator, int minWordSize, int minSubwordSize, int maxSubwordSize ) : Lucene.Net.Analysis.Compound.Hyphenation	Creates a HyphenationCompoundWordTokenFilter with no dictionary. Delegates to the constructor that takes a dictionary and onlyLongestMatch, passing null for the dictionary along with minWordSize, minSubwordSize, and maxSubwordSize.
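
The methods listed above might be combined roughly as follows. This is a minimal sketch, assuming Lucene.Net 4.8 with the Lucene.Net.Analysis.Common package; the class name, the hyphenationXmlPath parameter, the dictionary words, and the choice of WhitespaceTokenizer are assumptions made for illustration, and you need to supply your own hyphenation grammar file.

```csharp
using System;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Compound;
using Lucene.Net.Analysis.Compound.Hyphenation;
using Lucene.Net.Analysis.Core;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class HyphenationFilterSketch
{
    // hyphenationXmlPath: hypothetical path to a hyphenation grammar file.
    public static void PrintTokens(string hyphenationXmlPath, string text)
    {
        const LuceneVersion version = LuceneVersion.LUCENE_48;

        // Load the hyphenation grammar via the string overload of GetHyphenationTree.
        HyphenationTree hyphenator =
            HyphenationCompoundWordTokenFilter.GetHyphenationTree(hyphenationXmlPath);

        // Illustrative word dictionary; the final argument enables case-insensitive lookup.
        var dictionary = new CharArraySet(
            version, new[] { "Donau", "dampf", "schiff" }, true);

        // Any TokenStream can feed the filter; a whitespace tokenizer is assumed here.
        TokenStream tokenizer = new WhitespaceTokenizer(version, new StringReader(text));

        // Disposing the filter also closes the wrapped tokenizer.
        using (TokenStream filter = new HyphenationCompoundWordTokenFilter(
            version, tokenizer, hyphenator, dictionary))
        {
            ICharTermAttribute term = filter.AddAttribute<ICharTermAttribute>();

            filter.Reset();
            while (filter.IncrementToken())
            {
                // Writes each token emitted by the filter on its own line.
                Console.WriteLine(term.ToString());
            }
            filter.End();
        }
    }
}
```

A call such as HyphenationFilterSketch.PrintTokens("path/to/grammar.xml", "Donaudampfschiff") would print whatever tokens the filter emits for that input; the exact result depends on the grammar file and the dictionary.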

Protected Methods

Method	Description
Decompose ( ) : void

Method Details