C# Class Lucene.Net.Analysis.Th.ThaiTokenizer

Tokenizer that use BreakIterator to tokenize Thai text.

WARNING: this tokenizer may not be supported by all JREs. It is known to work with Sun/Oracle and Harmony JREs. If your application needs to be fully portable, consider using ICUTokenizer instead, which uses an ICU Thai BreakIterator that will always be available.

Inheritance: Lucene.Net.Analysis.Util.SegmentingTokenizerBase
Afficher le fichier Open project: apache/lucenenet Class Usage Examples

Méthodes publiques

Свойство Type Description
DBBI_AVAILABLE bool

Méthodes publiques

Méthode Description
ThaiTokenizer ( AttributeFactory factory, TextReader reader ) : ICU4NET

Creates a new ThaiTokenizer, supplying the AttributeFactory

ThaiTokenizer ( TextReader reader ) : ICU4NET

Creates a new ThaiTokenizer

Méthodes protégées

Méthode Description
IncrementWord ( ) : bool
SetNextSentence ( int sentenceStart, int sentenceEnd ) : void

Private Methods

Méthode Description
ThaiTokenizer ( ) : ICU4NET

Method Details

IncrementWord() protected méthode

protected IncrementWord ( ) : bool
Résultat bool

SetNextSentence() protected méthode

protected SetNextSentence ( int sentenceStart, int sentenceEnd ) : void
sentenceStart int
sentenceEnd int
Résultat void

ThaiTokenizer() public méthode

Creates a new ThaiTokenizer, supplying the AttributeFactory
public ThaiTokenizer ( AttributeFactory factory, TextReader reader ) : ICU4NET
factory AttributeFactory
reader System.IO.TextReader
Résultat ICU4NET

ThaiTokenizer() public méthode

Creates a new ThaiTokenizer
public ThaiTokenizer ( TextReader reader ) : ICU4NET
reader System.IO.TextReader
Résultat ICU4NET

Property Details

DBBI_AVAILABLE public_oe static_oe property

True if the JRE supports a working dictionary-based breakiterator for Thai. If this is false, this tokenizer will not work at all!
public static bool DBBI_AVAILABLE
Résultat bool