C# Class org.apache.lucene.analysis.ngram.NGramTokenizer

Inheritance: Tokenizer
Datei anzeigen Open project: paulirwin/lucene.net Class Usage Examples

Public Methods

Method Description
NGramTokenizer ( System.Version version, AttributeFactory factory, Reader input, int minGram, int maxGram ) : System

Creates NGramTokenizer with given min and max n-grams.

NGramTokenizer ( System.Version version, Reader input ) : System

Creates NGramTokenizer with default min and max n-grams.

NGramTokenizer ( System.Version version, Reader input, int minGram, int maxGram ) : System

Creates NGramTokenizer with given min and max n-grams.

end ( ) : void
incrementToken ( ) : bool
reset ( ) : void

Protected Methods

Method Description
isTokenChar ( int chr ) : bool

Only collect characters which satisfy this condition.

Private Methods

Method Description
NGramTokenizer ( System.Version version, AttributeFactory factory, Reader input, int minGram, int maxGram, bool edgesOnly ) : System
NGramTokenizer ( System.Version version, Reader input, int minGram, int maxGram, bool edgesOnly ) : System
consume ( ) : void

Consume one code point.

init ( System.Version version, int minGram, int maxGram, bool edgesOnly ) : void
updateLastNonTokenChar ( ) : void

Method Details

NGramTokenizer() public method

Creates NGramTokenizer with given min and max n-grams.
public NGramTokenizer ( System.Version version, AttributeFactory factory, Reader input, int minGram, int maxGram ) : System
version System.Version the lucene compatibility version
factory AttributeFactory to use
input Reader holding the input to be tokenized
minGram int the smallest n-gram to generate
maxGram int the largest n-gram to generate
return System

NGramTokenizer() public method

Creates NGramTokenizer with default min and max n-grams.
public NGramTokenizer ( System.Version version, Reader input ) : System
version System.Version the lucene compatibility version
input Reader holding the input to be tokenized
return System

NGramTokenizer() public method

Creates NGramTokenizer with given min and max n-grams.
public NGramTokenizer ( System.Version version, Reader input, int minGram, int maxGram ) : System
version System.Version the lucene compatibility version
input Reader holding the input to be tokenized
minGram int the smallest n-gram to generate
maxGram int the largest n-gram to generate
return System

end() public method

public end ( ) : void
return void

incrementToken() public method

public incrementToken ( ) : bool
return bool

isTokenChar() protected method

Only collect characters which satisfy this condition.
protected isTokenChar ( int chr ) : bool
chr int
return bool

reset() public method

public reset ( ) : void
return void