C# Class Lucene.Net.Analysis.Miscellaneous.CodepointCountFilter

Removes words that are too long or too short from the stream.

Note: Length is calculated as the number of Unicode codepoints.
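To illustrate why the note matters, here is a minimal sketch using only the .NET base class library, with no Lucene.NET types. The helper name CountCodePoints is hypothetical and is not part of the library. It shows that a character outside the Basic Multilingual Plane takes two UTF-16 chars but counts as a single code point.

```csharp
using System;

public static class CodePointDemo
{
    // Hypothetical helper (not a Lucene.NET API): counts Unicode code points
    // in a UTF-16 string. A valid surrogate pair counts as one code point.
    public static int CountCodePoints(string text)
    {
        int count = 0;
        for (int i = 0; i < text.Length; i++)
        {
            // Skip the low half of a valid surrogate pair so the pair counts once.
            if (char.IsHighSurrogate(text[i])
                && i + 1 < text.Length
                && char.IsLowSurrogate(text[i + 1]))
            {
                i++;
            }
            count++;
        }
        return count;
    }

    public static void Main()
    {
        string plain = "lucene";
        string withEmoji = "ok\U0001F600"; // U+1F600 needs two UTF-16 chars

        // prints "6 chars, 6 code points"
        Console.WriteLine($"{plain.Length} chars, {CountCodePoints(plain)} code points");
        // prints "4 chars, 3 code points"
        Console.WriteLine($"{withEmoji.Length} chars, {CountCodePoints(withEmoji)} code points");
    }
}
```

A filter that measured string.Length instead would treat "ok" followed by an emoji as a four-character token, which is the difference the note above calls out.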

Inheritance: Lucene.Net.Analysis.Util.FilteringTokenFilter
Project: apache/lucenenet

Public Methods

Method Description
CodepointCountFilter ( LuceneVersion version, TokenStream @in, int min, int max )

Create a new CodepointCountFilter. This will filter out tokens whose CharTermAttribute is either too short (Character.CodePointCount(char[], int, int) < min) or too long (Character.CodePointCount(char[], int, int) > max). Both bounds are inclusive: a token with exactly min or exactly max code points is kept.

Protected Methods

Method Description
Accept ( ) : bool

Method Details

Accept() protected method

protected Accept ( ) : bool
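return bool true to keep the current token (its code point count lies between min and max, inclusive); false to drop it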
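The keep-or-drop decision can be sketched as a standalone predicate. This is an illustration under stated assumptions, not the library's actual implementation: the name IsWithinRange is hypothetical, and the sketch works on a plain string rather than on the filter's term attribute.

```csharp
using System;

public static class AcceptSketch
{
    // Hypothetical stand-in for the filter's decision: true when the term's
    // code point count lies in [min, max], both bounds inclusive.
    public static bool IsWithinRange(string term, int min, int max)
    {
        int codePoints = 0;
        for (int i = 0; i < term.Length; i++)
        {
            // A valid surrogate pair is one code point, so skip its low half.
            if (char.IsHighSurrogate(term[i])
                && i + 1 < term.Length
                && char.IsLowSurrogate(term[i + 1]))
            {
                i++;
            }
            codePoints++;
        }
        return codePoints >= min && codePoints <= max;
    }

    public static void Main()
    {
        // With min = 2 and max = 4: "a" and "abcde" are dropped, the rest kept.
        foreach (string term in new[] { "a", "ab", "abcd", "abcde" })
        {
            string verdict = IsWithinRange(term, 2, 4) ? "kept" : "dropped";
            Console.WriteLine($"{term}: {verdict}");
        }
    }
}
```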

CodepointCountFilter() public method

Create a new CodepointCountFilter. This will filter out tokens whose CharTermAttribute is either too short (Character.CodePointCount(char[], int, int) < min) or too long (Character.CodePointCount(char[], int, int) > max). Both bounds are inclusive: a token with exactly min or exactly max code points is kept.
public CodepointCountFilter ( LuceneVersion version, TokenStream @in, int min, int max )
version LuceneVersion the Lucene match version
@in TokenStream the TokenStream to consume
min int the minimum length, in Unicode code points (inclusive)
max int the maximum length, in Unicode code points (inclusive)
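A possible way to wire the constructor into an analysis chain is sketched below. It assumes the Lucene.NET 4.8 packages (Lucene.Net and Lucene.Net.Analysis.Common), the LuceneVersion.LUCENE_48 match version, a WhitespaceTokenizer as the upstream source, and the Lucene.Net.Analysis.TokenAttributes namespace for ICharTermAttribute; older builds may spell that namespace differently. The sample text and the bounds 3 and 5 are arbitrary.

```csharp
using System;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Core;
using Lucene.Net.Analysis.Miscellaneous;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

public static class CodepointCountFilterUsage
{
    public static void Main()
    {
        // Assumption: targeting the 4.8 match version.
        const LuceneVersion version = LuceneVersion.LUCENE_48;

        using (TextReader reader = new StringReader("a quick brown fox jumped"))
        {
            // Split on whitespace, then keep only tokens of 3 to 5 code points.
            Tokenizer tokenizer = new WhitespaceTokenizer(version, reader);
            using (TokenStream stream = new CodepointCountFilter(version, tokenizer, 3, 5))
            {
                ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();

                stream.Reset();                  // required before the first IncrementToken()
                while (stream.IncrementToken())
                {
                    Console.WriteLine(term.ToString());
                }
                stream.End();                    // signal end of stream before disposal
            }
        }
    }
}
```

With these bounds, "a" (1 code point) and "jumped" (6 code points) fall outside the range, while "quick", "brown", and "fox" fall inside it.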