C# Class Lucene.Net.Analysis.Cjk.CJKWidthFilter

A TokenFilter that normalizes CJK width differences:
  • Folds fullwidth ASCII variants into the equivalent basic latin
  • Folds halfwidth Katakana variants into the equivalent kana

NOTE: this filter can be viewed as a (practical) subset of NFKC/NFKD Unicode normalization. See the normalization support in the ICU package for full normalization.

Inheritance: TokenFilter
Exibir arquivo Open project: apache/lucenenet Class Usage Examples

Public Methods

Method Description
CJKWidthFilter ( TokenStream input ) : Lucene.Net.Analysis.Tokenattributes
IncrementToken ( ) : bool

Private Methods

Method Description
Combine ( char text, int pos, char ch ) : bool

returns true if we successfully combined the voice mark

Method Details

CJKWidthFilter() public method

public CJKWidthFilter ( TokenStream input ) : Lucene.Net.Analysis.Tokenattributes
input TokenStream
return Lucene.Net.Analysis.Tokenattributes

IncrementToken() public method

public IncrementToken ( ) : bool
return bool