C# Class Lucene.Net.Analysis.Cjk.CJKWidthFilter

A TokenFilter that normalizes CJK width differences:
  • Folds fullwidth ASCII variants into the equivalent Basic Latin characters
  • Folds halfwidth Katakana variants into the equivalent kana
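The first fold in the list above can be illustrated without Lucene.Net. The following is a minimal sketch, not the filter's actual implementation: `WidthFoldSketch` and `FoldFullwidthAscii` are hypothetical names introduced here. It relies on the fact that the fullwidth ASCII variants occupy U+FF01 through U+FF5E and sit exactly 0xFEE0 above their Basic Latin counterparts. Halfwidth Katakana has no such constant offset and needs a lookup table, so it is left out of this sketch.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration; not part of Lucene.Net.
public static class WidthFoldSketch
{
    public static string FoldFullwidthAscii(string s)
    {
        var sb = new StringBuilder(s.Length);
        foreach (char ch in s)
        {
            // U+FF01..U+FF5E are the fullwidth forms of U+0021..U+007E.
            if (ch >= '\uFF01' && ch <= '\uFF5E')
                sb.Append((char)(ch - 0xFEE0));
            else
                sb.Append(ch); // everything else passes through unchanged
        }
        return sb.ToString();
    }
}
```

For example, `WidthFoldSketch.FoldFullwidthAscii("\uFF21\uFF22\uFF23")` maps the three fullwidth letters to "ABC".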

NOTE: this filter can be viewed as a (practical) subset of NFKC/NFKD Unicode normalization. See the normalization support in the ICU package for full normalization.
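To see how the width folds relate to full NFKC, the sketch below uses the standard .NET `string.Normalize` method rather than the ICU package mentioned in the note; that substitution is an assumption made to keep the example dependency-free. `NfkcComparison` is a hypothetical name.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration; uses .NET's built-in normalization,
// not Lucene.Net or ICU.
public static class NfkcComparison
{
    public static string Nfkc(string s)
    {
        // FormKC = compatibility decomposition followed by canonical composition.
        return s.Normalize(NormalizationForm.FormKC);
    }
}
```

`NfkcComparison.Nfkc("\uFF21\uFF22\uFF23")` gives "ABC" and `NfkcComparison.Nfkc("\uFF76\uFF9E")` (halfwidth KA plus halfwidth voiced mark) gives "\u30AC" (GA), which are the two kinds of fold this filter performs. NFKC also rewrites characters outside those two folds, for instance `NfkcComparison.Nfkc("\u2460")` (circled digit one) gives "1", which is why the filter is described as a subset.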

Inheritance: TokenFilter
Project: apache/lucenenet

Public Methods

Method Description
CJKWidthFilter ( TokenStream input )
IncrementToken ( ) : bool

Private Methods

Method Description
Combine ( char[] text, int pos, char ch ) : bool

Returns true if the voice mark was successfully combined.
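The idea behind combining a voice mark can be sketched as follows. This is an assumption-laden illustration, not the library's code: the class name `VoiceMarkSketch`, the tiny lookup tables, and the interpretation of `pos` (the index just after the kana to be modified) are all introduced here for the example. It handles the halfwidth voiced mark U+FF9E and the halfwidth semi-voiced mark U+FF9F for a handful of kana only.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical sketch; the tables cover only a few kana for illustration.
public static class VoiceMarkSketch
{
    private static readonly Dictionary<char, char> Voiced = new Dictionary<char, char>
    {
        { '\u30AB', '\u30AC' }, // KA -> GA
        { '\u30B5', '\u30B6' }, // SA -> ZA
        { '\u30CF', '\u30D0' }, // HA -> BA
    };

    private static readonly Dictionary<char, char> HalfVoiced = new Dictionary<char, char>
    {
        { '\u30CF', '\u30D1' }, // HA -> PA
    };

    // Tries to merge the mark `ch` into the character before `pos`.
    // Returns true if text[pos - 1] was replaced by its combined form.
    public static bool Combine(char[] text, int pos, char ch)
    {
        if (pos <= 0 || pos > text.Length) return false;

        Dictionary<char, char> table =
            ch == '\uFF9E' ? Voiced :
            ch == '\uFF9F' ? HalfVoiced : null;

        if (table != null && table.TryGetValue(text[pos - 1], out char combined))
        {
            text[pos - 1] = combined;
            return true;
        }
        return false; // no combined form known; caller keeps the mark as-is
    }
}
```

A caller would typically drop the mark from the output when the method returns true and keep it otherwise.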

Method Details

CJKWidthFilter() public method

public CJKWidthFilter ( TokenStream input )
input TokenStream

IncrementToken() public method

public IncrementToken ( ) : bool
Returns bool
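
A typical way to use the two public members together is to wrap a tokenizer in the filter and pull tokens with IncrementToken(). The sketch below assumes the Lucene.Net 4.8 packages (Lucene.Net and Lucene.Net.Analysis.Common); `WhitespaceTokenizer`, `LuceneVersion.LUCENE_48`, and the `Lucene.Net.Analysis.TokenAttributes` namespace casing are taken from that release line and may differ in other versions. `CjkWidthDemo` and `Fold` are hypothetical names.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Cjk;
using Lucene.Net.Analysis.Core;
using Lucene.Net.Analysis.TokenAttributes; // casing assumed for 4.8; adjust to your version
using Lucene.Net.Util;

// Hypothetical demo class; requires the Lucene.Net 4.8 NuGet packages.
public static class CjkWidthDemo
{
    public static List<string> Fold(string text)
    {
        var terms = new List<string>();

        // Split on whitespace, then normalize width differences per token.
        Tokenizer source = new WhitespaceTokenizer(LuceneVersion.LUCENE_48, new StringReader(text));
        using (TokenStream stream = new CJKWidthFilter(source))
        {
            ICharTermAttribute termAtt = stream.AddAttribute<ICharTermAttribute>();
            stream.Reset();                  // required before the first IncrementToken()
            while (stream.IncrementToken())  // false once the stream is exhausted
            {
                terms.Add(termAtt.ToString());
            }
            stream.End();
        }
        return terms;
    }
}
```

Given the input "\uFF21\uFF22\uFF23 \uFF76\uFF9E" (fullwidth "ABC", a space, then halfwidth KA with a voiced mark), this should yield two terms, "ABC" and "\u30AC".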