C# 클래스 Lucene.Net.Analysis.Core.LowerCaseTokenizer

LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. It divides text at non-letters and converts them to lower case. While it is functionally equivalent to the combination of LetterTokenizer and LowerCaseFilter, there is a performance advantage to doing the two tasks at once, hence this (redundant) implementation.

Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces.

You must specify the required Version compatibility when creating LowerCaseTokenizer:

파일 보기 프로젝트 열기: paulirwin/lucene.net 1 사용 예제들

공개 메소드들

메소드 설명
LowerCaseTokenizer ( Lucene.Net.Util.Version matchVersion, AttributeFactory factory, TextReader @in ) : System.IO

Construct a new LowerCaseTokenizer using a given org.apache.lucene.util.AttributeSource.AttributeFactory.

LowerCaseTokenizer ( Lucene.Net.Util.Version matchVersion, TextReader @in ) : System.IO

Construct a new LowerCaseTokenizer.

보호된 메소드들

메소드 설명
Normalize ( int c ) : int

Converts char to lower case Character#toLowerCase(int).

메소드 상세

LowerCaseTokenizer() 공개 메소드

Construct a new LowerCaseTokenizer using a given org.apache.lucene.util.AttributeSource.AttributeFactory.
public LowerCaseTokenizer ( Lucene.Net.Util.Version matchVersion, AttributeFactory factory, TextReader @in ) : System.IO
matchVersion Lucene.Net.Util.Version /// Lucene version to match See above"/>
factory AttributeFactory /// the attribute factory to use for this
@in System.IO.TextReader
리턴 System.IO

LowerCaseTokenizer() 공개 메소드

Construct a new LowerCaseTokenizer.
public LowerCaseTokenizer ( Lucene.Net.Util.Version matchVersion, TextReader @in ) : System.IO
matchVersion Lucene.Net.Util.Version /// Lucene version to match See above"/> ///
@in System.IO.TextReader
리턴 System.IO

Normalize() 보호된 메소드

Converts char to lower case Character#toLowerCase(int).
protected Normalize ( int c ) : int
c int
리턴 int