C# 클래스 Lucene.Net.Analysis.Standard.StandardTokenizerImpl

This class is a scanner generated by JFlex 1.4.1 on 12/18/07 9:22 PM from the specification file /Volumes/User/grantingersoll/projects/lucene/java/lucene-clean/src/java/org/apache/lucene/analysis/standard/StandardTokenizerImpl.jflex
파일 보기 프로젝트 열기: apache/lucenenet 1 사용 예제들

공개 프로퍼티들

프로퍼티 타입 설명
HANGUL_TYPE int
HIRAGANA_TYPE int
IDEOGRAPHIC_TYPE int
KATAKANA_TYPE int
NUMERIC_TYPE int
SOUTH_EAST_ASIAN_TYPE int
WORD_TYPE int
YYEOF int

공개 메소드들

메소드 설명
GetNextToken ( ) : int

Resumes scanning until the next regular expression is matched, the end of input is encountered or an I/O-Error occurs.

GetText ( ICharTermAttribute t ) : void

Fills CharTermAttribute with the current token text.

StandardTokenizerImpl ( TextReader @in ) : Lucene.Net.Analysis.Tokenattributes

Creates a new scanner

YyBegin ( int newState ) : void

Enters a new lexical state

YyCharAt ( int pos ) : char

Returns the character at position pos from the matched text. It is equivalent to YyText().charAt(pos), but faster

YyClose ( ) : void

Closes the input stream.

YyPushBack ( int number ) : void

Pushes the specified amount of characters back into the input stream. They will be read again by then next call of the scanning method

YyReset ( TextReader reader ) : void

Resets the scanner to read from a new input stream. Does not close the old reader. All internal variables are reset, the old input stream cannot be reused (internal buffer is discarded and lost). Lexical state is set to ZZ_INITIAL. Internal scan buffer is resized down to its initial length, if it has grown.

비공개 메소드들

메소드 설명
ZzRefill ( ) : bool

Refills the input buffer.

ZzScanError ( int errorCode ) : void

Reports an error that occured while scanning. In a wellformed scanner (no or only correct usage of YyPushBack(int) and a match-all fallback rule) this method will only be called with things that "Can't Possibly Happen". If this method is called, something is seriously wrong (e.g. a JFlex bug producing a faulty scanner etc.). Usual syntax/scanner level error handling should be done in error fallback rules.

ZzUnpackAction ( string packed, int offset, int result ) : int
ZzUnpackAction ( ) : int[]
ZzUnpackAttribute ( string packed, int offset, int result ) : int
ZzUnpackAttribute ( ) : int[]
ZzUnpackCMap ( string packed ) : char[]

Unpacks the compressed character translation table.

ZzUnpackRowMap ( string packed, int offset, int result ) : int
ZzUnpackRowMap ( ) : int[]
ZzUnpackTrans ( string packed, int offset, int result ) : int
ZzUnpackTrans ( ) : int[]

메소드 상세

GetNextToken() 공개 메소드

Resumes scanning until the next regular expression is matched, the end of input is encountered or an I/O-Error occurs.
if any I/O-Error occurs
public GetNextToken ( ) : int
리턴 int

GetText() 공개 메소드

Fills CharTermAttribute with the current token text.
public GetText ( ICharTermAttribute t ) : void
t ICharTermAttribute
리턴 void

StandardTokenizerImpl() 공개 메소드

Creates a new scanner
public StandardTokenizerImpl ( TextReader @in ) : Lucene.Net.Analysis.Tokenattributes
@in TextReader
리턴 Lucene.Net.Analysis.Tokenattributes

YyBegin() 공개 메소드

Enters a new lexical state
public YyBegin ( int newState ) : void
newState int the new lexical state
리턴 void

YyCharAt() 공개 메소드

Returns the character at position pos from the matched text. It is equivalent to YyText().charAt(pos), but faster
public YyCharAt ( int pos ) : char
pos int the position of the character to fetch. /// A value from 0 to YyLength()-1. ///
리턴 char

YyClose() 공개 메소드

Closes the input stream.
public YyClose ( ) : void
리턴 void

YyPushBack() 공개 메소드

Pushes the specified amount of characters back into the input stream. They will be read again by then next call of the scanning method
public YyPushBack ( int number ) : void
number int the number of characters to be read again. /// This number must not be greater than YyLength()!
리턴 void

YyReset() 공개 메소드

Resets the scanner to read from a new input stream. Does not close the old reader. All internal variables are reset, the old input stream cannot be reused (internal buffer is discarded and lost). Lexical state is set to ZZ_INITIAL. Internal scan buffer is resized down to its initial length, if it has grown.
public YyReset ( TextReader reader ) : void
reader TextReader the new input stream
리턴 void

프로퍼티 상세

HANGUL_TYPE 공개적으로 정적으로 프로퍼티

public static int HANGUL_TYPE
리턴 int

HIRAGANA_TYPE 공개적으로 정적으로 프로퍼티

public static int HIRAGANA_TYPE
리턴 int

IDEOGRAPHIC_TYPE 공개적으로 정적으로 프로퍼티

public static int IDEOGRAPHIC_TYPE
리턴 int

KATAKANA_TYPE 공개적으로 정적으로 프로퍼티

public static int KATAKANA_TYPE
리턴 int

NUMERIC_TYPE 공개적으로 정적으로 프로퍼티

Numbers
public static int NUMERIC_TYPE
리턴 int

SOUTH_EAST_ASIAN_TYPE 공개적으로 정적으로 프로퍼티

Chars in class \p{Line_Break = Complex_Context} are from South East Asian scripts (Thai, Lao, Myanmar, Khmer, etc.). Sequences of these are kept together as as a single token rather than broken up, because the logic required to break them at word boundaries is too complex for UAX#29.

See Unicode Line Breaking Algorithm: http://www.unicode.org/reports/tr14/#SA

public static int SOUTH_EAST_ASIAN_TYPE
리턴 int

WORD_TYPE 공개적으로 정적으로 프로퍼티

Alphanumeric sequences
public static int WORD_TYPE
리턴 int

YYEOF 공개적으로 정적으로 프로퍼티

This character denotes the end of file
public static int YYEOF
리턴 int