C# Class org.apache.lucene.analysis.standard.ClassicTokenizerImpl

This class implements the classic lucene StandardTokenizer up until 3.0
Inheritance: StandardTokenizerInterface
ファイルを表示 Open project: paulirwin/lucene.net

Public Properties

Property Type Description
TOKEN_TYPES string[]

Public Methods

Method Description
getText ( CharTermAttribute t ) : void

Fills CharTermAttribute with the current token text.

yybegin ( int newState ) : void

Enters a new lexical state

yychar ( ) : int
yycharat ( int pos ) : char

Returns the character at position pos from the matched text. It is equivalent to yytext().charAt(pos), but faster

yyclose ( ) : void

Closes the input stream.

yylength ( ) : int

Returns the length of the matched text region.

yypushback ( int number ) : void

Pushes the specified amount of characters back into the input stream. They will be read again by then next call of the scanning method

yyreset ( Reader reader ) : void

Resets the scanner to read from a new input stream. Does not close the old reader. All internal variables are reset, the old input stream cannot be reused (internal buffer is discarded and lost). Lexical state is set to ZZ_INITIAL. Internal scan buffer is resized down to its initial length, if it has grown.

yystate ( ) : int

Returns the current lexical state.

yytext ( ) : string

Returns the text matched by the current regular expression.

Private Methods

Method Description
ClassicTokenizerImpl ( Reader @in ) : System

Creates a new scanner

zzRefill ( ) : bool

Refills the input buffer.

zzScanError ( int errorCode ) : void

Reports an error that occured while scanning. In a wellformed scanner (no or only correct usage of yypushback(int) and a match-all fallback rule) this method will only be called with things that "Can't Possibly Happen". If this method is called, something is seriously wrong (e.g. a JFlex bug producing a faulty scanner etc.). Usual syntax/scanner level error handling should be done in error fallback rules.

zzUnpackAction ( string packed, int offset, int result ) : int
zzUnpackAction ( ) : int[]
zzUnpackAttribute ( string packed, int offset, int result ) : int
zzUnpackAttribute ( ) : int[]
zzUnpackCMap ( string packed ) : char[]

Unpacks the compressed character translation table.

zzUnpackRowMap ( string packed, int offset, int result ) : int
zzUnpackRowMap ( ) : int[]
zzUnpackTrans ( string packed, int offset, int result ) : int
zzUnpackTrans ( ) : int[]

Method Details

getText() public method

Fills CharTermAttribute with the current token text.
public getText ( CharTermAttribute t ) : void
t CharTermAttribute
return void

yybegin() public method

Enters a new lexical state
public yybegin ( int newState ) : void
newState int the new lexical state
return void

yychar() public method

public yychar ( ) : int
return int

yycharat() public method

Returns the character at position pos from the matched text. It is equivalent to yytext().charAt(pos), but faster
public yycharat ( int pos ) : char
pos int the position of the character to fetch. /// A value from 0 to yylength()-1. ///
return char

yyclose() public method

Closes the input stream.
public yyclose ( ) : void
return void

yylength() public method

Returns the length of the matched text region.
public yylength ( ) : int
return int

yypushback() public method

Pushes the specified amount of characters back into the input stream. They will be read again by then next call of the scanning method
public yypushback ( int number ) : void
number int the number of characters to be read again. /// This number must not be greater than yylength()!
return void

yyreset() public method

Resets the scanner to read from a new input stream. Does not close the old reader. All internal variables are reset, the old input stream cannot be reused (internal buffer is discarded and lost). Lexical state is set to ZZ_INITIAL. Internal scan buffer is resized down to its initial length, if it has grown.
public yyreset ( Reader reader ) : void
reader Reader the new input stream
return void

yystate() public method

Returns the current lexical state.
public yystate ( ) : int
return int

yytext() public method

Returns the text matched by the current regular expression.
public yytext ( ) : string
return string

Property Details

TOKEN_TYPES public_oe static_oe property

public static string[] TOKEN_TYPES
return string[]