C# Class Lucene.Net.Analysis.Tokenizer

A Tokenizer is a TokenStream whose input is a System.IO.TextReader.

This is an abstract class; subclasses must override IncrementToken().

NOTE: Subclasses overriding IncrementToken() must call AttributeSource.ClearAttributes() before setting attributes.

Inheritance: TokenStream
Project: apache/lucenenet

Protected Properties

Property Type Description
input System.IO.TextReader The text source for this Tokenizer.

Public Methods

Method Description
Dispose ( ) : void

NOTE: The default implementation closes the input TextReader, so be sure to call base.Dispose() when overriding this method.

Reset ( ) : void

Protected Methods

Method Description
CorrectOffset ( int currentOff ) : int

Returns the corrected offset. If input is a CharFilter subclass, this method calls CharFilter.CorrectOffset; otherwise it returns currentOff unchanged.

Tokenizer ( AttributeFactory factory, TextReader input )

Construct a token stream processing the given input using the given AttributeFactory.

Tokenizer ( TextReader input )

Construct a token stream processing the given input.

Private Methods

Method Description
SetReaderTestPoint ( ) : bool

Method Details

CorrectOffset() protected method

Returns the corrected offset. If input is a CharFilter subclass, this method calls CharFilter.CorrectOffset; otherwise it returns currentOff unchanged.
protected CorrectOffset ( int currentOff ) : int
currentOff int offset as seen in the output
return int
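
The dispatch described above can be sketched without referencing Lucene.Net itself. This is a minimal illustration, not the library's implementation: OffsetMappingReader and SkipPrefixReader are hypothetical stand-ins for CharFilter and one of its subclasses, and OffsetDemo.CorrectOffset is an assumed helper that mirrors the documented behavior.

```csharp
using System;
using System.IO;

// Hypothetical stand-in for CharFilter: a reader that can map an offset in its
// filtered output back to an offset in the original text.
abstract class OffsetMappingReader : TextReader
{
    public abstract int CorrectOffset(int currentOff);
}

// Hypothetical filter that drops a fixed-length prefix of the wrapped reader.
sealed class SkipPrefixReader : OffsetMappingReader
{
    private readonly TextReader inner;
    private readonly int skipped;

    public SkipPrefixReader(TextReader inner, int prefixLength)
    {
        this.inner = inner;
        // Consume the prefix and remember how many characters were removed.
        for (int i = 0; i < prefixLength && inner.Read() != -1; i++)
        {
            skipped++;
        }
    }

    public override int Read() => inner.Read();
    public override int Peek() => inner.Peek();

    // Offsets in the filtered output are shifted by the removed prefix.
    public override int CorrectOffset(int currentOff) => currentOff + skipped;

    protected override void Dispose(bool disposing)
    {
        if (disposing) inner.Dispose();
        base.Dispose(disposing);
    }
}

static class OffsetDemo
{
    // Mirrors the documented rule: delegate to the filter when the input is
    // one, otherwise return the offset unchanged.
    public static int CorrectOffset(TextReader input, int currentOff)
    {
        return input is OffsetMappingReader filter
            ? filter.CorrectOffset(currentOff)
            : currentOff;
    }
}
```

With a plain StringReader the offset passes through unchanged; with the hypothetical SkipPrefixReader wrapping the same text, the offset is shifted by the number of characters the filter removed.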

Dispose() public method

NOTE: The default implementation closes the input TextReader, so be sure to call base.Dispose() when overriding this method.

public Dispose ( ) : void
return void
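
The override rule in the note can be sketched with stand-in types, so the example does not depend on a particular Lucene.Net version. ReaderOwner is a hypothetical substitute for Tokenizer, BufferedOwner for a subclass that holds extra state, and TrackingReader exists only to make the close observable; none of these names come from the library.

```csharp
using System;
using System.IO;

// Hypothetical reader that records whether it has been disposed.
class TrackingReader : StringReader
{
    public bool Closed { get; private set; }

    public TrackingReader(string text) : base(text) { }

    protected override void Dispose(bool disposing)
    {
        Closed = true;
        base.Dispose(disposing);
    }
}

// Hypothetical stand-in for Tokenizer: owns a reader and closes it on Dispose().
class ReaderOwner : IDisposable
{
    protected TextReader input;

    public ReaderOwner(TextReader input)
    {
        this.input = input;
    }

    public virtual void Dispose()
    {
        input?.Dispose();
    }
}

// Hypothetical subclass with its own state to release.
class BufferedOwner : ReaderOwner
{
    private char[] buffer = new char[1024];

    public bool BufferReleased => buffer == null;

    public BufferedOwner(TextReader input) : base(input) { }

    public override void Dispose()
    {
        buffer = null;   // release subclass state first
        base.Dispose();  // then let the base class close the input reader
    }
}
```

If the call to base.Dispose() were omitted in BufferedOwner, the reader would stay open, which is the situation the note warns about.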

Reset() public method

public Reset ( ) : void
return void

Tokenizer() protected constructor

Construct a token stream processing the given input using the given AttributeFactory.
protected Tokenizer ( AttributeFactory factory, TextReader input )
factory AttributeFactory
input System.IO.TextReader

Tokenizer() protected constructor

Construct a token stream processing the given input.
protected Tokenizer ( TextReader input )
input System.IO.TextReader

Property Details

input protected property

The text source for this Tokenizer.
protected System.IO.TextReader input
return System.IO.TextReader