C# Class Lucene.Net.Analysis.Miscellaneous.WordDelimiterIterator

A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. @lucene.internal
Mostra file Open project: apache/lucenenet Class Usage Examples

Public Properties

Property Type Description
DEFAULT_WORD_DELIM_TABLE sbyte[]

Private Properties

Property Type Description
CharType int
EndsWithPossessive bool
IsBreak bool
Next int
SetBounds void
SetText void
WordDelimiterIterator System.Globalization
WordDelimiterIterator System.Globalization

Public Methods

Method Description
GetType ( int ch ) : sbyte

Computes the type of the given character

Private Methods

Method Description
CharType ( int ch ) : int

Determines the type of the given character

EndsWithPossessive ( int pos ) : bool

Determines if the text at the given position indicates an English possessive which should be removed

IsBreak ( int lastType, int type ) : bool

Determines whether the transition from lastType to type indicates a break

Next ( ) : int

Advance to the next subword in the string.

SetBounds ( ) : void

Set the internal word bounds (remove leading and trailing delimiters). Note, if a possessive is found, don't remove it yet, simply note it.

SetText ( char text, int length ) : void

Reset the text to a new value, and reset all state

WordDelimiterIterator ( ) : System.Globalization
WordDelimiterIterator ( sbyte charTypeTable, bool splitOnCaseChange, bool splitOnNumerics, bool stemEnglishPossessive ) : System.Globalization

Create a new WordDelimiterIterator operating with the supplied rules.

Method Details

GetType() public static method

Computes the type of the given character
public static GetType ( int ch ) : sbyte
ch int Character whose type is to be determined
return sbyte

Property Details

DEFAULT_WORD_DELIM_TABLE public_oe static_oe property

public static sbyte[] DEFAULT_WORD_DELIM_TABLE
return sbyte[]