C# Class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator

A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. The class is marked @lucene.internal.

Project: paulirwin/lucene.net

Public Properties

Property	Type	Description
DEFAULT_WORD_DELIM_TABLE	sbyte[]

Private Properties

Property	Type	Description
WordDelimiterIterator	org.apache.lucene.analysis.miscellaneous
charType	int
endsWithPossessive	bool
isBreak	bool
next	int
setBounds	void
setText	void
type	int

Public Methods

Method	Description
getType ( int ch ) : sbyte	Computes the type of the given character

Private Methods

Method	Description
WordDelimiterIterator ( ) : org.apache.lucene.analysis.miscellaneous
WordDelimiterIterator ( sbyte[] charTypeTable, bool splitOnCaseChange, bool splitOnNumerics, bool stemEnglishPossessive ) : org.apache.lucene.analysis.miscellaneous	Creates a new WordDelimiterIterator operating with the supplied rules.
charType ( int ch ) : int	Determines the type of the given character.
endsWithPossessive ( int pos ) : bool	Determines whether the text at the given position indicates an English possessive that should be removed.
isBreak ( int lastType, int type ) : bool	Determines whether the transition from lastType to type indicates a break.
next ( ) : int	Advance to the next subword in the string.
setBounds ( ) : void	Sets the internal word bounds by removing leading and trailing delimiters. If a possessive is found, it is not removed at this point; it is only recorded.
setText ( char[] text, int length ) : void	Resets the text to a new value and resets all state.
type ( ) : int	Return the type of the current subword. This currently uses the type of the first character in the subword.
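
The break rule and the subword iteration described in the table above can be illustrated with a small standalone sketch. It does not call the class itself, whose members are listed as private. The flag values, the SubwordSketch class and its Split helper are assumptions made for illustration, and the sketch returns substrings, whereas next() returns an int.

```csharp
using System;
using System.Collections.Generic;

// Illustrative sketch only; not the Lucene.NET implementation.
public static class SubwordSketch
{
    // Assumed bit flags for character classes (hypothetical values).
    public const int Lower = 0x01;
    public const int Upper = 0x02;
    public const int Digit = 0x04;
    public const int Delim = 0x08;
    public const int Alpha = Lower | Upper;

    // Simplified classifier: anything that is not a cased letter
    // or a decimal digit is treated as a delimiter.
    public static int CharType(char ch)
    {
        if (char.IsLower(ch)) return Lower;
        if (char.IsUpper(ch)) return Upper;
        if (char.IsDigit(ch)) return Digit;
        return Delim;
    }

    // Decides whether moving from lastType to type starts a new subword.
    public static bool IsBreak(int lastType, int type,
                               bool splitOnCaseChange, bool splitOnNumerics)
    {
        if ((lastType & type) != 0) return false; // same class, no break

        bool lastAlpha = (lastType & Alpha) != 0, alpha = (type & Alpha) != 0;
        bool lastDigit = (lastType & Digit) != 0, digit = (type & Digit) != 0;

        // Letter to letter: never a break when case changes are ignored.
        if (!splitOnCaseChange && lastAlpha && alpha) return false;
        // Upper followed by a letter stays together, as in "Po".
        if ((lastType & Upper) != 0 && alpha) return false;
        // Letter/digit boundaries only break when splitOnNumerics is set.
        if (!splitOnNumerics &&
            ((lastAlpha && digit) || (lastDigit && alpha))) return false;

        return true;
    }

    // Collects subwords by skipping delimiters and cutting at each break.
    public static List<string> Split(string text,
                                     bool splitOnCaseChange = true,
                                     bool splitOnNumerics = true)
    {
        var parts = new List<string>();
        int pos = 0;
        while (pos < text.Length)
        {
            // Skip leading delimiters.
            while (pos < text.Length && CharType(text[pos]) == Delim) pos++;
            if (pos >= text.Length) break;

            int start = pos;
            int lastType = CharType(text[pos++]);
            while (pos < text.Length)
            {
                int type = CharType(text[pos]);
                if (IsBreak(lastType, type, splitOnCaseChange, splitOnNumerics)) break;
                lastType = type;
                pos++;
            }
            parts.Add(text.Substring(start, pos - start));
        }
        return parts;
    }
}
```

With both flags enabled, SubwordSketch.Split("PowerShot500-42") yields "Power", "Shot", "500" and "42". With both flags disabled it yields "PowerShot500" and "42", because only the delimiter still forces a break.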

Method Details

getType() public static method

Computes the type of the given character

public static getType ( int ch ) : sbyte
ch	int	Character whose type is to be determined
return	sbyte
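
As a rough illustration of what computing a character's type might look like, the sketch below maps Unicode general categories to sbyte flags. The flag values, the CharTypeSketch class and the exact category mapping are assumptions, not taken from the library; only the signature shape (int in, sbyte out) follows the entry above.

```csharp
using System;
using System.Globalization;

// Illustrative sketch only; flag values and mapping are assumptions.
public static class CharTypeSketch
{
    public const sbyte Lower = 0x01;
    public const sbyte Upper = 0x02;
    public const sbyte Alpha = 0x03; // Lower | Upper
    public const sbyte Digit = 0x04;
    public const sbyte Delim = 0x08;

    // Assumes ch is a valid Unicode code point.
    public static sbyte GetTypeSketch(int ch)
    {
        // Code points above the BMP are looked up through a surrogate pair.
        UnicodeCategory cat = ch <= char.MaxValue
            ? CharUnicodeInfo.GetUnicodeCategory((char)ch)
            : CharUnicodeInfo.GetUnicodeCategory(char.ConvertFromUtf32(ch), 0);

        switch (cat)
        {
            case UnicodeCategory.UppercaseLetter:
                return Upper;
            case UnicodeCategory.LowercaseLetter:
                return Lower;
            // Caseless letters and combining marks count as generic alpha.
            case UnicodeCategory.TitlecaseLetter:
            case UnicodeCategory.ModifierLetter:
            case UnicodeCategory.OtherLetter:
            case UnicodeCategory.NonSpacingMark:
            case UnicodeCategory.EnclosingMark:
            case UnicodeCategory.SpacingCombiningMark:
                return Alpha;
            case UnicodeCategory.DecimalDigitNumber:
            case UnicodeCategory.LetterNumber:
            case UnicodeCategory.OtherNumber:
                return Digit;
            // Everything else (punctuation, whitespace, symbols) delimits.
            default:
                return Delim;
        }
    }
}
```

Under these assumptions, 'a' maps to Lower, 'A' to Upper, '7' to Digit and '-' to Delim.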

Property Details

DEFAULT_WORD_DELIM_TABLE public static property

public static sbyte[] DEFAULT_WORD_DELIM_TABLE
return	sbyte[]
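
A precomputed sbyte[] table lets the character type be read by index instead of being recomputed per character. The sketch below shows one way such a table could be built and consulted. The table size of 256, the flag values and the DelimTableSketch class are assumptions for illustration and are not confirmed by this page.

```csharp
using System;

// Illustrative sketch only; size and flag values are assumptions.
public static class DelimTableSketch
{
    public const sbyte Lower = 1, Upper = 2, Digit = 4, Delim = 8;

    // Classifies one character without using a table.
    public static sbyte Classify(char c)
    {
        if (char.IsLower(c)) return Lower;
        if (char.IsUpper(c)) return Upper;
        if (char.IsDigit(c)) return Digit;
        return Delim;
    }

    // Builds a table indexed by character code for the first 256 codes.
    public static sbyte[] BuildTable()
    {
        var table = new sbyte[256];
        for (int i = 0; i < table.Length; i++)
        {
            table[i] = Classify((char)i);
        }
        return table;
    }

    // Reads the table for codes it covers and computes the type otherwise.
    public static sbyte Lookup(sbyte[] table, int ch)
    {
        if (ch >= 0 && ch < table.Length) return table[ch];
        return Classify((char)ch);
    }
}
```

A caller could copy the table and overwrite single entries, for example setting the entry for '$' to Digit, to change how one character is classified without touching the rest.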