C# Class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator

A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. @lucene.internal
Mostra file Open project: paulirwin/lucene.net Class Usage Examples

Public Properties

Property Type Description
DEFAULT_WORD_DELIM_TABLE sbyte[]

Private Properties

Property Type Description
WordDelimiterIterator org.apache.lucene.analysis.miscellaneous
WordDelimiterIterator org.apache.lucene.analysis.miscellaneous
charType int
endsWithPossessive bool
isBreak bool
next int
setBounds void
setText void
type int

Public Methods

Method Description
getType ( int ch ) : sbyte

Computes the type of the given character

Private Methods

Method Description
WordDelimiterIterator ( ) : org.apache.lucene.analysis.miscellaneous
WordDelimiterIterator ( sbyte charTypeTable, bool splitOnCaseChange, bool splitOnNumerics, bool stemEnglishPossessive ) : org.apache.lucene.analysis.miscellaneous

Create a new WordDelimiterIterator operating with the supplied rules.

charType ( int ch ) : int

Determines the type of the given character

endsWithPossessive ( int pos ) : bool

Determines if the text at the given position indicates an English possessive which should be removed

isBreak ( int lastType, int type ) : bool

Determines whether the transition from lastType to type indicates a break

next ( ) : int

Advance to the next subword in the string.

setBounds ( ) : void

Set the internal word bounds (remove leading and trailing delimiters). Note, if a possessive is found, don't remove it yet, simply note it.

setText ( char text, int length ) : void

Reset the text to a new value, and reset all state

type ( ) : int

Return the type of the current subword. This currently uses the type of the first character in the subword.

Method Details

getType() public static method

Computes the type of the given character
public static getType ( int ch ) : sbyte
ch int Character whose type is to be determined
return sbyte

Property Details

DEFAULT_WORD_DELIM_TABLE public_oe static_oe property

public static sbyte[] DEFAULT_WORD_DELIM_TABLE
return sbyte[]