Property | Type | Description | |
---|---|---|---|
DEFAULT_WORD_DELIM_TABLE | sbyte[] |
Property | Type | Description | |
---|---|---|---|
WordDelimiterIterator | org.apache.lucene.analysis.miscellaneous | ||
charType | int | ||
endsWithPossessive | bool | ||
isBreak | bool | ||
next | int | ||
setBounds | void | ||
setText | void | ||
type | int |
Method | Description | |
---|---|---|
getType ( int ch ) : sbyte | Computes the type of the given character. | |
Method | Description | |
---|---|---|
WordDelimiterIterator ( ) : org.apache.lucene.analysis.miscellaneous | ||
WordDelimiterIterator ( sbyte[] charTypeTable, bool splitOnCaseChange, bool splitOnNumerics, bool stemEnglishPossessive ) : org.apache.lucene.analysis.miscellaneous | Creates a new WordDelimiterIterator operating with the supplied rules. | |
charType ( int ch ) : int | Determines the type of the given character. | |
endsWithPossessive ( int pos ) : bool | Determines whether the text at the given position indicates an English possessive that should be removed. | |
isBreak ( int lastType, int type ) : bool | Determines whether the transition from lastType to type indicates a break. | |
next ( ) : int | Advances to the next subword in the string. | |
setBounds ( ) : void | Sets the internal word bounds by removing leading and trailing delimiters. If a possessive is found, it is only noted at this point, not removed. | |
setText ( char[] text, int length ) : void | Resets the text to a new value and resets all state. | |
type ( ) : int | Returns the type of the current subword. This currently uses the type of the first character in the subword. | |
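The interplay between isBreak, next and the splitOnCaseChange / splitOnNumerics flags may be easier to follow with a small standalone sketch. The class below is not the library's code: the name BreakRuleSketch, the bit-flag values, and the split helper are all assumptions made for illustration, and possessive handling is left out entirely.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; names and flag values are assumptions, not the library's API.
final class BreakRuleSketch {
    // Hypothetical bit flags for character categories.
    static final int LOWER = 0x01, UPPER = 0x02, DIGIT = 0x04, SUBWORD_DELIM = 0x08;
    static final int ALPHA = LOWER | UPPER;

    private final boolean splitOnCaseChange;
    private final boolean splitOnNumerics;

    BreakRuleSketch(boolean splitOnCaseChange, boolean splitOnNumerics) {
        this.splitOnCaseChange = splitOnCaseChange;
        this.splitOnNumerics = splitOnNumerics;
    }

    // Simplified classification: anything that is not a letter case or digit is a delimiter.
    static int charType(char c) {
        if (Character.isLowerCase(c)) return LOWER;
        if (Character.isUpperCase(c)) return UPPER;
        if (Character.isDigit(c)) return DIGIT;
        return SUBWORD_DELIM;
    }

    // Decides whether moving from lastType to type starts a new subword.
    boolean isBreak(int lastType, int type) {
        if ((type & lastType) != 0) return false;              // same category, no break
        boolean lastAlpha = (lastType & ALPHA) != 0;
        boolean alpha = (type & ALPHA) != 0;
        boolean lastDigit = (lastType & DIGIT) != 0;
        boolean digit = (type & DIGIT) != 0;
        if (!splitOnCaseChange && lastAlpha && alpha) return false; // case changes ignored
        if ((lastType & UPPER) != 0 && alpha) return false;    // "Po" in "Power" stays together
        if (!splitOnNumerics && ((lastAlpha && digit) || (lastDigit && alpha))) return false;
        return true;
    }

    // Walks the text once, skipping delimiters and cutting at every break.
    List<String> split(String text) {
        List<String> parts = new ArrayList<>();
        int start = -1;                 // start of the current subword, or -1 if none is open
        int lastType = SUBWORD_DELIM;
        for (int i = 0; i < text.length(); i++) {
            int type = charType(text.charAt(i));
            if (type == SUBWORD_DELIM) {
                if (start >= 0) parts.add(text.substring(start, i));
                start = -1;
            } else if (start < 0) {
                start = i;
            } else if (isBreak(lastType, type)) {
                parts.add(text.substring(start, i));
                start = i;
            }
            lastType = type;
        }
        if (start >= 0) parts.add(text.substring(start));
        return parts;
    }
}
```

Under these assumptions, `new BreakRuleSketch(true, true).split("PowerShot-500")` yields `[Power, Shot, 500]`, while `new BreakRuleSketch(false, false)` keeps `PowerShot` as one subword.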
public static getType ( int ch ) : sbyte | ||
ch | int | Character whose type is to be determined |
return | sbyte |
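As a rough illustration of how a getType-style classifier and a precomputed table such as DEFAULT_WORD_DELIM_TABLE could fit together, here is a minimal Java sketch. It is written under assumptions: the class name CharTypeSketch, the flag values, the category mapping, and the fallback in charType are hypothetical and may differ from what the library actually does.

```java
// Illustrative sketch only; flag values and the category mapping are assumptions.
final class CharTypeSketch {
    // Hypothetical bit flags for character categories.
    static final byte LOWER = 0x01;
    static final byte UPPER = 0x02;
    static final byte DIGIT = 0x04;
    static final byte SUBWORD_DELIM = 0x08;
    static final byte ALPHA = LOWER | UPPER;

    // Lookup table for the first 256 code points, filled once at class load.
    static final byte[] TABLE = new byte[256];
    static {
        for (int i = 0; i < TABLE.length; i++) {
            TABLE[i] = getType(i);
        }
    }

    // Maps a code point to a category using the Unicode general category.
    static byte getType(int ch) {
        switch (Character.getType(ch)) {
            case Character.UPPERCASE_LETTER:
                return UPPER;
            case Character.LOWERCASE_LETTER:
                return LOWER;
            case Character.TITLECASE_LETTER:
            case Character.MODIFIER_LETTER:
            case Character.OTHER_LETTER:
                return ALPHA;           // letters without a simple upper/lower case
            case Character.DECIMAL_DIGIT_NUMBER:
            case Character.LETTER_NUMBER:
            case Character.OTHER_NUMBER:
                return DIGIT;
            default:
                return SUBWORD_DELIM;   // punctuation, whitespace, symbols, and so on
        }
    }

    // Uses the table when the code point is covered, otherwise computes the type directly.
    static int charType(int ch) {
        return (ch >= 0 && ch < TABLE.length) ? TABLE[ch] : getType(ch);
    }
}
```

The table keeps the common case to a single array read, and only characters outside its range pay for the full category lookup.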