C# Class Lucene.Net.Analysis.El.GreekStemmer

A stemmer for Greek words, based on "Development of a Stemmer for the Greek Language" by Georgios Ntais.

NOTE: Input is expected to be case-folded for Greek (including folding of final sigma to sigma) and to have diacritics removed. This can be achieved with either GreekLowerCaseFilter or ICUFoldingFilter. This API is marked experimental (@lucene.experimental).
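To make the input requirement concrete, the following is a minimal, self-contained sketch of the kind of folding the note describes. It is not the implementation of GreekLowerCaseFilter or ICUFoldingFilter; the class GreekFoldingSketch and the method Fold are hypothetical names introduced only for illustration, and the sketch assumes that stripping Unicode combining marks after canonical decomposition is an acceptable approximation of diacritic removal.

```csharp
using System;
using System.Globalization;
using System.Text;

public static class GreekFoldingSketch
{
    // Hypothetical helper (not part of Lucene.Net): lowercases the input,
    // removes combining diacritics, and folds final sigma to sigma.
    public static string Fold(string input)
    {
        // Canonical decomposition turns accented letters into base letter + combining mark.
        string decomposed = input.Normalize(NormalizationForm.FormD);
        var sb = new StringBuilder(decomposed.Length);

        foreach (char c in decomposed)
        {
            // Skip combining marks such as tonos and dialytika.
            if (CharUnicodeInfo.GetUnicodeCategory(c) == UnicodeCategory.NonSpacingMark)
            {
                continue;
            }

            char lower = char.ToLowerInvariant(c);

            // Fold final sigma (U+03C2) to regular sigma (U+03C3).
            sb.Append(lower == '\u03C2' ? '\u03C3' : lower);
        }

        return sb.ToString();
    }
}
```

Under these assumptions, Fold("Άνθρωπος") yields "ανθρωποσ", which is the shape of input the stemmer expects. In a real analysis chain the filters named in the note would normally do this work.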

Project: apache/lucenenet

Public Methods

Method Description
Stem ( char[] s, int len ) : int

Stems a word contained in the leading portion of a char[] array. The word is passed through a number of rules that modify its length.

Private Methods

Method Description
EndsWith ( char[] s, int len, string suffix ) : bool

Checks whether the word contained in the leading portion of the char[] array ends with the suffix given as a parameter.

EndsWithVowel ( char[] s, int len ) : bool

Checks whether the word contained in the leading portion of the char[] array ends with a Greek vowel.

EndsWithVowelNoY ( char[] s, int len ) : bool

Checks whether the word contained in the leading portion of the char[] array ends with a Greek vowel other than upsilon (υ), as the method name indicates.

Rule0 ( char[] s, int len ) : int
Rule1 ( char[] s, int len ) : int
Rule2 ( char[] s, int len ) : int
Rule3 ( char[] s, int len ) : int
Rule4 ( char[] s, int len ) : int
Rule5 ( char[] s, int len ) : int
Rule6 ( char[] s, int len ) : int
Rule7 ( char[] s, int len ) : int
Rule8 ( char[] s, int len ) : int
Rule9 ( char[] s, int len ) : int
Rule10 ( char[] s, int len ) : int
Rule11 ( char[] s, int len ) : int
Rule12 ( char[] s, int len ) : int
Rule13 ( char[] s, int len ) : int
Rule14 ( char[] s, int len ) : int
Rule15 ( char[] s, int len ) : int
Rule16 ( char[] s, int len ) : int
Rule17 ( char[] s, int len ) : int
Rule18 ( char[] s, int len ) : int
Rule19 ( char[] s, int len ) : int
Rule20 ( char[] s, int len ) : int
Rule21 ( char[] s, int len ) : int
rule22 ( char[] s, int len ) : int
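
The three suffix helpers listed above are private, so their bodies are not shown on this page. The following is a rough, self-contained sketch of how such checks can be written against a buffer-plus-length pair. The class name SuffixHelpersSketch is hypothetical, the vowel sets are assumptions made for illustration, and treating EndsWithVowelNoY as "any vowel except upsilon" is inferred from the method name rather than from the library source.

```csharp
using System;

public static class SuffixHelpersSketch
{
    // Assumed vowel sets for case-folded, diacritic-free Greek input.
    private const string Vowels = "αεηιουω";
    private const string VowelsNoY = "αεηιοω";

    // True if the first len characters of s end with suffix.
    public static bool EndsWith(char[] s, int len, string suffix)
    {
        int suffixLen = suffix.Length;
        if (suffixLen > len)
        {
            return false;
        }

        // Compare from the last character of the suffix backwards.
        for (int i = suffixLen - 1; i >= 0; i--)
        {
            if (s[len - (suffixLen - i)] != suffix[i])
            {
                return false;
            }
        }

        return true;
    }

    // True if the last character of the word is one of the assumed vowels.
    public static bool EndsWithVowel(char[] s, int len)
    {
        return len > 0 && Vowels.IndexOf(s[len - 1]) >= 0;
    }

    // Same check, but upsilon does not count as a vowel.
    public static bool EndsWithVowelNoY(char[] s, int len)
    {
        return len > 0 && VowelsNoY.IndexOf(s[len - 1]) >= 0;
    }
}
```

Note that all three checks look only at the first len characters, so anything stored in the buffer beyond that position is ignored.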

Method Details

Stem() public method

Stems a word contained in the leading portion of a char[] array. The word is passed through a number of rules that modify its length.
public Stem ( char[] s, int len ) : int
s char[] The array that contains the word to be stemmed.
len int The length of the word, i.e. the number of leading characters of s to consider.
return int The new length of the stemmed word.
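
Because Stem returns a length instead of a new string, the caller has to rebuild the stemmed word from the buffer. The sketch below illustrates that calling convention with a single made-up rule; it does not reproduce any of the actual Greek stemming rules. LengthBasedStemSketch, StripSuffix and Apply are hypothetical names, and the choice of the suffix "ων" is an assumption made purely for demonstration.

```csharp
using System;

public static class LengthBasedStemSketch
{
    // Hypothetical rule: drop a trailing "ων" if at least one character remains.
    // In this sketch the buffer is left untouched; only the logical length changes.
    public static int StripSuffix(char[] s, int len)
    {
        if (len > 2 && s[len - 2] == 'ω' && s[len - 1] == 'ν')
        {
            return len - 2;
        }

        return len;
    }

    // Shows how a caller turns the returned length back into a string.
    public static string Apply(string word)
    {
        char[] buffer = word.ToCharArray();
        int newLen = StripSuffix(buffer, buffer.Length);
        return new string(buffer, 0, newLen);
    }
}
```

The same pattern should apply to the value returned by Stem ( char[] s, int len ): keep the original buffer and construct the result from its first newLen characters.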