C# Class Lucene.Net.Analysis.Id.IndonesianStemmer

Stemmer for Indonesian.

Stems Indonesian words with the algorithm presented in "A Study of Stemming Effects on Information Retrieval in Bahasa Indonesia" by Fadillah Z Tala: http://www.illc.uva.nl/Publications/ResearchReports/MoL-2003-02.text.pdf

Project: apache/lucenenet

Public Methods

Method Description
stem ( char[] text, int length, bool stemDerivational ) : int

Stem a term (returning its new length).

Use stemDerivational to control whether full stemming or only light inflectional stemming is done.

Private Methods

Method Description
StemDerivational ( char[] text, int length ) : int
isVowel ( char ch ) : bool
removeFirstOrderPrefix ( char[] text, int length ) : int
removeParticle ( char[] text, int length ) : int
removePossessivePronoun ( char[] text, int length ) : int
removeSecondOrderPrefix ( char[] text, int length ) : int
removeSuffix ( char[] text, int length ) : int

Method Details

stem() public method

Stem a term (returning its new length).

Use stemDerivational to control whether full stemming or only light inflectional stemming is done.

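public stem ( char[] text, int length, bool stemDerivational ) : int
text char[] The buffer holding the term to stem.
length int The length of the term in the buffer.
stemDerivational bool Whether to perform full stemming rather than only light inflectional stemming.
return int The new length of the term after stemming.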
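
The "returning its new length" convention can be illustrated with a minimal, self-contained sketch. It is not the Lucene.Net implementation: the class InPlaceSuffixSketch, its helpers, and the particle list ("kah", "lah", "pun") are assumptions chosen for illustration, and the sketch omits any minimum-length guard a real stemmer would apply. It shows only how a method can operate on a char buffer plus a length and report the shortened length back to the caller.

```csharp
using System;

// Hypothetical illustration of the "buffer + length in, new length out" contract.
// This is NOT the Lucene.Net IndonesianStemmer; all names here are assumptions.
public static class InPlaceSuffixSketch
{
    // Assumed particle suffixes, used only for this illustration.
    private static readonly string[] Particles = { "kah", "lah", "pun" };

    // Returns true if the first `length` chars of `text` end with `suffix`.
    private static bool EndsWith(char[] text, int length, string suffix)
    {
        if (suffix.Length > length) return false;
        for (int i = 0; i < suffix.Length; i++)
        {
            if (text[length - suffix.Length + i] != suffix[i]) return false;
        }
        return true;
    }

    // Drops one trailing particle, if present, by returning a shorter length.
    // The buffer contents are left untouched; only the logical length changes.
    public static int RemoveParticle(char[] text, int length)
    {
        foreach (string particle in Particles)
        {
            if (EndsWith(text, length, particle)) return length - particle.Length;
        }
        return length;
    }

    // Convenience wrapper: copy the word into a buffer, shorten it,
    // then rebuild a string from the first `newLength` characters.
    public static string Apply(string word)
    {
        char[] buffer = word.ToCharArray();
        int newLength = RemoveParticle(buffer, buffer.Length);
        return new string(buffer, 0, newLength);
    }
}
```

With this sketch, InPlaceSuffixSketch.Apply("bukulah") yields "buku", while a word without one of the assumed particles, such as "buku", is returned unchanged. The caller is responsible for reading only the first newLength characters of the buffer, which is the same responsibility a caller of stem() has with its return value.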