C# Class Lucene.Net.Analysis.Ar.ArabicStemmer

Stemmer for Arabic.

Stemming is done in-place for efficiency, operating on a termbuffer.

Stemming is defined as:

  • Removal of attached definite article, conjunction, and prepositions.
  • Stemming of common suffixes.

Afficher le fichier Open project: apache/lucenenet

Méthodes publiques

Свойство Type Description
prefixes char[][]
suffixes char[][]

Méthodes publiques

Méthode Description
Stem ( char s, int len ) : int

Stem an input buffer of Arabic text.

StemPrefix ( char s, int len ) : int

Stem a prefix off an Arabic word.

StemSuffix ( char s, int len ) : int

Stem suffix(es) off an Arabic word.

Private Methods

Méthode Description
ArabicStemmer ( ) : Lucene.Net.Analysis.Util
EndsWithCheckLength ( char s, int len, char suffix ) : bool

Returns true if the suffix matches and can be stemmed

StartsWithCheckLength ( char s, int len, char prefix ) : bool

Returns true if the prefix matches and can be stemmed

Method Details

Stem() public méthode

Stem an input buffer of Arabic text.
public Stem ( char s, int len ) : int
s char input buffer
len int length of input buffer
Résultat int

StemPrefix() public méthode

Stem a prefix off an Arabic word.
public StemPrefix ( char s, int len ) : int
s char input buffer
len int length of input buffer
Résultat int

StemSuffix() public méthode

Stem suffix(es) off an Arabic word.
public StemSuffix ( char s, int len ) : int
s char input buffer
len int length of input buffer
Résultat int

Property Details

prefixes public_oe static_oe property

public static char[][] prefixes
Résultat char[][]

suffixes public_oe static_oe property

public static char[][] suffixes
Résultat char[][]