C# Class Lucene.Net.Analysis.Ar.ArabicStemmer

Stemmer for Arabic.

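Stemming is done in place for efficiency: the methods operate directly on a term buffer (a char array plus the number of valid characters in it) and return the new length.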

Stemming is defined as:

  • Removal of attached definite article, conjunction, and prepositions.
  • Stemming of common suffixes.
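The in-place approach can be illustrated with a small, self-contained sketch. This is not the library's code: the class `TermBufferSketch` and the helper `DeleteN` are hypothetical names chosen for illustration. It shows how characters can be removed from a buffer by shifting the tail left and returning the shortened length, so that no new string is allocated.

```csharp
using System;

public static class TermBufferSketch
{
    // Hypothetical helper (an assumption, not part of the documented API):
    // removes nChars characters starting at pos from the first len characters
    // of s by shifting the tail left, and returns the new logical length.
    public static int DeleteN(char[] s, int pos, int len, int nChars)
    {
        if (pos < 0 || nChars < 0 || pos + nChars > len)
            throw new ArgumentOutOfRangeException(nameof(pos));
        // Array.Copy is safe for overlapping ranges within the same array.
        Array.Copy(s, pos + nChars, s, pos, len - pos - nChars);
        return len - nChars;
    }

    public static void Main()
    {
        // alef, lam, kaf, teh, alef, beh: a six-character word that starts
        // with the definite article (alef + lam).
        char[] buffer = "\u0627\u0644\u0643\u062A\u0627\u0628".ToCharArray();
        int len = buffer.Length;

        // Drop the two-character article in place.
        len = DeleteN(buffer, 0, len, 2);

        // Only the first len characters of the buffer are meaningful now.
        Console.WriteLine(len);
        Console.WriteLine(new string(buffer, 0, len));
    }
}
```

Characters past the returned length are stale leftovers, which is why callers are expected to track the length rather than rely on the array size.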

Project: apache/lucenenet

Public Properties

Property Type Description
prefixes char[][]
suffixes char[][]

Public Methods

Method Description
Stem ( char[] s, int len ) : int

Stem an input buffer of Arabic text.

StemPrefix ( char[] s, int len ) : int

Stem a prefix off an Arabic word.

StemSuffix ( char[] s, int len ) : int

Stem suffix(es) off an Arabic word.

Private Methods

Method Description
ArabicStemmer ( ) : Lucene.Net.Analysis.Util
EndsWithCheckLength ( char[] s, int len, char[] suffix ) : bool

Returns true if the suffix matches and can be stemmed.

StartsWithCheckLength ( char[] s, int len, char[] prefix ) : bool

Returns true if the prefix matches and can be stemmed.
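The "matches and can be stemmed" wording suggests two conditions: the affix must be present, and the word must be long enough that something useful remains after removal. The following is a minimal sketch of that idea only. The class name `AffixGuardSketch` and the threshold `MinStemLength` are assumptions made for illustration; the library's actual length rules may differ.

```csharp
using System;

public static class AffixGuardSketch
{
    // Assumption: at least this many characters must remain after the affix
    // is removed. The real threshold used by the library is not stated here.
    private const int MinStemLength = 2;

    // True if the first len characters of s start with prefix and the word
    // is long enough to leave MinStemLength characters behind.
    public static bool StartsWithCheckLength(char[] s, int len, char[] prefix)
    {
        if (len < prefix.Length + MinStemLength) return false;
        for (int i = 0; i < prefix.Length; i++)
            if (s[i] != prefix[i]) return false;
        return true;
    }

    // True if the first len characters of s end with suffix and the word
    // is long enough to leave MinStemLength characters behind.
    public static bool EndsWithCheckLength(char[] s, int len, char[] suffix)
    {
        if (len < suffix.Length + MinStemLength) return false;
        int offset = len - suffix.Length;
        for (int i = 0; i < suffix.Length; i++)
            if (s[offset + i] != suffix[i]) return false;
        return true;
    }

    public static void Main()
    {
        char[] article = "\u0627\u0644".ToCharArray();                    // alef + lam
        char[] longWord = "\u0627\u0644\u0643\u062A\u0627\u0628".ToCharArray(); // article + 4 letters
        char[] shortWord = "\u0627\u0644\u0645".ToCharArray();            // article + 1 letter

        Console.WriteLine(StartsWithCheckLength(longWord, longWord.Length, article));   // True
        Console.WriteLine(StartsWithCheckLength(shortWord, shortWord.Length, article)); // False
    }
}
```

The length check runs before any character comparison, so a buffer shorter than the affix is rejected without indexing out of range.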

Method Details

Stem() public method

Stem an input buffer of Arabic text.
public Stem ( char[] s, int len ) : int
s char[] input buffer
len int length of input buffer
return int length of the buffer contents after stemming
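A hedged usage sketch for `Stem` follows. It assumes the project references the Lucene.NET analysis package that contains `Lucene.Net.Analysis.Ar`, and that `ArabicStemmer` can be created with a public parameterless constructor. The wrapper class `StemUsage` and its `StemWord` helper are hypothetical names introduced only for this example.

```csharp
using System;
using Lucene.Net.Analysis.Ar;

public static class StemUsage
{
    // Hypothetical convenience wrapper: copies the word into a char buffer,
    // stems it in place, and builds a string from the valid part of the buffer.
    public static string StemWord(ArabicStemmer stemmer, string word)
    {
        char[] buffer = word.ToCharArray();
        int newLen = stemmer.Stem(buffer, buffer.Length);
        // Characters beyond newLen are leftovers and must be ignored.
        return new string(buffer, 0, newLen);
    }

    public static void Main()
    {
        // Assumption: a public parameterless constructor is available.
        var stemmer = new ArabicStemmer();

        // alef, lam, kaf, teh, alef, beh
        string word = "\u0627\u0644\u0643\u062A\u0627\u0628";
        string stem = StemWord(stemmer, word);

        Console.WriteLine($"{word.Length} -> {stem.Length}");
    }
}
```

The important detail is the calling convention: the array is modified in place and the return value tells the caller how many leading characters are still valid.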

StemPrefix() public method

Stem a prefix off an Arabic word.
public StemPrefix ( char[] s, int len ) : int
s char[] input buffer
len int length of input buffer
return int new length of the buffer contents after stemming

StemSuffix() public method

Stem suffix(es) off an Arabic word.
public StemSuffix ( char[] s, int len ) : int
s char[] input buffer
len int length of input buffer
return int new length of the buffer contents after stemming
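The descriptions distinguish "a prefix" from "suffix(es)", which suggests that at most one prefix is removed while several suffixes may be removed in one call. The sketch below illustrates that control flow under stated assumptions: the affix tables are small illustrative subsets rather than the contents of the static `prefixes` and `suffixes` members, the two-character minimum is assumed, and `AffixLoopSketch`, `MatchesAt`, and `DeleteN` are hypothetical names.

```csharp
using System;

public static class AffixLoopSketch
{
    // Illustrative subsets only (assumption), not the library's actual tables.
    static readonly char[][] Prefixes =
    {
        "\u0627\u0644".ToCharArray(), // alef + lam (definite article)
        "\u0648".ToCharArray(),       // waw (conjunction)
    };
    static readonly char[][] Suffixes =
    {
        "\u0627\u062A".ToCharArray(), // alef + teh
        "\u0629".ToCharArray(),       // teh marbuta
        "\u064A".ToCharArray(),       // yeh
    };

    // True if affix occurs at offset and at least two characters would
    // remain after removal (the threshold is an assumption).
    static bool MatchesAt(char[] s, int len, int offset, char[] affix)
    {
        if (len < affix.Length + 2) return false;
        for (int i = 0; i < affix.Length; i++)
            if (s[offset + i] != affix[i]) return false;
        return true;
    }

    // Removes n characters at pos by shifting the tail left; returns the new length.
    static int DeleteN(char[] s, int pos, int len, int n)
    {
        Array.Copy(s, pos + n, s, pos, len - pos - n);
        return len - n;
    }

    // Stops at the first matching prefix, so at most one prefix is removed.
    public static int StemPrefix(char[] s, int len)
    {
        foreach (char[] p in Prefixes)
            if (MatchesAt(s, len, 0, p))
                return DeleteN(s, 0, len, p.Length);
        return len;
    }

    // Keeps scanning the table after a match, so a later entry can still
    // apply to the already shortened word.
    public static int StemSuffix(char[] s, int len)
    {
        foreach (char[] x in Suffixes)
            if (MatchesAt(s, len, len - x.Length, x))
                len = DeleteN(s, len - x.Length, len, x.Length);
        return len;
    }

    public static void Main()
    {
        char[] a = "\u0627\u0644\u0643\u062A\u0627\u0628".ToCharArray(); // article + 4 letters
        Console.WriteLine(StemPrefix(a, a.Length)); // 4

        char[] b = "\u0643\u0644\u0645\u0627\u062A".ToCharArray(); // 3 letters + alef + teh
        Console.WriteLine(StemSuffix(b, b.Length)); // 3
    }
}
```

Because the suffix loop updates `len` as it goes, the order of the table entries determines which combinations of suffixes can be stripped in a single pass.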

Property Details

prefixes public static property

public static char[][] prefixes
return char[][]

suffixes public static property

public static char[][] suffixes
return char[][]