C# 클래스 Lucene.Net.Analysis.Ar.ArabicNormalizer

Normalizer for Arabic.

Normalization is done in-place for efficiency, operating on a termbuffer.

Normalization is defined as:

  • Normalization of hamza with alef seat to a bare alef.
  • Normalization of teh marbuta to heh
  • Normalization of dotless yeh (alef maksura) to yeh.
  • Removal of Arabic diacritics (the harakat)
  • Removal of tatweel (stretching character).

파일 보기 프로젝트 열기: apache/lucenenet

공개 메소드들

메소드 설명
Normalize ( char s, int len ) : int

Normalize an input buffer of Arabic text

메소드 상세

Normalize() 공개 메소드

Normalize an input buffer of Arabic text
public Normalize ( char s, int len ) : int
s char input buffer
len int length of input buffer
리턴 int