C# 클래스 Lucene.Net.Analysis.Ar.ArabicAnalyzer

Analyzer for Arabic.

This analyzer implements light-stemming as specified by: Light Stemming for Arabic Information Retrieval http://www.mtholyoke.edu/~lballest/Pubs/arab_stem05.pdf

The analysis package contains three primary components:

ArabicNormalizationFilter: Arabic orthographic normalization.
ArabicStemFilter: Arabic light stemming
Arabic stop words file: a set of default Arabic stop words.

상속: Lucene.Net.Analysis.Util.StopwordAnalyzerBase

파일 보기 프로젝트 열기: apache/lucenenet 1 사용 예제들

공개 메소드들

메소드	설명
ArabicAnalyzer ( LuceneVersion matchVersion ) : Lucene.Net.Analysis.Core	Builds an analyzer with the default stop words: #DEFAULT_STOPWORD_FILE.
ArabicAnalyzer ( LuceneVersion matchVersion, CharArraySet stopwords ) : Lucene.Net.Analysis.Core	Builds an analyzer with the given stop words
ArabicAnalyzer ( LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet ) : Lucene.Net.Analysis.Core	Builds an analyzer with the given stop word. If a none-empty stem exclusion set is provided this analyzer will add a SetKeywordMarkerFilter before ArabicStemFilter.
CreateComponents ( string fieldName, TextReader reader ) : TokenStreamComponents	Creates org.apache.lucene.analysis.Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.

메소드 상세

ArabicAnalyzer() 공개 메소드

Builds an analyzer with the default stop words: #DEFAULT_STOPWORD_FILE.

public ArabicAnalyzer ( LuceneVersion matchVersion ) : Lucene.Net.Analysis.Core
matchVersion	LuceneVersion
리턴	Lucene.Net.Analysis.Core

ArabicAnalyzer() 공개 메소드

Builds an analyzer with the given stop words

public ArabicAnalyzer ( LuceneVersion matchVersion, CharArraySet stopwords ) : Lucene.Net.Analysis.Core
matchVersion	LuceneVersion	/// lucene compatibility version
stopwords	CharArraySet	/// a stopword set
리턴	Lucene.Net.Analysis.Core

ArabicAnalyzer() 공개 메소드

Builds an analyzer with the given stop word. If a none-empty stem exclusion set is provided this analyzer will add a SetKeywordMarkerFilter before ArabicStemFilter.

public ArabicAnalyzer ( LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet ) : Lucene.Net.Analysis.Core
matchVersion	LuceneVersion	/// lucene compatibility version
stopwords	CharArraySet	/// a stopword set
stemExclusionSet	CharArraySet	/// a set of terms not to be stemmed
리턴	Lucene.Net.Analysis.Core

CreateComponents() 공개 메소드

Creates org.apache.lucene.analysis.Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.

public CreateComponents ( string fieldName, TextReader reader ) : TokenStreamComponents
fieldName	string
reader	System.IO.TextReader
리턴	TokenStreamComponents