C# Class Lucene.Net.Analysis.Query.QueryAutoStopWordAnalyzer

An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries.

For very large indexes, the cost of reading TermDocs for a very common word can be high. This analyzer was created after experience with a 38-million-document index in which one term appeared in around 50% of documents, causing TermQueries for that term to take 2 seconds.
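A typical workflow can be sketched as follows: wrap an existing analyzer, let the wrapper scan an index for overly common terms, then use the wrapper when analyzing queries. This is a minimal sketch, not verified against this project. It assumes the Lucene.Net 3.0.x API (`RAMDirectory`, `IndexWriter`, `IndexReader.Open`, `WhitespaceAnalyzer`, `Version.LUCENE_30`) alongside the members listed on this page. The field name `body`, the sample texts, and the 0.5 threshold are made up for illustration.

```csharp
using System;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Query;
using Lucene.Net.Documents;
using Lucene.Net.Index;
using Lucene.Net.Store;
using Version = Lucene.Net.Util.Version;

public static class QueryAutoStopWordExample
{
    public static void Main()
    {
        // Assumption: a tiny in-memory index stands in for a real, large index.
        var directory = new RAMDirectory();
        Analyzer baseAnalyzer = new WhitespaceAnalyzer();

        using (var writer = new IndexWriter(directory, baseAnalyzer,
                                            IndexWriter.MaxFieldLength.UNLIMITED))
        {
            foreach (string text in new[] { "the quick fox", "the lazy dog", "the end" })
            {
                var doc = new Document();
                doc.Add(new Field("body", text, Field.Store.NO, Field.Index.ANALYZED));
                writer.AddDocument(doc);
            }
        }

        // Wrap the base analyzer; index-time analysis keeps using baseAnalyzer.
        var queryAnalyzer = new QueryAutoStopWordAnalyzer(Version.LUCENE_30, baseAnalyzer);

        using (IndexReader reader = IndexReader.Open(directory, true))
        {
            // Assumption: 0.5f is read as "more than half of the documents".
            int added = queryAnalyzer.AddStopWords(reader, "body", 0.5f);
            Console.WriteLine("Stop words added: " + added);
        }

        // Inspect what was identified for the hypothetical "body" field.
        foreach (string word in queryAnalyzer.GetStopWords("body"))
        {
            Console.WriteLine(word);
        }
    }
}
```

The wrapper would normally be passed only to query-side code, such as a query parser, so that documents are still indexed with the unfiltered base analyzer.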

Inheritance: AnalyzerWrapper
Project: synhershko/lucene.net

Public Methods

Method Description
AddStopWords ( IndexReader reader ) : int
AddStopWords ( IndexReader reader, String fieldName, float maxPercentDocs ) : int
AddStopWords ( IndexReader reader, String fieldName, int maxDocFreq ) : int
AddStopWords ( IndexReader reader, float maxPercentDocs ) : int
AddStopWords ( IndexReader reader, int maxDocFreq ) : int
GetStopWords ( String fieldName ) : String[]
GetStopWords ( ) : Lucene.Net.Index.Term[]
QueryAutoStopWordAnalyzer ( Lucene.Net.Util.Version matchVersion, Analyzer _delegate )
ReusableTokenStream ( String fieldName, TextReader reader ) : TokenStream
TokenStream ( String fieldName, TextReader reader ) : TokenStream

Method Details

AddStopWords() public method

public AddStopWords ( IndexReader reader ) : int
reader Lucene.Net.Index.IndexReader
return int

AddStopWords() public method

public AddStopWords ( IndexReader reader, String fieldName, float maxPercentDocs ) : int
reader Lucene.Net.Index.IndexReader
fieldName String
maxPercentDocs float
return int

AddStopWords() public method

public AddStopWords ( IndexReader reader, String fieldName, int maxDocFreq ) : int
reader Lucene.Net.Index.IndexReader
fieldName String
maxDocFreq int
return int

AddStopWords() public method

public AddStopWords ( IndexReader reader, float maxPercentDocs ) : int
reader Lucene.Net.Index.IndexReader
maxPercentDocs float
return int

AddStopWords() public method

public AddStopWords ( IndexReader reader, int maxDocFreq ) : int
reader Lucene.Net.Index.IndexReader
maxDocFreq int
return int
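Judging by the parameter names, the AddStopWords overloads take the cutoff either as a fraction of all documents (`maxPercentDocs`) or as an absolute document count (`maxDocFreq`). The following self-contained sketch illustrates how such a cutoff could be applied. It is an assumption about the general technique, not this library's implementation. `ToMaxDocFreq`, `FindStopWords`, and the sample frequencies are hypothetical and do not use Lucene.Net.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class StopWordThresholdSketch
{
    // Hypothetical helper: turns a fraction of the document count into an absolute cutoff.
    public static int ToMaxDocFreq(int numDocs, float maxPercentDocs)
    {
        return (int)(numDocs * maxPercentDocs);
    }

    // Hypothetical helper: returns the terms whose document frequency exceeds the cutoff.
    public static List<string> FindStopWords(IDictionary<string, int> docFreqByTerm, int maxDocFreq)
    {
        return docFreqByTerm
            .Where(pair => pair.Value > maxDocFreq)
            .Select(pair => pair.Key)
            .OrderBy(term => term, StringComparer.Ordinal)
            .ToList();
    }

    public static void Main()
    {
        // Made-up document frequencies for an index of 1000 documents.
        var docFreqByTerm = new Dictionary<string, int>
        {
            { "the", 950 },
            { "lucene", 120 },
            { "analyzer", 45 }
        };

        int cutoff = ToMaxDocFreq(1000, 0.4f); // 400 documents
        foreach (string term in FindStopWords(docFreqByTerm, cutoff))
        {
            Console.WriteLine(term); // prints "the"
        }
    }
}
```

A fractional cutoff adapts as the index grows, while an absolute count is easier to reason about when the index size is stable.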

GetStopWords() public method

public GetStopWords ( String fieldName ) : String[]
fieldName String
return String[]

GetStopWords() public method

public GetStopWords ( ) : Lucene.Net.Index.Term[]
return Lucene.Net.Index.Term[]

QueryAutoStopWordAnalyzer() public constructor

public QueryAutoStopWordAnalyzer ( Lucene.Net.Util.Version matchVersion, Analyzer _delegate )
matchVersion Lucene.Net.Util.Version
_delegate Analyzer

ReusableTokenStream() public method

public ReusableTokenStream ( String fieldName, TextReader reader ) : TokenStream
fieldName String
reader TextReader
return TokenStream

TokenStream() public method

public TokenStream ( String fieldName, TextReader reader ) : TokenStream
fieldName String
reader TextReader
return TokenStream