Name | Description |
---|---|
ChineseAnalyzer | An Analyzer that tokenizes text with ChineseTokenizer and filters with ChineseFilter |
ChineseFilter | A {@link TokenFilter} with a stop word table.
|
ChineseTokenizer | Tokenize Chinese text as individual chinese chars. The difference between ChineseTokenizer and CJKTokenizer is that they have different token parsing logic. For example, if the Chinese text "C1C2C3C4" is to be indexed:
Therefore the index created by CJKTokenizer is much larger. The problem is that when searching for C1, C1C2, C1C3, C4C2, C1C2C3 ... the ChineseTokenizer works, but the CJKTokenizer will not work. |
ChineseTokenizerFactory | |
TestChineseFilterFactory | Simple tests to ensure the Chinese filter factory is working. |
TestChineseTokenizer | |
TestChineseTokenizer.JustChineseFilterAnalyzer | |
TestChineseTokenizer.JustChineseTokenizerAnalyzer | |
TestChineseTokenizerFactory | Simple tests to ensure the Chinese tokenizer factory is working. |