Name | Description |
---|---|
IndicNormalizer | Normalizes the Unicode representation of text in Indian languages. Follows guidelines from Unicode 5.2, chapter 6, South Asian Scripts I and graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html |
IndicNormalizer.ScriptData | |
IndicTokenizer | |
TestIndicNormalizer | Test IndicNormalizer |
TestIndicNormalizer.AnalyzerAnonymousInnerClassHelper |