Method | Description
---|---
ResolveEntities ( string contentsWithUnresolvedEntityReferences ) : string | Replaces entity references with spaces.
SimpleHtmlTokenizer ( ) : System | Constructor uses BREAK_ON_WORD_BREAKS tokenizer config by default.
SimpleHtmlTokenizer ( int tokenizerConfig ) : System |
SimpleHtmlTokenizer ( string regularExpression ) : System |
Tokenize ( string input ) : string[] |

public ResolveEntities ( string contentsWithUnresolvedEntityReferences ) : string

Parameter | Type | Description
---|---|---
contentsWithUnresolvedEntityReferences | string | The contents with the entity references.
return | string |
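
To illustrate what "replaces entity references with spaces" could mean in practice, here is a minimal standalone sketch. It is not the library's implementation: the class name `EntitySketch`, the method name `ReplaceEntitiesWithSpaces`, and the regular expression are all assumptions made for this example, and the real `ResolveEntities` may recognize a different set of references.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper, not part of SimpleHtmlTokenizer.
public static class EntitySketch
{
    // Assumed pattern: named (&amp;), decimal (&#160;) and hexadecimal (&#xA0;)
    // entity references, each terminated by a semicolon.
    private static readonly Regex EntityReference =
        new Regex(@"&(#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);");

    // Replaces every matched entity reference with a single space.
    public static string ReplaceEntitiesWithSpaces(string contents)
    {
        if (contents == null)
        {
            throw new ArgumentNullException(nameof(contents));
        }
        return EntityReference.Replace(contents, " ");
    }
}
```

Under these assumptions, `EntitySketch.ReplaceEntitiesWithSpaces("Tom&amp;Jerry&#160;show")` returns `"Tom Jerry show"`. A bare `&` without a terminating semicolon is left untouched by this pattern.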

public SimpleHtmlTokenizer ( int tokenizerConfig ) : System

Parameter | Type | Description
---|---|---
tokenizerConfig | int |
return | System |

public SimpleHtmlTokenizer ( string regularExpression ) : System

Parameter | Type | Description
---|---|---
regularExpression | string |
return | System |