C# Class NBoilerpipe.Extractors.ExtractorBase

The base class of Extractors.
The base class of Extractors. Also provides some helper methods to quickly retrieve the text that remained after processing.
Inheritance: BoilerpipeExtractor
Show file Open project: oganix/NBoilerpipe

Public Methods

Method Description
GetText ( NBoilerpipe.Document.TextDocument doc ) : string

Extracts text from the given NBoilerpipe.Document.TextDocument object.

GetText ( string html ) : string

Extracts text from the HTML code given as a String.

Extracts text from the HTML code given as a String.

Process ( NBoilerpipe.Document.TextDocument arg1 ) : bool

Method Details

GetText() public method

Extracts text from the given NBoilerpipe.Document.TextDocument object.
NBoilerpipe.BoilerpipeProcessingException ///
public GetText ( NBoilerpipe.Document.TextDocument doc ) : string
doc NBoilerpipe.Document.TextDocument /// The /// NBoilerpipe.Document.TextDocument /// . ///
return string

GetText() public method

Extracts text from the HTML code given as a String.
Extracts text from the HTML code given as a String.
NBoilerpipe.BoilerpipeProcessingException ///
public GetText ( string html ) : string
html string The HTML code as a String.
return string

Process() public abstract method

public abstract Process ( NBoilerpipe.Document.TextDocument arg1 ) : bool
arg1 NBoilerpipe.Document.TextDocument
return bool