C# Class NBoilerpipe.Filters.English.TerminatingBlocksFinder

Finds blocks which are potentially indicating the end of an article text and marks them with NBoilerpipe.Labels.DefaultLabels.INDICATES_END_OF_TEXT . This can be used in conjunction with a downstream IgnoreBlocksAfterContentFilter .
Inheritance: BoilerpipeFilter
Show file Open project: oganix/NBoilerpipe

Public Properties

Property Type Description
INSTANCE TerminatingBlocksFinder

Public Methods

Method Description
GetInstance ( ) : TerminatingBlocksFinder

Returns the singleton instance for TerminatingBlocksFinder.

Returns the singleton instance for TerminatingBlocksFinder.

Process ( NBoilerpipe.Document.TextDocument doc ) : bool

Private Methods

Method Description
IsDigit ( char c ) : bool
StartsWithNumber ( string t, int len ) : bool

Checks whether the given text t starts with a sequence of digits, followed by one of the given strings.

Checks whether the given text t starts with a sequence of digits, followed by one of the given strings.

Method Details

GetInstance() public static method

Returns the singleton instance for TerminatingBlocksFinder.
Returns the singleton instance for TerminatingBlocksFinder.
public static GetInstance ( ) : TerminatingBlocksFinder
return TerminatingBlocksFinder

Process() public method

public Process ( NBoilerpipe.Document.TextDocument doc ) : bool
doc NBoilerpipe.Document.TextDocument
return bool

Property Details

INSTANCE public static property

public static TerminatingBlocksFinder,NBoilerpipe.Filters.English INSTANCE
return TerminatingBlocksFinder