C# Class NBoilerpipe.Filters.Heuristics.DocumentTitleMatchClassifier

Marks NBoilerpipe.Document.TextBlock s which contain parts of the HTML <TITLE> tag, using some heuristics which are quite specific to the news domain.
Inheritance: BoilerpipeFilter
Afficher le fichier Open project: oganix/NBoilerpipe

Méthodes publiques

Méthode Description
DocumentTitleMatchClassifier ( string title ) : System.Collections.Generic
GetPotentialTitles ( ) : ICollection
Process ( NBoilerpipe.Document.TextDocument doc ) : bool

Private Methods

Méthode Description
GetLongestPart ( string title, string pattern ) : string

Method Details

DocumentTitleMatchClassifier() public méthode

public DocumentTitleMatchClassifier ( string title ) : System.Collections.Generic
title string
Résultat System.Collections.Generic

GetPotentialTitles() public méthode

public GetPotentialTitles ( ) : ICollection
Résultat ICollection

Process() public méthode

public Process ( NBoilerpipe.Document.TextDocument doc ) : bool
doc NBoilerpipe.Document.TextDocument
Résultat bool