C# Class NCrawler.HtmlProcessor.HtmlDocumentProcessorPipelineStep

Inheritance: ContentCrawlerRules, IPipelineStep
Project: esbencarlsen/NCrawler

Public Methods

Method Description
HtmlDocumentProcessorPipelineStep ( int maxDegreeOfParallelism )
HtmlDocumentProcessorPipelineStep ( int maxDegreeOfParallelism, Dictionary<string, string> filterTextRules, Dictionary<string, string> filterLinksRules )
Process ( ICrawler crawler, PropertyBag propertyBag ) : Task

Protected Methods

Method Description
NormalizeLink ( string baseUrl, string link ) : string

Private Methods

Method Description
IsHtmlContent ( string contentType ) : bool

Method Details

HtmlDocumentProcessorPipelineStep() public method

public HtmlDocumentProcessorPipelineStep ( int maxDegreeOfParallelism )
maxDegreeOfParallelism int

HtmlDocumentProcessorPipelineStep() public method

public HtmlDocumentProcessorPipelineStep ( int maxDegreeOfParallelism, Dictionary<string, string> filterTextRules, Dictionary<string, string> filterLinksRules )
maxDegreeOfParallelism int
filterTextRules Dictionary<string, string>
filterLinksRules Dictionary<string, string>

NormalizeLink() protected method

protected NormalizeLink ( string baseUrl, string link ) : string
baseUrl string
link string
return string
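
The listing gives only the signature of NormalizeLink, so the following is a minimal sketch of what resolving a link against a base URL could look like, not the NCrawler implementation. The class name LinkNormalizerSketch and the method name Normalize are hypothetical; only System.Uri from the .NET base class library is used, and returning null for unresolvable input is an assumption of this sketch.

```csharp
using System;

// Hypothetical helper for illustration only; this is NOT NCrawler's code.
public static class LinkNormalizerSketch
{
    // Resolves `link` against `baseUrl` and returns the absolute URL,
    // or null when either input cannot be parsed (an assumption of this sketch).
    public static string Normalize(string baseUrl, string link)
    {
        if (string.IsNullOrWhiteSpace(link))
        {
            return null;
        }

        // The base must be an absolute URI, otherwise there is nothing to resolve against.
        if (!Uri.TryCreate(baseUrl, UriKind.Absolute, out Uri baseUri))
        {
            return null;
        }

        // Uri.TryCreate(Uri, string, out Uri) combines the base with a relative link;
        // an already absolute link is returned unchanged.
        if (!Uri.TryCreate(baseUri, link.Trim(), out Uri resolved))
        {
            return null;
        }

        return resolved.AbsoluteUri;
    }
}
```

Under these assumptions, a subclass could override the protected NormalizeLink method and delegate to a helper like this one; whether the method is declared virtual is not shown in this listing.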

Process() public method

public Process ( ICrawler crawler, PropertyBag propertyBag ) : Task
crawler ICrawler
propertyBag PropertyBag
return Task