C# Class WikiParser.Parser

Parser class
Show file Open project: irfiit/wikipedia Class Usage Examples

Public Methods

Method Description
GetDisambiguationPagesFromParsedWikiDump ( string parsedInput ) : List

Load and show parsed XML file

GetDisambiguationPagesFromWikiDump ( string rawInput ) : List

Load, parse and show raw XML wiki dump file

GetPageLongDescription ( string text, string title ) : string

Remove metadata from raw long description

LoadDisambiguationPagesEmpty ( string parsedInput, List disambiguationPages ) : void

Load all disambiguation pages without additional informations (only list of them)

LoadEmptyPagesForDisambiguationPage ( string parsedInput, int index, List pages ) : void

Load all pages for selected disambiguation page without additional informations (only list of them)

LoadPageInfo ( string parsedInput, int indexDS, int indexP, PageInfo page ) : void

Load info about selected page

RemoveDuplicatePages ( List disambiguationPages ) : void

Remove pages that are duplicates

Private Methods

Method Description
FillDisambiguationPages ( string rawInput, List disambiguationPages ) : void

Find all pages for found disambiguation pages in raw wiki dump

FindDisambiguationPages ( string rawInput, List disambiguationPages ) : void

Find all disambiguation pages in raw wiki dump

LoadDisambiguationPages ( string parsedInput, List disambiguationPages ) : void

Load and show parsed XML file

Method Details

GetDisambiguationPagesFromParsedWikiDump() public method

Load and show parsed XML file
public GetDisambiguationPagesFromParsedWikiDump ( string parsedInput ) : List
parsedInput string
return List

GetDisambiguationPagesFromWikiDump() public method

Load, parse and show raw XML wiki dump file
public GetDisambiguationPagesFromWikiDump ( string rawInput ) : List
rawInput string
return List

GetPageLongDescription() public method

Remove metadata from raw long description
public GetPageLongDescription ( string text, string title ) : string
text string
title string
return string

LoadDisambiguationPagesEmpty() public method

Load all disambiguation pages without additional informations (only list of them)
public LoadDisambiguationPagesEmpty ( string parsedInput, List disambiguationPages ) : void
parsedInput string
disambiguationPages List
return void

LoadEmptyPagesForDisambiguationPage() public method

Load all pages for selected disambiguation page without additional informations (only list of them)
public LoadEmptyPagesForDisambiguationPage ( string parsedInput, int index, List pages ) : void
parsedInput string
index int
pages List
return void

LoadPageInfo() public method

Load info about selected page
public LoadPageInfo ( string parsedInput, int indexDS, int indexP, PageInfo page ) : void
parsedInput string
indexDS int
indexP int
page PageInfo
return void

RemoveDuplicatePages() public method

Remove pages that are duplicates
public RemoveDuplicatePages ( List disambiguationPages ) : void
disambiguationPages List
return void