C# Class AzureSearchOCRwithKeywordExtraction.PdfImageExtractor

Helper class to extract images from a PDF file. Works with the most common image types embedded in PDF files, as far as I can tell.
Show file Open project: liamca/AzureSearchOCR Class Usage Examples

Public Methods

Method Description
ExtractImages ( string filename ) : System.Drawing.Image>.Dictionary

Extracts all images (of types that iTextSharp knows how to decode) from a PDF file.

ExtractImages ( string filename, int pageNumber ) : System.Drawing.Image>.Dictionary

Extracts all images (of types that iTextSharp knows how to decode) from a specified page of a PDF file.

PageContainsImages ( string filename, int pageNumber ) : bool

Checks whether a specified page of a PDF file contains images.

Method Details

ExtractImages() public static method

Extracts all images (of types that iTextSharp knows how to decode) from a PDF file.
public static ExtractImages ( string filename ) : System.Drawing.Image>.Dictionary
filename string
return System.Drawing.Image>.Dictionary

ExtractImages() public static method

Extracts all images (of types that iTextSharp knows how to decode) from a specified page of a PDF file.
public static ExtractImages ( string filename, int pageNumber ) : System.Drawing.Image>.Dictionary
filename string
pageNumber int
return System.Drawing.Image>.Dictionary

PageContainsImages() public static method

Checks whether a specified page of a PDF file contains images.
public static PageContainsImages ( string filename, int pageNumber ) : bool
filename string
pageNumber int
return bool