Foxit PDF SDK
foxit.addon.ocr.OCR Class Reference
Inheritance diagram for foxit.addon.ocr.OCR:
foxit.common.Base

Public Member Functions

 OCR ()
 Constructor.
 
 OCR (OCR other)
 Constructor, with another ocr object. More...
 
OCRSuspectInfoArray GetOCRSuspectsInfo (PDFDoc ocred_pdf_doc)
 Get OCR suspicious information. More...
 
bool IsEmpty ()
 Check whether current object is empty or not. More...
 
void OCRPDFDocument (PDFDoc pdf_doc, bool is_editable)
 OCR each page of a PDF document. More...
 
void OCRPDFDocuments (OCRSettingDataArray settingdata_array)
 OCR multiple pages of multiple PDF documents. More...
 
void OCRPDFPage (PDFPage pdf_page, bool is_editable)
 OCR a PDF page. More...
 

Detailed Description

This class is used to do OCR for a PDF page or a PDF document. Please ensure OCR engine has been initialized before using this class.

See also
OCREngine

Constructor & Destructor Documentation

◆ OCR()

foxit.addon.ocr.OCR.OCR ( OCR  other)
inline

Constructor, with another ocr object.

Parameters
[in]otherAnother ocr object.

Member Function Documentation

◆ GetOCRSuspectsInfo()

OCRSuspectInfoArray foxit.addon.ocr.OCR.GetOCRSuspectsInfo ( PDFDoc  ocred_pdf_doc)
inline

Get OCR suspicious information.

The parameter ocred_pdf_doc is a valid PDF document that should have been ocred.

Parameters
[in]ocred_pdf_docA valid PDF document object.
Returns
An array of OCRSuspectInfo objects, If its value is empty, that means the document OCR has no suspicious information.

◆ IsEmpty()

bool foxit.addon.ocr.OCR.IsEmpty ( )
inline

Check whether current object is empty or not.

When the current object is empty, that means current object is useless.

Returns
true means current object is empty, while false means not.

◆ OCRPDFDocument()

void foxit.addon.ocr.OCR.OCRPDFDocument ( PDFDoc  pdf_doc,
bool  is_editable 
)
inline

OCR each page of a PDF document.

After this function succeeds, the PDF page content may be changed. It is better to parse or re-parse PDF pages in the input PDF document before using these pages.

Parameters
[in]pdf_docA valid PDF document object.
[in]is_editabletrue means the OCR result is editable. false means the OCR result can only be searched but not be edited.
Returns
None.

◆ OCRPDFDocuments()

void foxit.addon.ocr.OCR.OCRPDFDocuments ( OCRSettingDataArray  settingdata_array)
inline

OCR multiple pages of multiple PDF documents.

This function can be used to batch process multiple documents or pages. Users can set documents and page ranges via OCRSettingDataArray . The time performance of this function will be better than calling OCR.OCRPDFDocument or OCR.OCRPDFPage multiple times when dealing with a large number of documents or pages. After successful execution, the page content may be changed, it is better to parse or re-parse the PDF pages before using these pages.

Parameters
[in]settingdata_arrayAn array of OCRSettingData objects, if the parameter page_range of OCRSettingData object is empty, that means OCR each page of the PDF document.
Returns
None.

◆ OCRPDFPage()

void foxit.addon.ocr.OCR.OCRPDFPage ( PDFPage  pdf_page,
bool  is_editable 
)
inline

OCR a PDF page.

After this function succeeds, the PDF page content may be changed and the input PDF page is recommended to be re-parsed.

Parameters
[in]pdf_pageA valid PDF page object. This PDF page should have been parsed.
[in]is_editabletrue means the OCR result is editable. false means the OCR result can only be searched but not be edit.
Returns
None.