Foxit PDF Conversion SDK
PDFConversionSDK.PDF2OfficeSettingData Class Reference

Public Member Functions

 constructor (metrics_data_folder_path, enable_ml_recognition, page_range, include_pdf_comments, word_setting_data, power_point_setting_data, excel_setting_data, enable_trailing_space, include_images)
 Constructor, with parameters.
 
 Set (metrics_data_folder_path, enable_ml_recognition, page_range, include_pdf_comments, word_setting_data, power_point_setting_data, excel_setting_data, enable_trailing_space, include_images)
 Set value.
 

Detailed Description

This class represents the setting data used for converting a PDF file to an Office (Word, Excel or PowerPoint) format file.

Member Function Documentation

◆ constructor()

PDFConversionSDK.PDF2OfficeSettingData.constructor (
    metrics_data_folder_path,
    enable_ml_recognition,
    page_range,
    include_pdf_comments,
    word_setting_data,
    power_point_setting_data,
    excel_setting_data,
    enable_trailing_space,
    include_images
)

Constructor, with parameters.

Parameters
[in] metrics_data_folder_path: (This property has been deprecated since Foxit PDF Conversion SDK 2.1; metrics data is instead included in the dynamic library.) A valid path to a folder that contains metrics data files. This should not be an empty string. The metrics data files are used to simulate the Office format document typesetting process during conversion. They are provided in the "res/metrics_data" folder of the Foxit PDF Conversion SDK package.
[in] enable_ml_recognition: A boolean value that indicates whether to enable machine learning-based recognition. Default value: false. true means to enable machine learning-based recognition to identify borderless tables in PDF documents. To convert these tables better, a black border with a width of 1 is drawn on each recognized borderless table; this will be improved in subsequent versions. The recognition is executed on the server side, which returns the relevant results when it is done. false means to disable machine learning-based recognition; the recognition that is not based on machine learning is enabled instead.
[in] page_range: A range object that specifies some pages. These pages will be used to convert PDF documents to Office format documents.
[in] include_pdf_comments: A boolean value that indicates whether to include PDF document comments when converting PDF documents to Office format documents. Default value: true. true means to include the comments. false means not to include them.
[in] word_setting_data: A setting data object that is used for converting PDF documents to Word format documents.
[in] power_point_setting_data: A setting data object that is used for converting PDF documents to PowerPoint format documents.
[in] excel_setting_data: A setting data object that is used for converting PDF documents to Excel format documents.
[in] enable_trailing_space: A boolean value that indicates whether to add trailing spaces to lines in paragraphs when converting PDF documents to Office format documents. Default value: true. true means to add trailing spaces to lines in paragraphs. false means not to add trailing spaces and, instead, to add a property to the paragraph that allows line breaks. Currently, this is supported only for paragraphs in which, for more than 80% of the text, the end of one line and the beginning of the next are both in Korean. This is only available for converting PDF documents to Word format documents, with support for other Office formats planned for the future.
[in] include_images: A boolean value that indicates whether to include the images in PDF documents when converting PDF documents to Office format documents. Default value: true. true means to include the images. false means not to include them. This is only available for converting PDF documents to Word format documents, with support for other Office formats planned for the future.
Note
Our machine learning-based technology for identifying borderless tables uses HTTPS and requires a network connection to send the images of the rendered PDF pages to our server during the conversion process.
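
Because the constructor takes nine positional parameters, it is easy to misplace one. The following is a minimal sketch, not part of the SDK, of a helper that builds the argument list in the documented order and fills in the documented defaults. The names Pdf2OfficeOptions and toPositionalArgs are hypothetical, the range and per-format setting objects are treated as opaque values, and the final constructor call shape is an assumption that should be checked against the SDK package.

```typescript
// Hypothetical options object keyed by the documented parameter names.
// Range and per-format setting objects come from the SDK; this sketch
// treats them as opaque values.
interface Pdf2OfficeOptions {
  metrics_data_folder_path: string; // deprecated since SDK 2.1 per the text above
  enable_ml_recognition?: boolean;  // documented default: false
  page_range: unknown;
  include_pdf_comments?: boolean;   // documented default: true
  word_setting_data: unknown;
  power_point_setting_data: unknown;
  excel_setting_data: unknown;
  enable_trailing_space?: boolean;  // documented default: true
  include_images?: boolean;         // documented default: true
}

type Pdf2OfficeArgs = [
  string, boolean, unknown, boolean, unknown, unknown, unknown, boolean, boolean
];

// Returns the nine arguments in the order listed in the signature above.
function toPositionalArgs(o: Pdf2OfficeOptions): Pdf2OfficeArgs {
  return [
    o.metrics_data_folder_path,
    o.enable_ml_recognition ?? false,
    o.page_range,
    o.include_pdf_comments ?? true,
    o.word_setting_data,
    o.power_point_setting_data,
    o.excel_setting_data,
    o.enable_trailing_space ?? true,
    o.include_images ?? true,
  ];
}

// Placeholder objects stand in for real SDK range and setting objects.
const args = toPositionalArgs({
  metrics_data_folder_path: "res/metrics_data",
  page_range: {},
  word_setting_data: {},
  power_point_setting_data: {},
  excel_setting_data: {},
});

// Assumed call shape, not verified against the SDK:
// const settings = new PDFConversionSDK.PDF2OfficeSettingData(...args);
```

Leaving enable_ml_recognition at its default of false also avoids the network transfer described in the note above.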

◆ Set()

PDFConversionSDK.PDF2OfficeSettingData.Set (
    metrics_data_folder_path,
    enable_ml_recognition,
    page_range,
    include_pdf_comments,
    word_setting_data,
    power_point_setting_data,
    excel_setting_data,
    enable_trailing_space,
    include_images
)

Set value.

Parameters
[in] metrics_data_folder_path: (This property has been deprecated since Foxit PDF Conversion SDK 2.1; metrics data is instead included in the dynamic library.) A valid path to a folder that contains metrics data files. This should not be an empty string. The metrics data files are used to simulate the Office format document typesetting process during conversion. They are provided in the "res/metrics_data" folder of the Foxit PDF Conversion SDK package.
[in] enable_ml_recognition: A boolean value that indicates whether to enable machine learning-based recognition. Default value: false. true means to enable machine learning-based recognition to identify borderless tables in PDF documents. To convert these tables better, a black border with a width of 1 is drawn on each recognized borderless table; this will be improved in subsequent versions. The recognition is executed on the server side, which returns the relevant results when it is done. false means to disable machine learning-based recognition; the recognition that is not based on machine learning is enabled instead.
[in] page_range: A range object that specifies some pages. These pages will be used to convert PDF documents to Office format documents.
[in] include_pdf_comments: A boolean value that indicates whether to include PDF document comments when converting PDF documents to Office format documents. Default value: true. true means to include the comments. false means not to include them.
[in] word_setting_data: A setting data object that is used for converting PDF documents to Word format documents.
[in] power_point_setting_data: A setting data object that is used for converting PDF documents to PowerPoint format documents.
[in] excel_setting_data: A setting data object that is used for converting PDF documents to Excel format documents.
[in] enable_trailing_space: A boolean value that indicates whether to add trailing spaces to lines in paragraphs when converting PDF documents to Office format documents. Default value: true. true means to add trailing spaces to lines in paragraphs. false means not to add trailing spaces and, instead, to add a property to the paragraph that allows line breaks. Currently, this is supported only for paragraphs in which, for more than 80% of the text, the end of one line and the beginning of the next are both in Korean. This is only available for converting PDF documents to Word format documents, with support for other Office formats planned for the future.
[in] include_images: A boolean value that indicates whether to include the images in PDF documents when converting PDF documents to Office format documents. Default value: true. true means to include the images. false means not to include them. This is only available for converting PDF documents to Word format documents, with support for other Office formats planned for the future.
Note
Our machine learning-based technology for identifying borderless tables uses HTTPS and requires a network connection to send the images of the rendered PDF pages to our server during the conversion process.
Returns
None.
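
Set() takes the same full parameter list as the constructor, so changing a single value means supplying all nine again. The sketch below illustrates one way to handle that under stated assumptions: the caller keeps its own record of the current values, merges in the changes, and passes the complete list to Set(). SettingValues, HasSet and applyChanges are hypothetical names; HasSet models only the documented Set() method and is not the SDK's actual type.

```typescript
// Caller-side record of the nine documented values (opaque where the
// SDK supplies its own objects).
interface SettingValues {
  metrics_data_folder_path: string;
  enable_ml_recognition: boolean;
  page_range: unknown;
  include_pdf_comments: boolean;
  word_setting_data: unknown;
  power_point_setting_data: unknown;
  excel_setting_data: unknown;
  enable_trailing_space: boolean;
  include_images: boolean;
}

// Hypothetical minimal view of a setting data object: just Set().
interface HasSet {
  Set(...args: unknown[]): void;
}

// Merges `changes` into `current`, calls Set() with all nine values in
// the documented order, and returns the merged record for later reuse.
function applyChanges(
  target: HasSet,
  current: SettingValues,
  changes: Partial<SettingValues>
): SettingValues {
  const next: SettingValues = { ...current, ...changes };
  target.Set(
    next.metrics_data_folder_path,
    next.enable_ml_recognition,
    next.page_range,
    next.include_pdf_comments,
    next.word_setting_data,
    next.power_point_setting_data,
    next.excel_setting_data,
    next.enable_trailing_space,
    next.include_images
  );
  return next;
}
```

Keeping the record on the caller side avoids assuming that the setting data object exposes readable properties, which this page does not document.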