Files Handling

Get Text from PDF

A special toolkit implemented in Wizlink extracts a plain text from .pdf file and then parses it. A PDF Viewer shows retrieved text in a form of an array (one word in a cell). Numbers of columns and rows in the array start from 0.

Open PDF Viewer from Tools in main menu bar. Click the Load PDF option and choose the pdf file. Click Open. Wait patiently as it will take some time to parse the document.

You can click a cell in an array and PDF Viewer shows numbers of its row and column.

Put full path to the pdf file into Input part of Properties, declare variables for output part of Properties (where needed: Number of Rows, PDF Content, Rows, Columns per Row).

v2_fh_PDF
Acquired TextStringOutputtext input by user
Alternative Scenario If FailedBooleanInputcheckbox; default is "on"
Display NameLiteralInputchanges display name of the activity
Exclude from Transaction CheckBooleanInputcheckbox; default is "off"
File PathStringInputdirectory path to a file (file name and extension included)
Columns per RowInt32[]Outputvar(number of row): number of columns with the data in a given row
Number of RowsInt32Outputnumber of all rows in a table
PDF ContentString[][]Outputarray of text values retrieved from application
ResultBooleanOutputreturns a Boolean value. True if no error in activity, otherwise will return False
Result CodeInt32OutputResult Code will return 0 if there is no error in activity, otherwise in most cases will return 1. See the table of Extended Result Codes for more info
RowsString[]Outputvar(number of row): content of all cells in a given row