PDF to CSV API

Extract tables from textual and scanned PDF documents to comma-separated values (CSV) files. The API identifies bordered and borderless tabular structures within PDF documents and extracts these tables to a list of CSV-formatted files.

API Request:

Secret
Optional

Type: String

Authentication secret. Must be provided as a query parameter. If omitted, Token must be provided.

Token
Optional

Type: String

Authentication token. Must be provided as a query parameter. If omitted, Secret must be provided.
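The authentication rule above (at least one of Secret or Token must be present in the query string) can be sketched as a small helper. The function name `build_auth_query` is hypothetical, and only the parameter names Secret and Token come from this page. The page does not say what happens when both are supplied, so this sketch simply passes along whatever it is given.

```python
from urllib.parse import urlencode


def build_auth_query(secret=None, token=None):
    """Return a URL query string carrying Secret and/or Token.

    Hypothetical helper: it only enforces the documented rule that
    at least one of the two credentials must be supplied.
    """
    if not secret and not token:
        raise ValueError("Either Secret or Token must be provided")
    params = {}
    if secret:
        params["Secret"] = secret
    if token:
        params["Token"] = token
    # urlencode percent-escapes the values so they are safe in a URL.
    return urlencode(params)
```

The returned string would be appended after the `?` of the request URL.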

File
Required

Type: File

File to be converted. The value can be a URL or the file content.

StoreFile (Store file)
Optional

Type: Bool

Store the converted file on our secure server and provide a download URL.

Default: False

FileName (File name)
Optional

Type: String

Name of the converted output file, without extension. The extension is added automatically.

Timeout
Optional

Type: Integer

Conversion timeout in seconds.

Default: 600

Range: 10 .. 1200
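A client may want to check the Timeout value before sending a request. The following is a minimal sketch, assuming the documented range 10 .. 1200 is inclusive at both ends. The name `resolve_timeout` is hypothetical.

```python
DEFAULT_TIMEOUT = 600                 # seconds, the documented default
MIN_TIMEOUT, MAX_TIMEOUT = 10, 1200   # documented range, assumed inclusive


def resolve_timeout(value=None):
    """Return a valid Timeout in seconds, falling back to the default."""
    if value is None:
        return DEFAULT_TIMEOUT
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError("Timeout must be an integer number of seconds")
    if not MIN_TIMEOUT <= value <= MAX_TIMEOUT:
        raise ValueError(
            f"Timeout must be between {MIN_TIMEOUT} and {MAX_TIMEOUT} seconds"
        )
    return value
```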

Async (Asynchronous)
Optional

Type: Bool

Run the conversion job asynchronously.

Default: False

JobId (Job ID)
Optional

Type: String

Self-generated UUID (RFC 4122) of the conversion job, used to retrieve the conversion result asynchronously.

WebHook
Optional

Type: String

WebHook URL to call after the asynchronous conversion has finished. The Async parameter must be enabled.
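The three asynchronous parameters (Async, JobId, WebHook) work together, which the sketch below illustrates. It rests on several assumptions that this page does not confirm: that "self-generated" means the client supplies the JobId, that Bool values are sent as the string "true", and that the helper name `build_async_params` is purely illustrative.

```python
import uuid


def build_async_params(webhook_url=None, job_id=None):
    """Return request parameters for an asynchronous conversion.

    Hypothetical helper; the "true" serialization is an assumption.
    """
    params = {
        "Async": "true",
        # uuid4() produces a random UUID in the RFC 4122 format.
        "JobId": job_id or str(uuid.uuid4()),
    }
    if webhook_url:
        # WebHook requires Async to be enabled, which is always the case here.
        params["WebHook"] = webhook_url
    return params
```

Keeping the generated JobId on the client side lets the caller look up the result later without relying on the WebHook call.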

DocumentPassword (Document Password)
Optional

Type: String

Sets the password used to open protected documents.

PageRange (Page Range)
Optional

Type: String

Set the page range. Example: 1-10 or 1,2,5.

Default: 1-2000
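To make the two PageRange forms concrete, the sketch below expands a range string into explicit page numbers on the client side. The function name `expand_page_range` is hypothetical. The sketch also accepts mixed input such as "1-3,7", which this page does not confirm the API supports.

```python
def expand_page_range(page_range="1-2000"):
    """Expand a PageRange string such as "1-10" or "1,2,5" into page numbers."""
    pages = []
    for part in page_range.split(","):
        part = part.strip()
        if "-" in part:
            # A span like "1-10" covers both endpoints.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start < 1 or end < start:
                raise ValueError(f"Invalid page span: {part!r}")
            pages.extend(range(start, end + 1))
        else:
            page = int(part)
            if page < 1:
                raise ValueError(f"Invalid page number: {part!r}")
            pages.append(page)
    return pages
```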

AutoRotate (Auto Rotate)
Optional

Type: Bool

Automatic page orientation detection.

Default: False

OcrLanguage (OCR Language)
Optional

Type: Collection

Set the OCR language.

Default: automatic

Values: automatic, ca, da, de, es, en, fr, fi, he, it, ko, ja, nl, no, pl, pt, ro, ru, sv, sl, tr

EnableOcr (Enable OCR)
Optional

Type: Bool

Enable optical character recognition (OCR).

Default: True

OcrType (OCR Type)
Optional

Type: Collection

Set the OCR type. The automatic option tries to detect non-textual content in the PDF and performs OCR on it. The always option performs OCR on the entire PDF; it is slower than automatic but can improve OCR results in some cases.

Default: automatic

Values: automatic, always
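The OCR parameters accept only the listed values, so a client can validate them before sending a request. The sketch below is one way to do that. The helper name `build_ocr_params` and the "true"/"false" serialization of Bool values are assumptions, while the allowed values and defaults are copied from the lists above.

```python
OCR_LANGUAGES = {
    "automatic", "ca", "da", "de", "es", "en", "fr", "fi", "he", "it", "ko",
    "ja", "nl", "no", "pl", "pt", "ro", "ru", "sv", "sl", "tr",
}
OCR_TYPES = {"automatic", "always"}


def build_ocr_params(language="automatic", ocr_type="automatic", enable_ocr=True):
    """Validate OCR options and return them as request parameters."""
    if language not in OCR_LANGUAGES:
        raise ValueError(f"Unsupported OcrLanguage: {language!r}")
    if ocr_type not in OCR_TYPES:
        raise ValueError(f"Unsupported OcrType: {ocr_type!r}")
    return {
        # Assumed serialization of the Bool type.
        "EnableOcr": "true" if enable_ocr else "false",
        "OcrLanguage": language,
        "OcrType": ocr_type,
    }
```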

IncludeInvisibleText (Include invisible text)
Optional

Type: Bool

Include invisible text, where the text and background colors are the same.

Default: False

Developer mode

Snippets are autogenerated according to the converter parameter choices above. They are available for the ConvertAPI JavaScript, Node.js, PHP, Java, C#, Ruby, Python, and Go clients, and for the command line utility program.

For conversions that return multiple result files, please refer to the example.