PDF to CSV using Python
The Python SDK for extracting data from PDF documents to the CSV files.
PDF to CSV features
Extract tables from textual and scanned PDF documents to comma-separated values CSV files. The Python library identifies bordered and border-less tabular structures within pdf documents and extracts these tables to a list of CSV formatted files.
ConvertAPI Python library install
ConvertAPI provides a Python library that allows you to perform a PDF to CSV conversion with just a few lines of code. Convert PDF to CSV documents using Python SDK with no effort at all!
pip install --upgrade convertapi
Authenticate your Python library
You can obtain your secret key by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your authentication secret from the account dashboard, and authenticate the ConvertAPI Python library like this:
import convertapi
convertapi.api_credentials = 'secret_or_token'
Convert PDF to CSV using Python in no time!
Once you have your authentication in place, simply copy-paste this pdf to csv conversion code snippet into your Python project:
Try the conversion online - no coding required!
You can try out advanced conversion parameters and test the conversion result online using our interactive demo tool. This tool will produce the same conversion output as if you were using the library from your solution, and it will auto-generate the code snippet for you!
Try for FREE!Conversion parameters
Sets the password to open protected documents.
Set page range. Example 1-10 or 1,2,5.
Set fields separator.
Enable optical character recognition(OCR). Set the property to ALL to perform OCR on all pages and Scanned to perform OCR on scanned pages if PDF contains mixed pages - text and image pages.
Values: Scanned All None
Set the OCR language. Ask support to add your language if missing.
Values: ca da nl fa de es en he pl pt ru sv tr lt