PDF to OCR using Python
Convert PDF scans to searchable PDF documents.
The Python library for converting scanned PDF documents to searchable and editable PDF documents using optical character recognition (OCR). Add textual layer to scanned PDF document.
Try for FREEConvertAPI Python library install
ConvertAPI provides a Python library that allows you to perform a PDF to OCR conversion with just a few lines of code. Convert PDF to OCR documents using Python SDK with no effort at all!
pip install --upgrade convertapi
Authenticate your Python library
You can obtain your secret key by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your authentication secret from the account dashboard, and authenticate the ConvertAPI Python library like this:
import convertapi
convertapi.api_secret = 'your-api-secret'
Convert PDF to OCR using Python in no time!
Once you have your authentication in place, simply copy-paste this pdf to ocr conversion code snippet into your Python project:
Upload the file and see how it works
You can set up the advanced conversion parameters and test the conversion result online using our interactive demo tool. It will auto-generate the code snippet for you!
Advanced PDF to OCR conversion parameters
Sets the password to open protected documents.
Set page range. Example 1-10 or 1,2,5.
Set the OCR language. Ask support to add your language if missing.
Values: ca da nl de es en he pl pt ru sv tr lt