PDF to OCR using Python
Convert PDF scans to searchable PDF documents.
PDF to OCR features
The Python library for converting scanned PDF documents to searchable and editable PDF documents using optical character recognition (OCR). Add textual layer to scanned PDF document.
ConvertAPI Python library install
ConvertAPI provides a Python library that allows you to perform a PDF to OCR conversion with just a few lines of code. Convert PDF to OCR documents using Python SDK with no effort at all!
pip install --upgrade convertapi
Authenticate your Python library
You can obtain your secret key by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your authentication secret from the account dashboard, and authenticate the ConvertAPI Python library like this:
import convertapi
convertapi.api_credentials = 'secret_or_token'
Convert PDF to OCR using Python in no time!
Once you have your authentication in place, simply copy-paste this pdf to ocr conversion code snippet into your Python project:
Try the conversion online - no coding required!
You can try out advanced conversion parameters and test the conversion result online using our interactive demo tool. This tool will produce the same conversion output as if you were using the library from your solution, and it will auto-generate the code snippet for you!
Try for FREE!Conversion parameters
Sets the password to open protected documents.
Set page range. Example 1-10 or 1,2,5.
Set the OCR language. Ask support to add your language if missing.
Values: ca da nl fa de es en he pl pt ru sv tr lt