PDF to OCR using Python

Convert PDF scans to searchable PDF documents.

PDF OCR

The Python library for converting scanned PDF documents to searchable and editable PDF documents using optical character recognition (OCR). Add textual layer to scanned PDF document.

Try for FREE

ConvertAPI Python library install

ConvertAPI provides a Python library that allows you to perform a PDF to OCR conversion with just a few lines of code. Convert PDF to OCR documents using Python SDK with no effort at all!

Install with pip:
pip install --upgrade convertapi

Authenticate your Python library

You can obtain your secret key by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your authentication secret from the account dashboard, and authenticate the ConvertAPI Python library like this:

# get your secret key here: https://www.convertapi.com/a/auth
import convertapi
convertapi.api_secret = 'your-api-secret'

Convert PDF to OCR using Python in no time!

Once you have your authentication in place, simply copy-paste this pdf to ocr conversion code snippet into your Python project:

// Code snippet is using the ConvertAPI JavaScript Client: https://github.com/ConvertAPI/convertapi-js

// Code snippet is using the ConvertAPI Node.js Client: https://github.com/ConvertAPI/convertapi-nodejs

// Code snippet is using the ConvertAPI PHP Client: https://github.com/ConvertAPI/convertapi-php

// Code snippet is using the ConvertAPI Java Client: https://github.com/ConvertAPI/convertapi-java

// Code snippet is using the ConvertAPI C# Client: https://github.com/ConvertAPI/convertapi-dotnet

# Code snippet is using the ConvertAPI Ruby Client: https://github.com/ConvertAPI/convertapi-ruby

# Code snippet is using the ConvertAPI Python Client: https://github.com/ConvertAPI/convertapi-python

// Code snippet is using the ConvertAPI Go Client: https://github.com/ConvertAPI/convertapi-go

REM Code snippet is using the command line utility program: https://github.com/ConvertAPI/convertapi-cli

<!-- For conversions with the multiple file result please refer to this example: https://repl.it/@ConvertAPI/HTML-Form-with-multiple-file-result -->

Upload the file and see how it works

You can set up the advanced conversion parameters and test the conversion result online using our interactive demo tool. It will auto-generate the code snippet for you!

Advanced PDF to OCR conversion parameters

Password String

Sets the password to open protected documents.

PageRange String

Set page range. Example 1-10 or 1,2,5.

OcrLanguage Collection

Set the OCR language. Ask support to add your language if missing.

Values:   ca da nl de es en he pl pt ru sv tr lt

Try PDF to OCR for free!