PDF OCR Python Overview

Convert scanned PDF documents into fully searchable and editable PDFs using accurate OCR. Detect text in page images and add a hidden text layer that preserves the original appearance while enabling selection and indexing. Select OCR languages to maximize accuracy for multilingual content, choose recognition mode and page segmentation, and limit processing to page ranges for faster runs. Handle password‑protected inputs and tune timeouts for large files. Ideal for digitization, compliance, and content discovery.

Lightning Fast Conversions

Process and convert files in seconds with our high-performance cloud infrastructure.

Accuracy Guaranteed

Our advanced algorithms ensure pixel-perfect and content-accurate file conversions.

Enterprise-Grade Security

ISO 27001, HIPAA, and GDPR compliant with encrypted file processing.

Global Infrastructure

Strategically located servers ensure low latency and high availability worldwide.

Developer Friendly

Comprehensive SDKs and clear documentation for quick and simple integration.

Time-Saving Automation

Automate repetitive document workflows and focus on what matters most.

Customizable Parameters

Fine-tune your automation with these powerful conversion options

File

File Supported formats: .pdf

File to be converted. Value can be URL or file content.

Password

String

Sets the password to open protected documents.

PageRange

String Default: 1-2000

Set page range. Example 1-10 or 1,2,5.

OcrMode

Collection Default: auto

The OcrMode setting controls how OCR is applied to PDF pages. In Auto mode, pages that already contain text are skipped, so OCR runs only where needed. In Always mode, OCR is applied to every page, keeping existing text while adding recognition for images and other visual content. In Reprocess mode, OCR is performed on the whole page, producing a fresh text layer regardless of what was there before.

Values: auto always reprocess

OcrLanguage

Collection Default: en

Set the OCR language. Ask support to add your language if missing.

Values: ar ca zh-cn zh-tw da nl en fi fa de el he it ja ko lt no pl pt ro ru sl es sv tr ua th

PageSegmentationMode

Collection Default: sparseText

The PageSegmentationMode parameter specifies how the OCR engine segments and interprets text within PDF documents. Choosing the appropriate mode enhances OCR accuracy by aligning closely with your document's layout and structure.

Select one of the available modes to control text detection and layout analysis:

SparseText - Detects as much text as possible without enforcing any specific order. Suitable for documents containing scattered or fragmented text.
SparseTextOsd - Similar to SparseText, but also includes orientation and script detection (OSD). Useful for documents with rotated text or multiple scripts and languages.
Auto - Automatically selects the best segmentation mode based on document content. Ideal for general documents with mixed or unknown layouts.
AutoOsd - Combines automatic segmentation with orientation and script detection. Recommended for documents with uncertain text orientation or multilingual content.
SingleColumn - Assumes a single column of text with varying text sizes. Best suited for straightforward layouts.
SingleLine - Treats the entire image as a single line of text. Useful for single-line labels, banners, or narrow text snippets.
SingleWord - Treats the entire image as a single word. Ideal for recognizing isolated words or short phrases.

Values: sparseText sparseTextOsd auto autoOsd singleLine singleColumn singleWord

StoreFile

Bool Default: False

When the StoreFile parameter is set to True, your converted file is written to ConvertAPI’s encrypted, temporary storage and made available via a time-limited secure download URL, valid for up to 3 hours. After this period, the file is permanently deleted.

When StoreFile is set to False, conversion happens entirely in-memory. The raw file bytes are streamed back in the API response without touching disk or external storage, ensuring maximum security and zero persistence so that only you can access the content.

OutputType

Collection Default: pdf

This property is used to determine how the OCR layer should be returned. If the output type is PDF, the OCR layer will be embedded into the PDF file. Alternatively, if a text output is selected, the OCR layer will be returned as a text file.

Values: pdf txt

Step-by-Step Guide

Easy PDF OCR integration programmatically using our Python library

1. ConvertAPI Python library install

ConvertAPI provides a Python library that allows you to perform a PDF OCR conversion with just a few lines of code. PDF OCR documents using Python SDK with no effort at all!

Install with pip:

pip install --upgrade convertapi

PyPI GitHub

2. Authenticate your Python library

You can obtain your API Token by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your API Token from the account dashboard, and authenticate the ConvertAPI Python library like this:

# get your API Token here: https://www.convertapi.com/a/auth


                    import convertapi
                    

                    convertapi.api_credentials = 'api_token'

3. PDF OCR using Python in no time!

Once you have your authentication in place, simply copy-paste this pdf to ocr conversion code snippet into your Python project:

PDF OCR in Python

// Code snippet is using the ConvertAPI JavaScript Client: https://github.com/ConvertAPI/convertapi-library-js

// Code snippet is using the ConvertAPI Node.js Client: https://github.com/ConvertAPI/convertapi-nodejs

// Code snippet is using the ConvertAPI PHP Client: https://github.com/ConvertAPI/convertapi-php

// Code snippet is using the ConvertAPI Java Client: https://github.com/ConvertAPI/convertapi-java

// Code snippet is using the ConvertAPI C# Client: https://github.com/ConvertAPI/convertapi-dotnet

# Code snippet is using the ConvertAPI Ruby Client: https://github.com/ConvertAPI/convertapi-ruby

# Code snippet is using the ConvertAPI Python Client: https://github.com/ConvertAPI/convertapi-python

// Code snippet is using the ConvertAPI Go Client: https://github.com/ConvertAPI/convertapi-go

REM Code snippet is using the command line utility program: https://github.com/ConvertAPI/convertapi-cli

Integrate within minutes

Easy PDF OCR automation using our simple Python SDK

GitHub Repository

Explore the source code and examples on GitHub.

PyPI Package

View ConvertAPI package and versions on the Python Package Index.

Python SDK Documentation

Read more about the ConvertAPI Python SDK capabilities.

Try the PDF OCR conversion online

Try it Free

Compatible With all Python Frameworks & Tools

Businesses trust us

Highest rated File Conversion API on major B2B software listing platforms: Capterra, G2, and Trustpilot.

"ConvertAPI has been a game-changer for our document automation workflows. Their conversion accuracy and API reliability are unmatched in the industry for over 7 years."

"ConvertAPI is a reliable, cost-effective solution with a proven track record of stability. It has grown significantly in maturity, adopting enterprise-grade practices over the years."

"We've integrated ConvertAPI across our entire document processing platform. The performance is exceptional and the support team is always responsive. Highly recommended!"

Enterprise-Grade Security

We ensure that all document processing is handled securely in the cloud, adhering to industry-leading standards like ISO 27001, GDPR, and HIPAA. To enhance security even further, we can ensure that no files or data are stored on our servers and never leave your country.

Learn more about security

Ready to Streamline Your File Conversions?

Get Started for Free Contact Us

PDF Print Production

PDF Redact

PDF Accessibility

PDF Templating

API Tools

SDK Libraries

No-Code Integrations

Developer Hub

Security & Compliance

Blog

Affiliates

Support

PDF OCR Python

Convert scanned PDFs to searchable, editable documents with OCR language selection, recognition modes, and page range control.

PDF OCR in Python

Lightning Fast Conversions

Accuracy Guaranteed

Enterprise-Grade Security

Global Infrastructure

Developer Friendly

Time-Saving Automation

1. ConvertAPI Python library install

2. Authenticate your Python library

3. PDF OCR using Python in no time!

PDF OCR in Python

GitHub Repository

PyPI Package

Python SDK Documentation

Try the PDF OCR conversion online

Compatible With all Python Frameworks & Tools

Businesses trust us

Enterprise-Grade Security

Ready to Streamline Your File Conversions?