PDF OCR Go

Convert scanned PDFs to searchable, editable documents with OCR language selection, recognition modes, and page range control.


PDF OCR Go Overview

Convert scanned PDF documents into fully searchable and editable PDFs using accurate OCR. Detect text in page images and add a hidden text layer that preserves the original appearance while enabling selection and indexing. Select OCR languages to maximize accuracy for multilingual content, choose recognition mode and page segmentation, and limit processing to page ranges for faster runs. Handle password‑protected inputs and tune timeouts for large files. Ideal for digitization, compliance, and content discovery.

Lightning Fast Conversions

Process and convert files in seconds with our high-performance cloud infrastructure.

Accuracy Guaranteed

Our advanced algorithms ensure pixel-perfect and content-accurate file conversions.

Enterprise-Grade Security

ISO 27001, HIPAA, and GDPR compliant with encrypted file processing.

Global Infrastructure

Strategically located servers ensure low latency and high availability worldwide.

Developer Friendly

Comprehensive SDKs and clear documentation for quick and simple integration.

Time-Saving Automation

Automate repetitive document workflows and focus on what matters most.

Customizable Parameters

Fine-tune your automation with these powerful conversion options

File

File Supported formats: .pdf

File to be converted. The value can be a URL or the file content.

Password

String

Sets the password to open protected documents.

PageRange

String Default: 1-2000

Sets the page range to process, for example 1-10 or 1,2,5.
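Before sending a request, it can help to check a PageRange value locally. The following is a minimal sketch, not part of the SDK: `ParsePageRange` is a hypothetical helper, and accepting a mix of both forms (such as 1-3,7) is an assumption, since only the two separate forms are shown above.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// ParsePageRange expands a PageRange string such as "1-10" or "1,2,5"
// into an ordered list of page numbers.
// Hypothetical helper: mixing both forms ("1-3,7") is accepted here as an
// assumption; the service may apply different rules.
func ParsePageRange(s string) ([]int, error) {
	var pages []int
	for _, part := range strings.Split(s, ",") {
		part = strings.TrimSpace(part)
		if part == "" {
			return nil, fmt.Errorf("empty segment in page range %q", s)
		}
		lo, hi, isRange := strings.Cut(part, "-")
		start, err := strconv.Atoi(strings.TrimSpace(lo))
		if err != nil || start < 1 {
			return nil, fmt.Errorf("invalid page %q", lo)
		}
		end := start
		if isRange {
			end, err = strconv.Atoi(strings.TrimSpace(hi))
			if err != nil || end < start {
				return nil, fmt.Errorf("invalid range %q", part)
			}
		}
		for p := start; p <= end; p++ {
			pages = append(pages, p)
		}
	}
	return pages, nil
}

func main() {
	for _, in := range []string{"1-10", "1,2,5", "3-1"} {
		pages, err := ParsePageRange(in)
		fmt.Println(in, pages, err)
	}
}
```

Rejecting a malformed range on the client side avoids spending a conversion on a request that would fail anyway.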

OcrMode

Collection Default: auto

The OcrMode setting controls how OCR is applied to PDF pages. In Auto mode, pages that already contain text are skipped, so OCR runs only where needed. In Always mode, OCR is applied to every page, keeping existing text while adding recognition for images and other visual content. In Reprocess mode, OCR is performed on the whole page, producing a fresh text layer regardless of what was there before.

Values:   auto always reprocess
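The three modes can be read as a per-page decision. The sketch below models the behaviour described above for a single page; it illustrates the documented rules and is not the service's implementation. The `Plan` type and `PlanPage` function are hypothetical names.

```go
package main

import "fmt"

// OcrMode mirrors the documented values: auto, always, reprocess.
type OcrMode string

const (
	ModeAuto      OcrMode = "auto"
	ModeAlways    OcrMode = "always"
	ModeReprocess OcrMode = "reprocess"
)

// Plan describes what happens to one page (hypothetical type, for illustration).
type Plan struct {
	RunOCR       bool // OCR is applied to the page
	KeepExisting bool // an existing text layer is preserved
}

// PlanPage models the documented behaviour of each mode for a page that
// either already contains text (hasText) or does not.
func PlanPage(mode OcrMode, hasText bool) (Plan, error) {
	switch mode {
	case ModeAuto:
		// Pages that already contain text are skipped.
		return Plan{RunOCR: !hasText, KeepExisting: hasText}, nil
	case ModeAlways:
		// OCR runs on every page; existing text is kept.
		return Plan{RunOCR: true, KeepExisting: hasText}, nil
	case ModeReprocess:
		// The whole page is recognized again; a fresh text layer replaces the old one.
		return Plan{RunOCR: true, KeepExisting: false}, nil
	}
	return Plan{}, fmt.Errorf("unknown OcrMode %q", mode)
}

func main() {
	for _, mode := range []OcrMode{ModeAuto, ModeAlways, ModeReprocess} {
		plan, _ := PlanPage(mode, true)
		fmt.Printf("%-9s page with text: %+v\n", mode, plan)
	}
}
```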

OcrLanguage

Collection Default: en

Sets the OCR language. If your language is missing, ask support to add it.

Values:   ar ca zh-cn zh-tw da nl en fi fa de el he it ja ko lt no pl pt ro ru sl es sv tr ua th

PageSegmentationMode

Collection Default: sparseText

The PageSegmentationMode parameter specifies how the OCR engine segments and interprets text within PDF documents. Choosing the appropriate mode enhances OCR accuracy by aligning closely with your document's layout and structure.

Select one of the available modes to control text detection and layout analysis:

  • SparseText - Detects as much text as possible without enforcing any specific order. Suitable for documents containing scattered or fragmented text.
  • SparseTextOsd - Similar to SparseText, but also includes orientation and script detection (OSD). Useful for documents with rotated text or multiple scripts and languages.
  • Auto - Automatically selects the best segmentation mode based on document content. Ideal for general documents with mixed or unknown layouts.
  • AutoOsd - Combines automatic segmentation with orientation and script detection. Recommended for documents with uncertain text orientation or multilingual content.
  • SingleColumn - Assumes a single column of text with varying text sizes. Best suited for straightforward layouts.
  • SingleLine - Treats the entire image as a single line of text. Useful for single-line labels, banners, or narrow text snippets.
  • SingleWord - Treats the entire image as a single word. Ideal for recognizing isolated words or short phrases.

Values:   sparseText sparseTextOsd auto autoOsd singleLine singleColumn singleWord
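The mode names in the list are capitalized (SparseText), while the accepted values start with a lowercase letter (sparseText). Whether the service treats the two spellings as equivalent is not stated here, so the sketch below assumes it is safer to send the exact value. `NormalizeSegmentationMode` is a hypothetical helper, not an SDK function.

```go
package main

import (
	"fmt"
	"unicode"
	"unicode/utf8"
)

// segmentationModes holds the values listed for PageSegmentationMode.
var segmentationModes = map[string]bool{
	"sparseText": true, "sparseTextOsd": true, "auto": true, "autoOsd": true,
	"singleLine": true, "singleColumn": true, "singleWord": true,
}

// NormalizeSegmentationMode lowercases the first letter of a mode name and
// checks it against the listed values. An empty name falls back to the
// documented default, sparseText.
func NormalizeSegmentationMode(name string) (string, error) {
	if name == "" {
		return "sparseText", nil
	}
	r, size := utf8.DecodeRuneInString(name)
	value := string(unicode.ToLower(r)) + name[size:]
	if !segmentationModes[value] {
		return "", fmt.Errorf("unsupported PageSegmentationMode %q", name)
	}
	return value, nil
}

func main() {
	for _, name := range []string{"AutoOsd", "singleColumn", "", "Block"} {
		value, err := NormalizeSegmentationMode(name)
		fmt.Printf("%q -> %q (err: %v)\n", name, value, err)
	}
}
```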

StoreFile

Bool Default: False

When the StoreFile parameter is set to True, your converted file is written to ConvertAPI’s encrypted, temporary storage and made available via a time-limited secure download URL, valid for up to 3 hours. After this period, the file is permanently deleted.

When StoreFile is set to False, conversion happens entirely in-memory. The raw file bytes are streamed back in the API response without touching disk or external storage, ensuring maximum security and zero persistence so that only you can access the content.

OutputType

Collection Default: pdf

Determines how the OCR layer is returned. With pdf, the OCR layer is embedded into the PDF file. With txt, the OCR layer is returned as a text file.

Values:   pdf txt

Step-by-Step Guide

Integrate PDF OCR programmatically using our Go SDK

1. Install the ConvertAPI Go library

ConvertAPI provides a library for the Go programming language that lets you perform a PDF OCR conversion with just a few lines of code.

Execute this command in your GOPATH:
go get github.com/ConvertAPI/convertapi-go

2. Authenticate your Go library

You can obtain your API Token by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your API Token from the account dashboard, and authenticate the ConvertAPI Go library like this:

// get your API Token here: https://www.convertapi.com/a/auth
config.Default = config.NewDefault("api_token")

3. PDF OCR using Golang in no time!

Once your authentication is in place, copy and paste this PDF to OCR conversion code snippet into your Golang project:
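As a library-independent illustration of how the parameters above fit into one request, here is a sketch that uses only the Go standard library. It is not the SDK snippet. The endpoint URL, the Bearer authorization header, and the multipart field layout are assumptions that do not come from this page, and `NewOcrRequest` is a hypothetical helper. The sketch builds the request without sending it.

```go
package main

import (
	"bytes"
	"fmt"
	"mime/multipart"
	"net/http"
)

// assumedEndpoint is an ASSUMPTION, not taken from this page.
const assumedEndpoint = "https://v2.convertapi.com/convert/pdf/to/ocr"

// NewOcrRequest builds a multipart POST carrying the PDF and the conversion
// parameters documented above (PageRange, OcrMode, OcrLanguage, and so on).
func NewOcrRequest(token, fileName string, pdf []byte, params map[string]string) (*http.Request, error) {
	var body bytes.Buffer
	w := multipart.NewWriter(&body)

	// The File parameter carries the document content.
	part, err := w.CreateFormFile("File", fileName)
	if err != nil {
		return nil, err
	}
	if _, err := part.Write(pdf); err != nil {
		return nil, err
	}
	// Every other parameter is sent as a plain form field.
	for name, value := range params {
		if err := w.WriteField(name, value); err != nil {
			return nil, err
		}
	}
	if err := w.Close(); err != nil {
		return nil, err
	}

	req, err := http.NewRequest(http.MethodPost, assumedEndpoint, &body)
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", w.FormDataContentType())
	req.Header.Set("Authorization", "Bearer "+token) // assumed auth scheme
	return req, nil
}

func main() {
	req, err := NewOcrRequest("api_token", "scan.pdf", []byte("%PDF-1.7"), map[string]string{
		"PageRange":            "1-10",
		"OcrMode":              "auto",
		"OcrLanguage":          "en",
		"PageSegmentationMode": "sparseText",
		"StoreFile":            "true",
		"OutputType":           "pdf",
	})
	if err != nil {
		fmt.Println("build error:", err)
		return
	}
	// Passing req to an http.Client with a suitable Timeout would send it.
	fmt.Println(req.Method, req.URL)
}
```

For large scanned files, set a generous `Timeout` on the `http.Client` that sends the request, since OCR on many pages can take a while.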


Compatible with All Go Frameworks & Tools

Compatible with Go, Buffalo, Echo, Fiber, GORM, GoLand, and Chi.

Businesses trust us

Highest rated File Conversion API on major B2B software listing platforms: Capterra, G2, and Trustpilot.

"ConvertAPI has been a game-changer for our document automation workflows. Their conversion accuracy and API reliability are unmatched in the industry for over 7 years."

"ConvertAPI is a reliable, cost-effective solution with a proven track record of stability. It has grown significantly in maturity, adopting enterprise-grade practices over the years."

"We've integrated ConvertAPI across our entire document processing platform. The performance is exceptional and the support team is always responsive. Highly recommended!"

Enterprise-Grade Security

We ensure that all document processing is handled securely in the cloud, adhering to industry-leading standards like ISO 27001, GDPR, and HIPAA. For additional security, we can ensure that no files or data are stored on our servers and that they never leave your country.
