PDF to JSON Ruby

Convert PDF documents to structured, layout-aware JSON with text, tables, images, and bounding boxes for AI and data pipelines.

PDF Tools

PDF to JSON Ruby Overview

Convert PDF documents into clean, structured JSON built for large language models, retrieval-augmented generation, and automated data pipelines. Every page is decomposed into typed elements - paragraphs, tables, images, and graphics - each carrying its text, reading order, and precise bounding box. Tables are reconstructed into rows and cells so tabular data stays machine-readable for downstream use. Handle password-protected files, target page ranges, and apply OCR to scanned documents so even image-based PDFs yield searchable, structured output for indexing, analytics, and AI ingestion.

Lightning Fast Conversions

Process and convert files in seconds with our high-performance cloud infrastructure.

Accuracy Guaranteed

Our advanced algorithms ensure pixel-perfect and content-accurate file conversions.

Enterprise-Grade Security

ISO 27001, HIPAA, and GDPR compliant with encrypted file processing.

Global Infrastructure

Strategically located servers ensure low latency and high availability worldwide.

Developer Friendly

Comprehensive SDKs and clear documentation for quick and simple integration.

Time-Saving Automation

Automate repetitive document workflows and focus on what matters most.

Customizable Parameters

Fine-tune your automation with these powerful conversion options

File

File Supported formats: .pdf

File to be converted. Value can be URL or file content.

Password

String

Sets the password to open protected documents.

PageRange

String Default: 1-2000

Set page range. Example 1-10 or 1,2,5.

OcrMode

Collection Default: auto

Defines how OCR is applied during conversion. Auto performs OCR only when needed. Force applies OCR to all pages. Never disables OCR entirely.

Values:   auto force never

OcrLanguage

Collection Default: auto

Configure the OCR language for text recognition. If auto-detection fails, manually specify the language.

Values:   auto ar ca zh da nl en fi fr de el ko it ja no pl pt ro ru sl es sv tr ua th

IncludeGraphics

Bool Default: True

Include vector graphic elements in the JSON output.

ExtractTableRows

Bool Default: True

Reconstruct table rows and cells from the detected table layout.

StoreFile

Bool Default: False

When the StoreFile parameter is set to True, your converted file is written to ConvertAPI’s encrypted, temporary storage and made available via a time-limited secure download URL, valid for up to 3 hours. After this period, the file is permanently deleted.

When StoreFile is set to False, conversion happens entirely in-memory. The raw file bytes are streamed back in the API response without touching disk or external storage, ensuring maximum security and zero persistence so that only you can access the content.

Step-by-Step Guide

Easy PDF to JSON integration programmatically using our Ruby GEM

1. ConvertAPI Ruby library install

ConvertAPI provides a Ruby Gem that allows you to perform a PDF to JSON conversion with just a few lines of code. First, install the ConvertAPI Ruby SDK:

Add this line to your application's Gemfile:
gem 'convert_api'

2. Authenticate your Ruby library

You can obtain your API Token by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your API Token from the account dashboard, and authenticate the ConvertAPI Ruby library like this:

# get your API Token here: https://www.convertapi.com/a/auth
ConvertApi.configure do |config|
   config.api_credentials = 'api_token'
end

3. Convert PDF to JSON using Ruby in no time!

Once you have your authentication in place, simply copy-paste this pdf to json conversion code snippet into your Ruby project:

Integrate within minutes

Easy PDF to JSON automation using our simple Ruby SDK

Try the PDF to JSON conversion online

Try it Free

Compatible With all Ruby Frameworks & Tools

Compatible with Ruby Compatible with Ruby on Rails Compatible with Ruby Sinatra Compatible with RSPEC Available on RubyGems Compatible with VS Code Compatible with RubyMine

Businesses trust us

Highest rated File Conversion API on major B2B software listing platforms: Capterra, G2, and Trustpilot.

"ConvertAPI has been a game-changer for our document automation workflows. Their conversion accuracy and API reliability are unmatched in the industry for over 7 years."

"ConvertAPI is a reliable, cost-effective solution with a proven track record of stability. It has grown significantly in maturity, adopting enterprise-grade practices over the years."

"We've integrated ConvertAPI across our entire document processing platform. The performance is exceptional and the support team is always responsive. Highly recommended!"

Enterprise-Grade Security

We ensure that all document processing is handled securely in the cloud, adhering to industry-leading standards like ISO 27001, GDPR, and HIPAA. To enhance security even further, we can ensure that no files or data are stored on our servers and never leave your country.

Learn more about security

Ready to Streamline Your File Conversions?