PDF to XLSX using Java

Accurate PDF to Excel Data and Table Extraction

PDF to XLSX features

This advanced PDF to Excel Java library converts PDFs, including scanned documents, into editable Excel sheets quickly and accurately. It preserves data integrity by reproducing tables and charts for data analysis, so the output needs minimal manual editing. Built for businesses and developers, the library integrates into existing Java systems and uses OCR to improve accuracy on scanned documents.

ConvertAPI Java library install

ConvertAPI provides a Java SDK that allows you to perform a PDF to XLSX conversion with just a few lines of code. Convert PDF documents to XLSX using the Java programming language with minimal effort!

Add the ConvertAPI dependency to your pom.xml.

Authenticate your Java library

You can obtain your secret key by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your authentication secret from the account dashboard, and authenticate the ConvertAPI Java library like this:

Get your secret key here: https://www.convertapi.com/a/auth

Convert PDF to XLSX using Java in no time!

Once you have your authentication in place, simply copy and paste the PDF to XLSX conversion code snippet into your Java project:
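As a library-free illustration, the sketch below builds a request URL for a PDF to XLSX conversion using only the JDK. It is not the SDK snippet. It assumes a REST endpoint of the form `https://v2.convertapi.com/convert/pdf/to/xlsx` that takes the secret and the conversion parameters in the query string. The endpoint, the `Secret` and `File` parameter names, and the class `PdfToXlsxRequest` are assumptions made for illustration and should be checked against the official API reference before use.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

/** Hypothetical helper, not part of the ConvertAPI SDK. */
final class PdfToXlsxRequest {
    // Assumption: endpoint shape of the REST service; confirm in the official docs.
    static final String ENDPOINT = "https://v2.convertapi.com/convert/pdf/to/xlsx";

    /** Builds the request URI with the secret first, then the given parameters in order. */
    static URI buildUri(String secret, Map<String, String> params) {
        Map<String, String> all = new LinkedHashMap<>();
        all.put("Secret", secret); // assumption: secret is passed as a query parameter
        all.putAll(params);
        String query = all.entrySet().stream()
                .map(e -> encode(e.getKey()) + "=" + encode(e.getValue()))
                .collect(Collectors.joining("&"));
        return URI.create(ENDPOINT + "?" + query);
    }

    /** URL-encodes a single query component as UTF-8. */
    private static String encode(String s) {
        return URLEncoder.encode(s, StandardCharsets.UTF_8);
    }
}
```

Calling `PdfToXlsxRequest.buildUri(secret, params)` with a `File` entry that points at a publicly reachable PDF (again an assumption about what the service accepts) yields a URI that could be sent with `java.net.http.HttpClient`. Keeping the secret out of source control, for example by reading it from an environment variable, is advisable in either approach.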

Try the conversion online - no coding required!

You can try out advanced conversion parameters and test the conversion result online using our interactive demo tool. This tool will produce the same conversion output as if you were using the library from your solution, and it will auto-generate the code snippet for you!

Convert PDF to Excel with styling and formatting

Our newly introduced PDF to XLSX API converter is a convenient tool for transforming either textual or scanned PDFs into editable MS Office Excel spreadsheets. It provides high-quality conversion that preserves not only the data but also the formatting and styling. In addition to tabular data, it can also include images, paragraphs, and other non-table content in the produced XLSX file.

How to convert PDF documents to XLSX using Java?

The PDF to XLSX conversion consists of a few simple steps:

  1. Sign up for a free ConvertAPI account
  2. Install our Java library using Maven
  3. Set up the conversion using our interactive demo
  4. Copy-paste the auto-generated code snippet

With this conversion, you can also choose whether to combine the extracted tables into a single sheet or keep them separate. All other layout decisions are made automatically by the converter.
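The single-sheet choice corresponds to the `SingleSheet` parameter listed under "Conversion parameters", alongside `IncludeFormatting`. A minimal sketch of collecting such options as name/value pairs follows. The class `XlsxConversionOptions` is hypothetical and not part of the ConvertAPI SDK, and passing booleans as the strings "true" and "false" is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper (not part of the ConvertAPI SDK) that collects conversion parameters. */
final class XlsxConversionOptions {
    private final Map<String, String> params = new LinkedHashMap<>();

    /** true: combine all extracted tables into one sheet; false: keep them separate. */
    XlsxConversionOptions singleSheet(boolean value) {
        params.put("SingleSheet", Boolean.toString(value));
        return this;
    }

    /** true: also include non-table content such as images and paragraphs. */
    XlsxConversionOptions includeFormatting(boolean value) {
        params.put("IncludeFormatting", Boolean.toString(value));
        return this;
    }

    /** Returns a copy; parameters that were never set are omitted so defaults apply. */
    Map<String, String> toMap() {
        return new LinkedHashMap<>(params);
    }
}
```

For example, `new XlsxConversionOptions().singleSheet(true).toMap()` produces a map with the single entry `SingleSheet=true`, which can then be handed to whichever client performs the conversion.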


ConvertAPI's PDF to XLSX API is an excellent tool for converting PDFs to XLSX using the Java programming language. With just a few simple steps, you can set up your project by using the auto-generated code snippet available on the interactive demo page, saving you valuable time and effort.

Conversion parameters

Password String

Sets the password to open protected documents.

PageRange String

Sets the page range, for example 1-10 or 1,2,5.

IncludeFormatting Bool

Includes non-table content, such as images and paragraphs, in the Excel file.

SingleSheet Bool

Combine extracted tables into a single sheet.

OcrLanguage Collection

Set the OCR language.

Values: automatic, ca, da, de, es, en, fr, fi, it, nl, no, pl, pt, ro, ru, sv, sl, tr

EnableOcr Bool

Enables optical character recognition (OCR).
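Checking parameter values locally before starting a conversion can catch typos early. The sketch below validates an `OcrLanguage` code against the values listed above and expands a `PageRange` string such as 1-10 or 1,2,5 into page numbers. The class `ParameterChecks` is hypothetical, and accepting a mix of both forms (for example 1-3,7) is an assumption about the range syntax, not documented behavior.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical client-side checks; the service performs its own validation. */
final class ParameterChecks {
    /** OCR language codes as listed in this document. */
    static final Set<String> OCR_LANGUAGES = Set.of(
            "automatic", "ca", "da", "de", "es", "en", "fr", "fi", "it",
            "nl", "no", "pl", "pt", "ro", "ru", "sv", "sl", "tr");

    static boolean isSupportedOcrLanguage(String code) {
        return code != null && OCR_LANGUAGES.contains(code);
    }

    /** Expands "1-10", "1,2,5" or a mix such as "1-3,7" into sorted, distinct page numbers. */
    static List<Integer> expandPageRange(String range) {
        TreeSet<Integer> pages = new TreeSet<>();
        for (String part : range.split(",")) {
            String p = part.trim();
            int dash = p.indexOf('-');
            int first;
            int last;
            try {
                if (dash < 0) {
                    first = Integer.parseInt(p);
                    last = first;
                } else {
                    first = Integer.parseInt(p.substring(0, dash).trim());
                    last = Integer.parseInt(p.substring(dash + 1).trim());
                }
            } catch (NumberFormatException e) {
                throw new IllegalArgumentException("Invalid page range: " + range, e);
            }
            // Pages are 1-based and a range must not run backwards.
            if (first < 1 || last < first) {
                throw new IllegalArgumentException("Invalid page range: " + range);
            }
            for (int page = first; page <= last; page++) {
                pages.add(page);
            }
        }
        return new ArrayList<>(pages);
    }
}
```

A malformed range such as 5-1 or 1-x raises an `IllegalArgumentException`, so the problem surfaces before any conversion is started.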

