PDF to TXT using Java
The Java SDK for converting PDF document to a plain text file, extract text from PDF.
PDF to TXT features
Convert textual and scanned PDF document to a plain text file, extract text from PDF, apply OCR on a scanned PDF document before conversion.
ConvertAPI Java library install
ConvertAPI provides a Java SDK that allows you to perform a PDF to TXT conversion with just a few lines of code. Convert PDF to TXT documents using Java programming language with no effort at all!
<dependency>
<groupId>com.convertapi.client</groupId>
<artifactId>convertapi</artifactId>
<version>2.10</version>
</dependency>
Authenticate your Java library
You can obtain your secret key by signing up for a free account. Once you sign up, you'll receive 250 free conversions instantly! Grab your authentication secret from the account dashboard, and authenticate the ConvertAPI Java library like this:
use \ConvertApi\ConvertApi;
Config.setDefaultApiCredentials("secret_or_token");
Convert PDF to TXT using Java in no time!
Once you have your authentication in place, simply copy-paste this pdf to txt conversion code snippet into your Java project:
Try the conversion online - no coding required!
You can try out advanced conversion parameters and test the conversion result online using our interactive demo tool. This tool will produce the same conversion output as if you were using the library from your solution, and it will auto-generate the code snippet for you!
Try for FREE!Conversion parameters
Sets the password to open protected documents.
Set page range. Example 1-10 or 1,2,5.
Configure the OCR language for text recognition. If auto-detection fails, manually specify the language.
Values: automatic ar ca zh da nl en fi fr de gr ko it ja no pl pt ro ru sl es sv tr th
Enable optical character recognition(OCR).
Persist formatting while extracting text. Only works when RemoveHeadersFooters and RemoveFootnotes properties are disabled.
Remove headers and footers from the document.
Remove footnotes from the document.
Remove tables from the document.