Redact PDF API
Automatically find and redact sensitive data in PDFs using AI
Try it FreeAutomatically detect and black out information in PDF document. Whether you need to secure business documents or protect personal information, our tool provides reliable, efficient, and user-friendly redaction solutions. Start safeguarding your PDFs today with our trusted data masking API.
ConvertAPI offers a cloud-based solution, which makes Redact PDF conversion effortless with just a single REST-API call:
Converting pdf to redact via REST-API is now easier than ever. Sign up for a free account to obtain your authentication secret, append it as a query string parameter to the conversion URL, and voilà! You can find advanced authentication techniques in our docs.
Once you have your authentication in place, you can convert your files using application/json
, multipart/form-data
and application/octet-stream
content types.
A simple HTTP POST request using JSON would look like this:
You can easily convert pdf to redact with a simple cURL request. Our documentation provides more advanced cURL request techniques.
ConvertAPI CLI allows you to convert files using our command-line file conversion utility tool. Check out CLI documentation for a detailed explanation and usage examples.
You can try out advanced conversion parameters and test the conversion result online using our interactive demo tool. This tool will produce the same conversion output as if you were using the library from your solution, and it will auto-generate the code snippet for you!
Try for FREE!Set page range. Example 1-10 or 1,2,5.
The Preset parameter determines the type of sensitive data the AI will detect and redact from the document. It complements the Custom Redaction Options
, enabling you to refine or expand the redaction criteria. Select manual
to use your customized redaction options exclusively.
Choose a preset to define the type of sensitive data the AI will detect and redact from the document:
Auto – Automatically detects and redacts sensitive data across all categories, including PII, financial, healthcare, legal, and confidential information. Best for general redaction when the document type is unknown or contains mixed data.
GDPR - Redacts personal data as required by GDPR, including names, emails, IP addresses, phone numbers, and national IDs.
HIPAA - Ensures compliance with HIPAA by redacting protected health information (PHI) such as patient names, medical record numbers, diagnoses, and prescription details.
FERPA - Redacts personally identifiable student information to comply with FERPA, including student names, school records, and educational identifiers.
FOIA - Prevents the exposure of sensitive personal or national security data in documents released under FOIA. This includes classified content, addresses, and government IDs.
GLBA - Complies with GLBA by redacting financial data such as bank account numbers, credit card details, loan records, and investment information.
CCPA -Meets CCPA requirements by removing consumer personal data such as purchase history, geolocation, contact details, and online identifiers upon request.
Manual - Disables automatic AI detection preset. Only the manually set parameters will be used to determine what should be redacted. Ideal for users who want full control over redaction without AI-based automation.
Values: auto gdpr hipaa ferpa foia glba ccpa manual
Sets the minimum confidence threshold for AI-based detection of sensitive data. Higher values reduce false positives but may miss subtle matches.
Specifies the color used to mask redacted text, accepting formats such as Hexadecimal (e.g., #FFFFFF
for white or #FF5733
for orange), RGB with an optional alpha channel (e.g., 255,255,255
for white or 255,255,255
), and named colors (e.g., white
, red
, blue
).
The RedactionThickness property controls the height of the redaction stroke line relative to the original line height.
Personally Identifiable Information (PII) - Detects and redacts common personal identifiers, including names, email addresses, phone numbers, birthdates, and home addresses.
Patient Health Information (PHI) - Detects health-related information such as patient names, medical records, insurance details, and prescription data.
Financial Data - Focuses on financial records, including credit card numbers, bank account numbers, financial transaction details, etc.
Legal and Contractual Data - Detects legal and contractual terms, including case numbers, legal clauses, signatures, and confidential agreements.
Legal and Contractual Data - Detects proprietary business information, contracts and agreements, internal communications, trade secrets, intellectual property details, and sensitive corporate data.
A JSON array defining specific values for redaction. Supports three methods:
If a preset is selected (e.g., gdpr
, hipaa
), RedactionData
will work in addition to the preset’s AI-based detection.
If the manual
preset is selected, automatic preset is disabled, and only the values specified in the Custom Redaction Options
will be redacted.
[ { "Text": "john@domain.com" }, { "Detect": "Bank account number" }, { "Regex": "\\b100\\s*(€|\\$)\\b" } ]
We ensure that all document processing is handled securely in the cloud, adhering to industry-leading standards like ISO 27001, GDPR, and HIPAA. To enhance security even further, we can ensure that no files or data are stored on our servers and never leave your country.
Learn more about security