PDF OCR Read API

Start building for free today – no credit card required.

Extract and read text content from scanned PDF documents with our advanced OCR API. Convert image-based pages into structured, readable text with high accuracy recognition. Perfect for data extraction, content analysis, and automating digitization workflows for documents that lack selectable text.


Benefits of our PDF OCR Read API

Effortlessly extract text from scanned documents with our comprehensive OCR Read API. Retrieve clean, structured text from image-based PDFs page by page, making it ready for analysis, indexing, or data processing. Whether you're digitizing invoices, extracting information from archived records, or automating data entry from scanned forms, our API provides accurate text recognition that transforms inaccessible images into actionable data.

Read our API documentation to learn how to extract text from scanned PDFs.

Multi-language Code Example

curl -X POST https://apdf.io/api/pdf/ocr/read \
  -H "Authorization: Bearer TOKEN" \
  -d file="FILE_URL"
const data = new FormData();
data.append('file', 'FILE_URL');

fetch('https://apdf.io/api/pdf/ocr/read', {
    headers: {'Authorization': 'Bearer TOKEN'},
    method: 'POST',
    body: data
})
.then(response => response.json())
.then(json => console.log(json));
use GuzzleHttp\Client;

$client = new Client();
$response = $client->post(
    'https://apdf.io/api/pdf/ocr/read', [
    'headers' => [
        'Authorization' => 'Bearer TOKEN'
    ],
    'form_params' => [
        'file' => 'FILE_URL'
    ]
]);

$body = $response->getBody();
echo json_encode($body->getContents());
require 'rest-client'

response = RestClient.post(
  'https://apdf.io/api/pdf/ocr/read',
  {
    'file' => 'FILE_URL'
  },
  {
    Authorization: "Bearer TOKEN"
  }
)

puts response.body
import requests

response = requests.post(
    'https://apdf.io/api/pdf/ocr/read',
    headers = {
        'Authorization': 'Bearer TOKEN'
    },
    data = {
        'file': 'FILE_URL'
    }
)

print(response.text)
import (
  "fmt"
  "github.com/go-resty/resty/v2"
)

func main() {
  client := resty.New()
  data := map[string]string{
    "file": "FILE_URL"
  }

  resp, _ := client.R().
    SetFormData(data).
    SetHeader("Authorization", "Bearer TOKEN").
    Post("https://apdf.io/api/pdf/ocr/read")

  fmt.Println(resp.String())
}
import okhttp3.*;

class Pdf {
  public static void main(String[] args) throws Exception {
    OkHttpClient client = new OkHttpClient();
    FormBody formBody = new FormBody.Builder()
      .add("file", "FILE_URL")
      .build();

    Request request = new Request.Builder()
      .url("https://apdf.io/api/pdf/ocr/read")
      .addHeader("Authorization", "Bearer TOKEN")
      .post(formBody)
      .build();

    Response response = client.newCall(request).execute();
    System.out.println(response.body().string());
  }
}

No-Code OCR Text Extraction

Integrate our PDF API with Zapier Webhooks to automate your OCR text extraction workflows effortlessly. Whether you're processing scanned invoices, extracting data from forms, or digitizing paper archives, you can trigger OCR text extraction from hundreds of apps supported by Zapier. Automatically convert image-based content into structured text data, enabling seamless data analysis and eliminating time-consuming manual transcription.


Start using our API today — no trial needed!
Get Started for Free