Tutorials
Extract Invoice Pages from Bank Statements with PHP
Split combined PDF documents and extract specific page ranges using PHP and the aPDF.io Split API. Perfect for separating invoices from bank statements, extracting sections from vendor documents, or isolating specific pages.
Merge PDF GoCombine Monthly Reports into Quarterly Summaries with Go
Automate report compilation by merging multiple PDF files into quarterly or annual summaries using Go. Learn how to create cover pages, skip existing covers, and build professional combined reports automatically.
PDF Overlay PythonAdd Page Numbers and Headers to PDFs with Python
Add consistent page numbers, headers, or footers to existing PDF documents using Python and the aPDF.io Overlay API. Perfect for preparing reports for distribution without regenerating the original documents.
OCR Node.js Data EntryExtract Text from Scanned Invoices for Automated Data Entry
Automate data entry from scanned invoices and receipts using Node.js and the aPDF.io OCR Read API. Extract vendor names, invoice numbers, and totals from image-based PDFs and feed them directly into your accounting system.
OCR Search PHPSearch Inside Scanned Contracts and Documents with PHP
Find specific text within scanned PDF documents using PHP and the aPDF.io OCR Search API. Perfect for legal compliance, contract review, and searching through digitized archives where standard PDF search doesn't work.
PDF Security RubyRemove Password Protection from Archived PDFs with Ruby
Batch remove password protection from PDF documents using Ruby when you have the known password. Ideal for migrating protected archives to new systems or simplifying access to legacy documents protected with standard organizational passwords.
Metadata JavaExtract PDF Metadata for Document Management with Java
Read PDF metadata including page count, file size, author, and encryption status using Java. Build smarter document catalogs, enable advanced filtering, and detect protected files automatically for your document management system.
Compress PDF Node.jsCompress Large PDFs for Email Attachments with Node.js
Reduce PDF file sizes before sending email attachments using Node.js and the aPDF.io Compress API. Avoid bounced emails, improve deliverability, and save bandwidth with automatic compression that can shrink files by 50-90%.
PDF to Image GoGenerate PDF Thumbnails for Document Previews with Go
Build a document preview system by converting PDF pages to thumbnail images using Go. Learn how to generate fast, lightweight previews for file browsers and document management systems without heavy local dependencies.
OCR PythonConvert Scanned PDFs to Searchable Documents with Python
Transform image-based scanned PDFs into fully searchable documents using Python and the aPDF.io OCR API. Perfect for digitizing paper archives, enabling text selection, and making legacy documents ready for modern search systems.
PDF to Image PHPHow to Send "Embedded" PDF Invoice Previews in Emails (using Images)
Learn how to embed PDF invoices directly in HTML emails by converting the first page to an image. Since email clients don't support embedded PDFs, this technique uses the aPDF.io API to create visual invoice previews that increase engagement and trust with your customers.
Create PDF GoGenerate PDF Event Tickets with QR Codes in A6 Format using Go
Learn how to generate A6-sized event tickets with embedded QR codes using Go. This tutorial focuses on small-format PDFs (tickets, badges, labels) using CSS print media and the aPDF.io API. Perfect for event management systems, concerts, and conferences where scanning and compact printing are essential.
Rotate Pages PythonFix Sideways PDF Scans: Auto-Rotate Pages with Python
Learn how to automatically fix sideways or upside-down PDF pages from mobile uploads. Using Python and the aPDF.io Rotate API, you can detect and correct orientation issues in scanned documents, making them ready for review workflows.
PDF Underlay Letterhead RubyAdd Company Letterhead to Existing PDFs
Learn how to retrofit company branding onto legacy PDF reports by sliding a letterhead design behind the existing content. Using Ruby and the aPDF.io Underlay API, you can batch-process thousands of documents without regenerating them.
Delete Pages Node.jsAutomatically Deleting Sensitive Pages from PDFs
Learn how to automatically remove Terms & Conditions, internal notes, or other sensitive pages from PDFs before sharing with third parties. Using Node.js and the aPDF.io Delete Pages API, you can batch-process documents and strip confidential information in seconds.
Create PDF PHP HTMLDynamic Certificate Generation for Online Courses (LMS)
Learn how to dynamically generate professional PDF certificates for online course platforms using PHP. Design certificates with HTML/CSS, use landscape orientation, and automatically generate them when students complete courses.
PDF Password JavaScriptBatch Password Protect PDFs for GDPR Compliance
This tutorial shows how to automatically password-protect sensitive documents like payslips and medical records before emailing. Using JavaScript and the aPDF.io Security API, you can batch-process documents with user-specific passwords for GDPR compliance.
Search PDF PythonBuilding a "Search Inside PDF" Feature for Your Web App
This tutorial shows how to build a PDF search feature using Python and the aPDF.io Search API. Find specific terms, get page numbers and context lines, and optionally use regex patterns for advanced matching. Includes a Flask web interface example.
Compress PDF AWS S3 JavaReduce AWS S3 Costs by Automating PDF Compression
This tutorial shows how to cut AWS S3 storage costs by compressing PDFs before storing them. Using Java and the aPDF.io Compress API, developers can achieve up to 90% file size reduction on image-heavy PDFs, directly translating into lower storage bills.
PDF Watermark RubyAutomate PDF Watermarking: Add Logos to Documents in Bulk
This tutorial explains how to automate PDF watermarking using Ruby and the aPDF.io Overlay API. By layering a transparent PDF onto original documents, developers can bulk-process files and use parameters to target specific page ranges (e.g., skipping cover pages).
Extract Text LLM PythonExtract PDF Text for RAG Pipelines & LLMs with Python
This tutorial explains how to extract clean text from PDFs for RAG pipelines using the aPDF.io API in Python. It demonstrates replacing messy local libraries with an API that returns structured JSON, allowing developers to easily feed high-quality document data into OpenAI or vector databases.
Split PDF Node.js GoSplitting Bulk PDF Files: How to Separate Invoices & Payslips
The tutorial explains how to separate bulk PDF files into individual documents using the aPDF.io Split API with JavaScript or Go. By utilizing the pages parameter (e.g., n1 for single pages), developers can automatically "burst" large batches, such as invoices or payslips, into separate, downloadable files.
Extract Pages PHPBuilding a Document Previewer: Extract PDF First Page as a Thumbnail
Learn how to build a fast PDF previewer for Document Management Systems using PHP. Extract only the first page as a lightweight thumbnail preview without forcing users to download the entire 50MB file. Perfect for improving DMS user experience and reducing bandwidth.
Merge PDF PHP Node.jsHow to Merge Multiple PDF Reports into a Single File (Node.js & PHP)
This tutorial explains how to combine multiple PDF files into a single document using the aPDF.io Merge API with Node.js or PHP. By sending file URLs to the API, developers can avoid server memory issues and easily handle advanced tasks, such as excluding cover pages by specifying page ranges.
Create PDF Invoice PythonBuild an Automated Invoice Generator using HTML & Python
The tutorial explains how to build a Python invoice generator by using Jinja2 to merge data into standard HTML/CSS templates. Instead of manually coding complex PDF layouts, the script sends the rendered HTML to the aPDF.io API, which instantly converts it into a professional, downloadable PDF.