PDF Metadata Forensics: A Complete Field-by-Field Reference

Originally published at htpbe.tech. The version on htpbe.tech stays in sync with the latest detection algorithm — refer to it for the canonical text.

Every PDF file carries two layers of information. The first is the visible content — the text, images, and layout a reader sees. The second is metadata: structured data describing the document itself. This second layer records when the document was created, which application produced it, whether it has been modified, and by what tools.

Forensic analysis of these fields can reconstruct a document’s history without examining its visible content at all. For document fraud detection professionals, understanding each field — what it stores, what it reveals, and what makes a value suspicious — is the foundation of PDF authenticity assessment.

This reference covers every major metadata field used in PDF forensics, the structural element that cannot be cleared (the cross-reference table), and how these signals combine into an overall authenticity verdict.

Two Metadata Systems in One File

Originally published at htpbe.tech. The version on htpbe.tech stays in sync with the latest detection algorithm — refer to it for the canonical text.

Two Metadata Systems in One File

PDF Metadata Forensics: A Complete Field-by-Field Reference

PDF Metadata Forensics: A Complete Field-by-Field Reference

Other newsrooms on this story

Related reading

PDF Integrity Report: March 2026

Introducing HTPBE? Quantum: PDF Tamper Detection Before the Document Exists

The Developer’s Guide to Translating Foreign PDFs (Text, OCR, and AI Workflows)

Markdown to PDF: 8 methods compared (and why most of them disappoint)

I tried every popular library for programmatic PDF form filling. None of them…

The AI-Powered PDF Marks the End of an Era

Other newsrooms on this story

Related reading

PDF Integrity Report: March 2026

Introducing HTPBE? Quantum: PDF Tamper Detection Before the Document Exists

The Developer’s Guide to Translating Foreign PDFs (Text, OCR, and AI Workflows)

Markdown to PDF: 8 methods compared (and why most of them disappoint)

I tried every popular library for programmatic PDF form filling. None of them…

The AI-Powered PDF Marks the End of an Era