Methodology
What we look at, and why.
Autotend Forensics surfaces signals that may be consistent with paste-from-elsewhere, metadata anomalies, AI-assisted writing, or other authorship concerns. We never assert a verdict. Every signal we surface has a methodology page below — what it looks at, what it can detect, and what it commonly misreads.
Metadata
Author, timestamps, application, edit time, revision count — the fields written automatically every time a document is saved.
Reading the Application field in DOCX app.xml
The Application field in a docx tells you what program saved the file. It's almost always present, almost always reliable, and one of the simplest forensics signals to read. Here's what each common value means and what to make of an unexpected one.
5 min read
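As a taste of what the article covers: a .docx is an ordinary ZIP container, and the Application value lives in the docProps/app.xml part. A minimal Python sketch (the function name is ours, purely illustrative) that reads it with only the standard library:

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace used by docProps/app.xml (OOXML extended properties).
NS = "{http://schemas.openxmlformats.org/officeDocument/2006/extended-properties}"

def read_application(docx_file):
    """Return the Application string a .docx declares, or None if absent."""
    with zipfile.ZipFile(docx_file) as z:
        if "docProps/app.xml" not in z.namelist():
            # Some exporters and converters omit app.xml entirely --
            # the absence is itself worth noting.
            return None
        root = ET.fromstring(z.read("docProps/app.xml"))
    app = root.find(f"{NS}Application")
    return app.text if app is not None else None
```

A file saved by desktop Word typically reports "Microsoft Office Word"; other producers write their own strings or nothing at all.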
'Created' vs 'Modified' timestamps in a docx — what each one means
Word writes two timestamps into every docx — one for when the document was first created, one for when it was last saved. They look interchangeable. They aren't. Here's exactly how each is set, what they tell you together, and how to read them in context.
5 min read
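Both timestamps live in docProps/core.xml as Dublin Core terms (dcterms:created and dcterms:modified), stored as W3CDTF/ISO-8601 strings. A minimal sketch of pulling them out (function name is ours, for illustration):

```python
import zipfile
import xml.etree.ElementTree as ET

# Dublin Core terms namespace used in docProps/core.xml.
DCTERMS = "{http://purl.org/dc/terms/}"

def read_core_timestamps(docx_file):
    """Return (created, modified) as ISO-8601 strings, or None where missing."""
    with zipfile.ZipFile(docx_file) as z:
        root = ET.fromstring(z.read("docProps/core.xml"))
    created = root.find(f"{DCTERMS}created")
    modified = root.find(f"{DCTERMS}modified")
    return (
        created.text if created is not None else None,
        modified.text if modified is not None else None,
    )
```

Reading the pair together is the point: a modified time only minutes after created, on a long document, reads very differently from a gap of weeks.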
PDF Producer field — a guide for instructors
Every PDF carries a Producer field declaring what program generated it. The value is one of the most useful single signals about how a PDF submission came together — whether it was exported from Word, generated by an LLM tool, printed to PDF, scanned, or re-saved through a converter. Here's what the common values mean.
6 min read
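For a quick look at the raw value, you don't need a PDF library at all. The sketch below is a deliberate heuristic (our own, not a complete parser): it scans the raw bytes for a literal-string /Producer entry, and will miss Producer values stored in XMP metadata, hex strings, or encrypted Info dictionaries.

```python
import re

def read_pdf_producer(pdf_bytes):
    """Best-effort scan of raw PDF bytes for a /Producer (...) entry.

    Heuristic only: misses XMP-stored, hex-encoded, or encrypted
    Producer values. Returns the first match, or None.
    """
    m = re.search(rb"/Producer\s*\(([^)]*)\)", pdf_bytes)
    return m.group(1).decode("latin-1", errors="replace") if m else None
```

Typical values you'll see in the wild include Word's own exporter, "Skia/PDF" (Chrome's print-to-PDF engine), and various converter libraries.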
'Last Modified By' in a docx — what it means and what it doesn't
A docx file records who last saved it. That sounds like an answer to "who wrote this?", but the field has subtle behavior that makes it easy to misread. Here's exactly what it tracks, how it can be spoofed, and what it's actually useful for.
5 min read
The EditingDuration field in DOCX — what it actually measures
Word's "total editing time" field is one of the most-cited and most-misread numbers in document forensics. It measures something specific. It is not "how long the student worked on this." Here's what it does measure, what it doesn't, and how to read it in context.
6 min read
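The counter is written to docProps/app.xml as a TotalTime element, in whole minutes. A minimal sketch of reading it (function name ours, illustrative only):

```python
import zipfile
import xml.etree.ElementTree as ET

# Same extended-properties part that holds the Application field.
NS = "{http://schemas.openxmlformats.org/officeDocument/2006/extended-properties}"

def read_total_edit_minutes(docx_file):
    """Return Word's TotalTime counter in whole minutes, or None if absent."""
    with zipfile.ZipFile(docx_file) as z:
        root = ET.fromstring(z.read("docProps/app.xml"))
    node = root.find(f"{NS}TotalTime")
    return int(node.text) if node is not None and node.text else None
```

Treat the number as what it is: a counter one program maintains under specific conditions, not a work diary.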
What your professor actually sees when they scan your essay
If your school uses a document forensics tool, your submitted file is being inspected — not just its content. Here's exactly what shows up, what each signal means, what raises flags, and how to check your own document before submitting.
8 min read
What docx metadata reveals about a document
A docx file carries dozens of hidden fields — author, edit time, application, revision history — that often tell a different story than the visible content. Here's what's in there, what each field means, and what it can't tell you.
7 min read
Edit history
Revision marks, tracked changes, hidden text, residue from accepted suggestions.
Paste detection
Large unbroken text blocks, missing typing tempo, mismatched formatting signatures.
AI-assisted writing signals
Patterns commonly found in AI-assisted writing — surfaced as signals, never asserted as verdicts.
Structural
ZIP-shape oddities, embedded objects, file-system path leaks, export-source fingerprints.
Excel formula vs cell-style consistency as a forensics signal
Two Excel files can have identical visible values and look interchangeable, yet carry radically different formula histories underneath. A scan that compares how the formulas relate to the cell styles can reveal whether a spreadsheet was actually built or just typed over the top of someone else's work.
5 min read
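One concrete version of that check: in the worksheet XML inside the .xlsx container (e.g. xl/worksheets/sheet1.xml), a cell that was computed carries an &lt;f&gt; formula element alongside its cached &lt;v&gt; value, while a typed-in number carries only &lt;v&gt;. A minimal sketch (function name ours) that counts the two populations:

```python
import xml.etree.ElementTree as ET

# SpreadsheetML main namespace used by worksheet parts.
SHEET_NS = "{http://schemas.openxmlformats.org/spreadsheetml/2006/main}"

def formula_coverage(sheet_xml):
    """Return (formula_cells, value_only_cells) for one worksheet's XML."""
    root = ET.fromstring(sheet_xml)
    formula = value_only = 0
    for cell in root.iter(f"{SHEET_NS}c"):
        if cell.find(f"{SHEET_NS}f") is not None:
            formula += 1
        elif cell.find(f"{SHEET_NS}v") is not None:
            value_only += 1
    return formula, value_only
```

A sheet full of totals and derived columns with zero formula cells is the anomaly this check is after; it's a signal to investigate, not a verdict.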
PowerPoint slide-master leaks — what they reveal about a deck's origin
A .pptx file carries far more origin information than the slides themselves. The slide master, theme, and embedded media all leak provenance. Here's what to look for when you're trying to figure out where a slide deck actually came from.
5 min read
When two students' essays share metadata fingerprints
Sometimes two student submissions look unrelated on the surface but share specific metadata fields — same creator name, same Application version, same revision-save IDs. Here's what each kind of shared fingerprint means and what to do about it.
6 min read
How to scan a PDF for tampering — an instructor's guide
Most academic-integrity tooling treats PDFs as black boxes. They're not. A PDF carries metadata fields, structural markers, and content-extraction signals you can read without specialized software. Here's a practical walkthrough.
6 min read
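One of the simplest checks the walkthrough style above relies on: a PDF saved once ends in a single %%EOF marker, and each incremental update (an edit appended after the original save) adds another. A one-line sketch of counting them, assuming you have the raw bytes:

```python
def incremental_update_count(pdf_bytes):
    """Count %%EOF markers in raw PDF bytes.

    More than one means the file was modified after its first save
    via incremental update. That is evidence of later edits, not
    proof of tampering -- many legitimate tools append updates.
    """
    return pdf_bytes.count(b"%%EOF")
```

Some writers rewrite the whole file on save instead of appending, so a count of one doesn't prove the file was never edited either.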
Why Google Docs exports look different in a forensics scan
A .docx downloaded from Google Docs is shaped differently from a .docx authored directly in Word. The differences are file-structural — different metadata fields, different XML, different edit-history shape — and they're not a sign of wrongdoing. Here's what to expect.
6 min read
Browse by signal category
Deep-dive landing pages for every detection signal Autotend Forensics surfaces. Each explains what the signal looks at, what it can detect, and where it commonly misreads.
Metadata
Author, timestamps, application, edit time, revision count — the fields written automatically every time a document is saved.
Edit history
Revision marks, tracked changes, hidden text, residue from accepted suggestions.
Paste detection
Large unbroken text blocks, missing typing tempo, mismatched formatting signatures.
Font & encoding
Character-set mismatches, font fallback patterns, embedded font hashes.
AI-assisted writing signals
Patterns commonly found in AI-assisted writing — surfaced as signals, never asserted as verdicts.
Structural
ZIP-shape oddities, embedded objects, file-system path leaks, export-source fingerprints.
Browse by file format
Format-specific forensic guides — what authorship signals each format carries, and where the format itself limits what review can surface.
DOCX
Microsoft Word's OOXML container — the format Autotend Forensics surfaces the most signals on.
PDF
PDF documents — fewer authoring signals than DOCX, but still useful for production-tool and edit-time evidence.
PPTX
PowerPoint decks — slide-shape, template, and embedded-object signals.
XLSX
Excel spreadsheets — formula coverage, cell-style anomalies, and worksheet structure.
ODT
OpenDocument Text — LibreOffice/OpenOffice authoring signals.
Pages
Apple Pages — the iWork bundle and its export-to-DOCX trail.
RTF
Rich Text Format — older but still common; minimal metadata but distinctive container signals.