1. Lock the field map
The first pass should define the document type, required columns, source document ID, page reference, vendor or customer field, date field, total field, line-item columns, and review rules for missing or inconsistent values.
Proof hub / Document data sample
Goal: each invoice, statement, form, or order packet becomes a clean row set with source document IDs, page numbers, line items, totals, missing-field flags, and a handoff note.
The first pass should define the document type, required columns, source document ID, page reference, vendor or customer field, date field, total field, line-item columns, and review rules for missing or inconsistent values.
{
"document_id": "INV-1001",
"page": 1,
"vendor": "Northwind Supplies",
"date": "2026-05-01",
"purchase_order": "PO-7781",
"item_code": "SKU-CLN-25",
"description": "Cleaner concentrate",
"quantity": 4,
"unit_price": 250.00,
"line_total": 1000.00,
"tax": 80.50,
"document_total": 1280.50,
"review_status": "ready"
}
A narrow first milestone can be bought as a 24-hour spreadsheet cleanup and KPI report, or as a 24-hour workflow/report repair when the document rows need to feed another system.