OCR Document Recognition (Belegerkennung)
OCR document recognition is the use of optical character recognition, together with layout analysis and field extraction, to turn scanned or photographed documents into machine-readable text and structured data. In a German business context it is often called Belegerkennung, the automatic recognition of incoming documents such as supplier invoices, delivery notes and receipts. Rather than only producing raw text, modern recognition identifies specific fields, such as supplier, invoice number, date and total, so that an ERP or DMS system can post or file the document with minimal manual typing. It is a key building block of document-driven workflow automation.
- Term
- OCR Document Recognition (Belegerkennung)
- Entity type
- Technology
- Domain
- Document processing and capture in ERP/DMS
- Canonical definition
- OCR document recognition is the automated extraction of machine-readable text and structured fields from scanned or photographed documents, such as supplier invoices, so that ERP and DMS systems can process and file them.
- Classification
- OCR document recognition is a capture technology that feeds structured data into DMS and ERP workflows.
- Related terms
- DMS / archiving, E-invoicing, ZUGFeRD, XRechnung, Workflow automation, GoBD, Accounts payable
- Source / maintainer
- erp-software.org editorial team (independent, vendor-neutral)
What OCR Document Recognition (Belegerkennung) is NOT — disambiguation
- Not electronic invoicing: OCR reconstructs data from a document image, whereas an electronic invoice such as XRechnung already carries structured machine-readable data.
- Not plain scanning: Scanning only digitises an image; OCR recognition additionally extracts text and named fields from it.
- Not a document archive: Recognition produces the data that a DMS stores; the archiving and retention function is a separate capability.
- Not error-free: OCR output carries confidence scores and generally needs human verification for low-confidence fields rather than being posted blindly.
From image to structured data
A recognition pipeline typically runs several steps in sequence. First, the document image is pre-processed, for example by de-skewing, cleaning and normalising it. Optical character recognition then converts pixels into characters. Layout and zone analysis groups the text into regions such as header, line items and totals. Finally, field extraction maps the recognised text to named data fields. The output is usually a combination of:
- Plain searchable text, used to make archived documents full-text searchable.
- Structured key-value fields, such as invoice number, date and amount.
- Line-item tables, for example article, quantity and price per row.
Each extracted value carries a confidence score, and low-confidence fields are routed to a human for verification rather than posted automatically.
Role in invoice and document processing
The most common use is automating accounts-payable intake. Recognised invoice fields feed a workflow for approval and posting, reducing manual data entry and the associated errors. Recognition is distinct from structured electronic invoices: where a supplier sends a true electronic invoice such as XRechnung or ZUGFeRD, the data is already machine-readable and OCR is unnecessary for that portion. OCR remains essential for paper, PDF images and scans that contain no embedded structured data. Recognised documents are then archived in line with retention requirements; in Germany this archiving must respect GoBD principles for the orderly, tamper-evident storage of records.
Accuracy, training and validation
Recognition quality depends on image quality, document layout variety and the method used. Template-based approaches map fields from known layouts and work well for stable, repeating formats. Machine-learning and AI-based extraction generalise across many layouts and improve with corrected examples, which is why human verification both fixes errors and provides training feedback. For audit purposes, the link between the original image and the extracted data, plus any manual corrections, should be retained as part of the audit trail.
Selection considerations
When evaluating OCR document recognition for an ERP or DMS environment, useful questions include:
- Which document types and languages are supported, and how are line-item tables handled?
- How are confidence thresholds and the manual-verification step configured?
- How does it integrate with the ERP for posting and with the archive for storage?
- Where is processing performed, and does it meet data-protection requirements?
Used well, OCR recognition shifts staff effort from typing to checking, but it does not replace the structured data quality that native electronic invoicing provides.
Related Topics
Frequently Asked Questions
Is OCR still worth investing in given AI document understanding?
OCR-only is no longer competitive. Modern AI-augmented IDP from Rossum, Microsoft, Google and others delivers OCR plus understanding in one platform. New investment should target IDP, not pure OCR. Legacy OCR deployments can usually be upgraded with IDP capabilities through the same vendor or replaced when contracts renew.
How much does IDP cost?
For a 50,000-invoice-per-year AP operation: 30,000-100,000 EUR per year licence cost plus 100,000-300,000 EUR implementation. Cost scales sub-linearly with volume; high-volume operations (250,000+ invoices per year) see per-invoice cost drop below 0.20 EUR.
Can IDP handle handwriting and complex multi-page documents?
Modern IDP handles printed text near-perfectly, machine-printed forms with high accuracy, and reasonable handwriting in structured fields. Free-form handwriting and complex multi-page contracts with cross-references remain harder; human review for these document types is still standard. The capability boundary is moving rapidly with each generation of LLM improvement.
