ABBYY Releases FineReader Engine with DocLang, the AI Native Document Standard
ABBYY Releases FineReader Engine with DocLang, the AI Native Document Standard
ABBYY empowers organizations with AI-ready data, improving outcomes from advanced LLM and agentic-based automation pipelines
AUSTIN, Texas--(BUSINESS WIRE)--On the heels of the announcement by the Linux AI & Data Foundation about the new DocLang AI native document standard founded by ABBYY, IBM, HumanSignal, Nvidia, and RedHat, ABBYY today released ABBYY FineReader Engine 12.8.0 that exports to DocLang.
ABBYY FineReader Engine with DocLang support provides developers a unified, AI-readable format to represent documents for language model and agentic AI consumption, saving them time and increasing document processing performance.
Share
ABBYY FineReader Engine with DocLang support provides developers a unified, AI-readable format to represent documents for language model and agentic AI consumption, saving them time and increasing document processing performance.
FineReader Engine with DocLang Improves Document Processing Performance
ABBYY recently demonstrated FineReader Engine processing unprecedented speeds of 2,160,000 pages per hour at its ABBYY Ascend event. Additionally, in a side-by-side benchmark, ABBYY compared the processing of a PDF and DocLang document. In the controlled experiment, the same document for the same complex task using the same AI model was configured identically. The only variable was the document representation in PDF and DocLang. FineReader Engine with DocLang significantly improved output quality, increased structural accuracy, decreased token usage, and reduced latency.
See the tangible benefits of AI-native document structuring firsthand by reviewing the ABBYY interactive benchmark www.abbyy.com/ai/doclang/.
The controlled benchmark tested three types of enterprise documents: an annual report, a clinical study, and a vendor contract. These documents, designed for human interpretation yet complex for machines to process, demonstrated successful results during testing.
“ABBYY FineReader Engine is already used by thousands of organizations processing billions of documents every year,” commented Max Vermeir, VP of AI Strategy at ABBYY. “Now with DocLang as an AI native format, more companies will be able to accelerate innovation and have faster access to their business data to make smarter, more impactful decisions.”
Why the DocLang Standard is Needed
ABBYY, IBM, HumanSignal, Nvidia and Red Hat, formed the DocLang working group to revolutionize AI document parsing. Current document formats such as PDF, HTML, Markdown, and others, were designed for human consumption rather than for AI interpretation. The result is a patchwork of partial solutions requiring custom parsing at every integration point that burdens developers with building custom parsers, is prone to hallucinations, and complicates regulatory compliance.
DocLang creates a reliable abstraction layer between unstructured data and intelligent AI systems. It standardizes the cacophony of digital document formats that enterprises operate on and gives AI systems the deterministic structure they need to perform reliably at enterprise scale.
Continued Vermeir, "DocLang is specifically engineered to address industry challenges with a minimal, standardized, and AI-native method for representing document structure, meaning, layout, and governance. FineReader Engine with DocLang support was designed for efficient machine processing and a predictable structure optimized for modern AI tokenization and modeling techniques. Organizations will see a significant difference with more reliable interpretation, increased accuracy, and lower computational costs."
More information about the DocLang working group can be found at www.abbyy.com/ai/doclang.
More information about ABBYY FineReader Engine can be found at https://www.abbyy.com/ocr-sdk/.
More information about the FineReader Engine 12.8 release with DocLang can be found at https://www.abbyy.com/blog/finereader-engine-12-r8-release/
To request a demo, visit abbyy.com.
About ABBYY
ABBYY helps organizations optimize processes, accelerate decisions, and drive better outcomes with purpose-built Document AI. More than 10,000 enterprises, including many Fortune 500 companies, rely on ABBYY’s 35 years of innovation to turn business data into actionable insights that improve the way we work and live. Headquartered in Austin, Texas, and offices in 13 countries, ABBYY leads the way for smarter agentic automation. For more information, visit www.abbyy.com/company and follow us on LinkedIn, X, Facebook, Instagram and TikTok. ABBYY can either be a registered trademark or a trademark and can also be a logo, a company name (or part of it), or part of a product name of ABBYY group companies and may not be used without consent of its respective owners.
Contacts
Editorial contact:
Gina Ray
gina.ray@abbyy.com
