Nanonets Agentic Data Extraction

AI document parsing and extraction.

Convert complex documents into clean, structured data for LLM-based use cases. RAG pipelines, AI agents, JSON extraction, and more.

Roche
Ryanair
Volkswagen
Schneider Electric
Procter & Gamble
Mondelez
Juniper Networks
Bayer
Topgolf Callaway
Publicis Groupe
PMI
CKE Restaurant Group
Roche
Ryanair
Volkswagen
Schneider Electric
Procter & Gamble
Mondelez
Juniper Networks
Bayer
Topgolf Callaway
Publicis Groupe
PMI
CKE Restaurant Group

Quick start

Upload a document. Get structured data back.

No templates, no per-format setup, no prompt engineering. The API returns clean markdown plus structured JSON for tables and fields.

View full API reference
Documents flowing through an extraction pipeline and routed into structured data lanes

Flexible APIs

APIs to parse, extract, chunk. One endpoint per job. Consistent schema across all of them.

Parse

Parse documents the way a person does. Layout, structure, and meaning preserved. The agentic layer reviews and corrects outputs in real-time, even on edge cases.

See API docs
invoice.md
Invoice · INV-2024-0421
Billed ToAcme Corp
DateApr 21, 2024
DueMay 21, 2024
Line Items
| SKU | Description | Qty | Rate | Amount |
| ------ | --------------- | --: | -----: | ------: |
| A-100 | Widget Pro | 2 | $120.00| $240.00 |
| B-200 | Gadget Basic | 1 | $180.00| $180.00 |
| C-305 | Premium Support | 1 | $33.60| $33.60 |
Subtotal$420.00
Tax (8%)$33.60
Total$453.60
Payment terms: Net 30 · ACH preferred · contact ap@acme.co for questions.
Nanonets OCR-3 standing on top of the IDP Leaderboard podium ahead of GPT-5.4 and Gemini 3 Pro

IDP Leaderboard

Ranked #1 overall.

Higher combined accuracy than GPT-5.4, Gemini 3 Pro, and every other VLM across OlmOCR, OmniDoc, and IDP Core.

Rank #1
85.9
Nanonets OCR-3
Rank #2
83.5
GPT-5.4
Rank #3
82.8
Gemini 3 Pro
Rank #4
82.0
Gemini 3 Flash

Available via API. Deploy in your own VPC or on-prem for strict data residency and compliance.

Capability profile

Strong across every dimension.

Text extraction, formulas, tables, layout, key information, and visual QA — one model handles every document understanding task in a single API.

Text Extraction93.2Formula87.7Tables89.4Visual QA73.0Layout88.8Key Info84.3

Real-world performance

Public benchmarks saturate. We score the documents that ship to production.

Trained and tested on the document types that hit real extraction pipelines every day — dense filings, multi-column legal text, clinical records.

94.5%
FinanceBench
Dense SEC 10-K filings averaging 143 pages with nested tables, footnotes and cross-references.
96.0%
DocBench Legal
Multi-column court filings and legislation with complex formatting, citations and structural hierarchy.
90.1%
HealthcareBench
Clinical notes, discharge summaries, lab reports, insurance EOBs, and prior authorization forms.

Enterprise-ready

Security, scale, support.

Built for your most demanding production pipelines.

Contact sales
Deploy in your environment
Run Nanonets on-prem or in your private VPC. Ideal for strict security, compliance, and data residency.
Enterprise support and SLAs
Forward-deployed support and custom SLAs tailored to your production requirements.
SOC 2, HIPAA, GDPR, ISO 27001
Enterprise-grade certifications for sensitive and regulated data. Security policies available on request.
99.9%+ uptime
Battle-tested infrastructure trusted in production at enterprise scale.

Feature-rich

Everything an agent needs. Nothing it doesn't.

Low latency, high throughput

Two-line setup, lightning-fast processing, enterprise-grade scalability.

Highest accuracy for RAG

State-of-the-art accuracy for document understanding and retrieval pipelines.

Built for document agents

Purpose-built outputs and integrations for agent frameworks and LLM tool use.

100+ languages

Native OCR and understanding across 100+ languages, including mixed-script documents.

Any input, any output

PDFs, images, Word, Excel in. JSON, Markdown, CSV, or custom shapes out.

Layout understanding

Classify document types and split multi-document files automatically.

Developers love Nanonets

Trusted by top AI teams and developers.

icymi there's two new Nanonets OCR models, Nanonets-OCR2-3B and Nanonets-OCR2-1.5B-exp. It even handles flowcharts and it's multilingual and Apache-2.0 licensed.

merve
Open Source @ HuggingFace

Just deployed Nanonets-OCR2 on nvidia RTX PRO 6000 blackwell. I cannot max it! The model is too small and the GPU is too powerful.

Maziyar Panahi
Creator of OpenMed.AI

Finally @nanonets released Nanonets-OCR2 on the hub. A 3B multilingual VLM which can handle complex document layouts, tables, math, watermarks and more.

Niels Rogge
ML Engineer @ ML6Team

After 1M+ downloads of their previous model, @nanonets is launching Nanonets-OCR2, a next-gen suite for image-to-markdown conversion.

Y Combinator
Backs Nanonets

One of the best OCR tools is @nanonets OCR. It works very well even when text and background colors are the same. I highly recommend it.

Shivam Baldha
Engineering Lead at DataCore

Nanonets played a pivotal role in AP automation at Asian Paints. Document extraction was key, and it led to faster reimbursement claims and reduced errors.

Prathmesh Khedekar
Manager at Asian Paints

Open-source contributions

Community pulse.

Open-sourcing models and research is how we contribute to the broader document AI community.

3M+
Downloads on Hugging Face
#1
Post on Hacker News
100k+
Views on technical thought pieces
4.8
Avg rating from open-source devs

See it run on your process, with your documents.

Start free. No credit card. Or talk to our team about your workflow.