Document APIs for agents and developers

Use the world's best models to transform documents into data. Built for AI agents and modern applications.

Get Started for free

Book a demo

Trusted by the world's leading enterprises

Community

The world's best open source models

Don't take our word for it, here's what the community has to say

1M+

DOWNLOADS ON HUGGINFACE

#1

POST ON HACKER NEWS

X

THIS GUY AT HUGGINGFACE SAID WE'RE COOL

Skip the build. Deploy AI Agents

Let our experts deploy a fully managed AI workforce that understands your documents and syncs your data natively to ERPs like SAP, Oracle, and Salesforce.

Try now Talk to an Expert

OUR APIs

From document to decision, automated

See what we offer for each of your document usecases

Document to JSON

Extract structured data from documents into JSON format. Perfect for feeding data into databases, APIs, and AI applications.

Custom schemas
Nested objects
Type validation

Learn more

Document to Markdown

Convert any document into clean, structured Markdown with perfect formatting preservation. Ideal for content pipelines and documentation workflows.

Preserves formatting
Table extraction
Header hierarchy

Learn more

Classify and Split

Automatically classify document types and intelligently split multi-document files. Streamline your document processing pipeline.

Auto-classification
Smart splitting
Batch processing

Learn more

FEATURES YOUR TEAM WILL LOVE

Everything you need for production-grade document processing

Layout Understanding

Automatically classify document types and intelligently split multi-document files. Streamline your document processing pipeline.

LLM Ready

Optimized outputs for large language models with structured data formats.

Multi-Language Support

Process documents in 100+ languages with native OCR and understanding capabilities.

All Input & Output Types

Support for PDF, images, Word, Excel, and more. Output to JSON, Markdown, CSV, or custom formats.

See what teams are saying about us

Trusted by Developers

icymi there's a two new Nanonets OCR models, Nanonets-OCR2-3B and Nanonets-OCR2-1.5B-exp

this model can handle forms (checkboxes), recognize watermarks, describes images, charts in docs and more!

it even handles flowcharts it's multilingual and Apache-2.0 licensed

merve

Open Source @ HuggingFace

just deployed Nanonets-OCR2 on nvidia RTX PRO 6000 blackwelli cannot max it!

the model is too small and the GPU is too powerful!

Maziyar PANAHI

Creator of @OpenMed_AI

One of the best OCR tools is @nanonets OCR.It works very well even when text and background colors are the same.I highly recommend it; it's much better than others.

Shivam Baldha

Engineering Lead at DataCore

Finally @nanonets (an SF-based document AI startup) released Nanonets-OCR2 on the hub.

A 3B multilingual VLM which can handle a huge range of things from checkboxes to LaTeX and complex tables

It beats Gemini 2.5 Flash and GPT-5 on their internal benchmarks

Niels Rogge

ML Engineer @ ML6Team

After 1M+ downloads of their previous model, @nanonets is launching Nanonets-OCR2, a next-gen suite for image-to-markdown conversion & visual QA.

From LaTeX math & tables → mermaid charts, signatures → watermarks. It handles it all.

Y Combinator

Only 2 models (GPT-5 and nanonets-ocr2-3B) pass the phishing test, again highlighting how you cannot blindly trust VLMs.

VLMs dont read text, they predict it.Attached individual results for all models tested.

Rann

gdpr, soc2, hipaa compliant

Data security is our top priority

Nanonets prioritises the confidentiality and integrity of your data. As a testament to our commitment, we adhere to stringent compliance standards, including GDPR, SOC 2, and HIPAA. Privacy Policy