![]() | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | |
---|---|---|---|---|---|---|---|
OCR accuracy on open datasets | TBD | 87.8 | 77.7 | 79.7 | N.A. | N.A. | N.A. |
Languages supported | 40+ | 200 | 300 | 6 | 200+ | 276 | 150+ |
Pre-trained document extractors | invoices, receipts, POs, bills of lading, bank statements, passports, driver license | bank statements, W-2s, passports, utility bills, identity docs, payslips, driver license, expenses, invoices | bank checks, bank statements, business cards, contracts, credit cards, general documents, health insurance cards, ID docs, invoices, marriage certificates, mortgage docs, oay stubs, receipts, tax docs | invoices, receipts, and ID docs | invoices | tax invoices, profroma invoies, POs, credit notes, debit notes, delivery notes | bank statements, passports, ID cards, finance docs, salary slips |
Zero-shot learning | Moderate | Moderate/High | High | Moderate | Low | Moderate | Low/Moderate |
Confidence Scoring | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Workflow automation potential | Yes – offers a workflow builder. | No native UI workflow | Yes via Power Automate | Workflow automation is DIY using AWS services. | Yes – but it may require significant configuration | Yes – built-in | Yes – built-in |
Table Extraction | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Train with custom dataset | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
data export integration options | Multiple ERP and database integrations | No major options apart from google cloud storage | No major options apart from azure offerings | No major options apart from aws offerings | No OOB capability to integrate with other integrations | Multiple ERP and database integrations | No OOB capability to integrate with other integrations |
API support | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Asynchronous processing support | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
multi page file support | 3000 pages without postprocessing limits | Depends on processor | 200 pages | JPEG/PNG ⇒ 10mb, pdf,tiff= 500mb upto 3000 pages | optimal number is 100. more pages can cause errors | 40 mb | 10 |
File Types supported | PDF, JPEG, PNG, HEIC, TIFF, EXCEL, CSV, WORD, TXT, HTML | PDF, GIF, TIFF, JPEG, PNG, BMP, WebP, HTML | JPEG, PNG, BMP, HEIF, PDF, TIFF, HTML, Word, Excel, Powerpoint | JPEG, PNG, PDF, TIFF | DOC, SPREADSHEET, PPT, PDF, GIF, TIFF, JBIG2, JPEG, PNG, BMP, PCX, etc | PDF, PNG, JPEG, TIFF, XLSX, DOCX | JPEG, PNG, PDF, HEIF, HEIFSequence, HEIC, HEICSequence, AVIF, AVIFSequence, TIFF, WebP, RTF, WORD, EXCEL, ODT, ODS, etc |
On Premise Support | Yes | No | Yes | No | Yes | No | Yes |
Security and Compliance | ISO 27001, SOC2, GDPR, HIPAA | ISO 27001, ISO 27017, ISO 27018, SOC 2, SOC 3, and PCI DSS, HIPAA, FedRAMP | offers variety of compliances as mentioned here - https://learn.microsoft.com/en-us/azure/compliance/ | HIPAA, SOC, ISO, and PCI | SOC2 Type 1 | ISO 27001, SOC2, HIPAA | ISO 27001 & 9001, GDPR |
Supported document import options | UI, Email, and various integrations such as google drive, sharepoint, onedrive etc | Google console UI, Google cloud storage, API | api/sdk | can upload documents stored in s3, local storage via api/sdk | UI interface, api/sdk | UI, Email and various integrations | api/sdk |
Human in loop | Yes | Deprecated now | Yes | Yes | Yes | Yes | Yes |
STP stats | Yes | No | No | No | No | No | No |