Algoscale DocumentIQ
AI-Powered Document Intelligence

Turn Documents into
Structured Data

Upload PDFs, define what to extract, and let AI do the rest. Single-pass extraction with GPT-5, Claude, and more — handles tables, line items, and complex layouts out of the box.

Start for Free

Enterprise: private fine-tuning on your data · email inbox intake · Talk to sales →

Contract Management · Invoice Processing · Legal Document Review · Insurance Claims · Real Estate Leases · Financial Reports
1 Call
Per document extraction
PDF + DOC
File formats supported
Real-time
Progress tracking
Multi-LLM
OpenAI, Anthropic & more

Everything you need for
document intelligence

From extraction to chat to export — a complete platform for turning unstructured documents into actionable, structured data.

Pre-Built Field Templates

Start projects in seconds with 10 ready-made templates: invoices, contracts, BOMs, COAs, bills of lading, KYC, FNOL, and more. Save your own field sets as reusable templates per organization.

Single-Pass AI Extraction

Send entire PDFs directly to GPT-5, Claude, and more. All fields extracted in one pass — no text pre-processing needed. Handles scanned documents, tables, and complex layouts.

Multi-Row Line Item Extraction

Automatically detects and extracts repeating rows like invoice line items, order details, or contract clauses. No row limits — scales to hundreds of items per document.

Smart Annotations

Draw bounding boxes on PDFs to teach the AI exactly where to look. For each box, provide the raw text and the correct extraction value to serve as few-shot examples. Enable or disable annotations per extraction run.

Multi-Model Support

Choose from OpenAI, Anthropic, and custom fine-tuned models per job. Admins assign models per billing plan, each with its own credit rate. Compare outputs across providers.

Feedback & Re-Extraction

Mark extractions as correct, incorrect, or corrected. Re-process selected documents with corrections injected as ground truth. Continuously improve accuracy.

How it works

Get from raw documents to structured data in five simple steps.

01

Upload Documents

Drag and drop PDF, DOC, or DOCX files. Documents are stored securely on Azure and are ready for processing instantly.

02

Pick a Template or Configure Fields

Start from 10 pre-built templates (invoices, contracts, BOMs, COAs, BOLs, KYC, FNOL, and more) — or define your own fields. AI auto-suggests extraction instructions.

03

Annotate (optional)

Draw bounding boxes on a representative PDF to teach the AI which regions map to which fields. These annotations become few-shot ground truth for every similar document thereafter.

04

Extract & Review

Choose your LLM, hit extract, and watch results stream in real time. Review in a rich table, provide feedback, and re-extract to refine.

05

Chat & Export

Ask questions across all your documents and extracted data. Export results to CSV, Excel, or PDF. Build on top with the full API.

Field Templates

Start in seconds,
not hours

Skip the blank-canvas problem. DocumentIQ ships with 10 pre-built field templates covering the most common document workflows — each with field definitions, AI extraction prompts, and project context already configured. Apply one with a click, then customize as needed.

  • Invoices, POs, contracts, BOMs, COAs, BOLs, packing declarations, KYC, FNOL, and more
  • Multi-row line item structures encoded for tables and repeating data
  • Domain-specific extraction prompts (e.g., escalation clause types, ISO 6346 container numbers, CAS numbers)
  • Save your own custom fields as reusable templates for your team
Custom templates available on every plan
[Screenshot: DocumentIQ template picker showing 8 pre-built field schemas, including Commercial Invoice, Bill of Lading, Mill Test Cert, and Customs Declaration]
[Screenshot: DocumentIQ field manager for configuring the name, type, description, repeating flag, and LLM extraction prompt of each field]
Configure once

Define exactly what
you want extracted

Every project has its own schema. Define each field's name, type, description, and a custom extraction prompt — and DocumentIQ writes the LLM instructions for you with a single click. Mark fields as repeating to capture multi-row tables with parent-child structure intact.

  • Field types: text, number, date, boolean, list
  • AI-suggested extraction prompts you can edit or rewrite
  • "Repeating (line item)" toggle for invoices, BOMs, customs manifests
  • Reorder, duplicate, or save fields as a reusable template
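
As a rough illustration of the schema described above, a project's field definitions might be modeled as follows. This is a minimal sketch, not DocumentIQ's actual API: the `FieldDef` class, its attribute names, and the `validate_schema` helper are hypothetical. Only the five field types and the repeating flag come from the list above.

```python
from dataclasses import dataclass

# Field types taken from the list above; everything else here is hypothetical.
ALLOWED_TYPES = {"text", "number", "date", "boolean", "list"}


@dataclass
class FieldDef:
    """One extraction field in a project schema (illustrative only)."""
    name: str
    type: str
    description: str = ""
    prompt: str = ""          # custom extraction prompt for the LLM
    repeating: bool = False   # True for line-item (multi-row) fields


def validate_schema(fields):
    """Return a list of human-readable problems; empty means the schema is OK."""
    problems = []
    seen = set()
    for f in fields:
        if f.type not in ALLOWED_TYPES:
            problems.append(f"{f.name}: unknown type {f.type!r}")
        if f.name in seen:
            problems.append(f"{f.name}: duplicate field name")
        seen.add(f.name)
    return problems


invoice_schema = [
    FieldDef("invoice_number", "text", "Supplier invoice number"),
    FieldDef("invoice_date", "date", "Date printed on the invoice"),
    FieldDef("line_amount", "number", "Amount per line item", repeating=True),
]
```

Calling `validate_schema(invoice_schema)` returns an empty list for this example; a field with an unsupported type or a duplicated name would be reported instead.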
See results

Structured data,
with confidence scores

Every extracted value carries a confidence score and a link back to the source page and bounding box. Review thousands of rows in a sortable table, flag bad extractions inline, and export to CSV or Excel — or pull the data via API into your ERP, CRM, or BI tool.

  • Per-field confidence bars surface ambiguous extractions for review
  • Inline thumbs-up / thumbs-down feedback feeds the next extraction run
  • Summary cards: documents processed, fields extracted, credits used
  • Export to CSV, Excel, or pull via REST API
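
The review-and-export flow above can be sketched in a few lines. This is an illustration under stated assumptions, not the product's export format: the row keys, the sample values, and the 0.8 review threshold are all made up for the example.

```python
import csv
import io

# Hypothetical extracted rows; the keys and values are made up for illustration.
rows = [
    {"document": "a.pdf", "field": "total", "value": "100.00", "confidence": 0.97},
    {"document": "a.pdf", "field": "due_date", "value": "2024-01-31", "confidence": 0.62},
    {"document": "b.pdf", "field": "total", "value": "250.00", "confidence": 0.88},
]


def needs_review(rows, threshold=0.8):
    """Return rows whose confidence falls below the review threshold."""
    return [r for r in rows if r["confidence"] < threshold]


def to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["document", "field", "value", "confidence"]
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


flagged = needs_review(rows)   # low-confidence rows to surface for review
csv_text = to_csv(rows)        # full export, header plus one line per row
```

Filtering on a threshold is only one way to use the scores; sorting the table by confidence would serve the same review purpose.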
[Screenshot: DocumentIQ extraction results with summary cards and a table of extracted rows, confidence indicators, and feedback buttons]
[Screenshot: DocumentIQ annotation interface for drawing bounding boxes on a PDF to teach the AI which regions of a document map to which extraction fields]
Teach the AI

Annotate once,
extract accurately forever

Draw a box on a representative PDF for each field — single value or multi-row table. The annotations become few-shot examples that dramatically improve accuracy across every similar document from that vendor, carrier, or format thereafter.

  • Auto-detect raw text from PDF regions using PyMuPDF
  • Single-value boxes for headers; multi-row boxes for line-item tables
  • Annotations become ground-truth examples in the LLM prompt
  • Toggle annotations per extraction run for A/B testing
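
One way to picture how annotations could end up as examples in the LLM prompt is sketched below. The `Annotation` structure, the prompt wording, and the `build_prompt` helper are assumptions for illustration; the actual prompt format is not documented here. In practice the raw text would come from the PDF region itself (with PyMuPDF, `page.get_text("text", clip=rect)` reads the text inside a rectangle), but the sketch uses a hard-coded string so it runs without a PDF.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    """A labeled region on a sample PDF (hypothetical structure)."""
    field: str
    page: int
    bbox: tuple      # (x0, y0, x1, y1) in page coordinates
    raw_text: str    # text as it appears in the document
    value: str       # the value the model should return


def build_prompt(instructions, annotations, use_annotations=True):
    """Build an extraction prompt, optionally appending few-shot examples."""
    lines = [instructions]
    if use_annotations and annotations:
        lines.append("Examples from an annotated document:")
        for a in annotations:
            lines.append(
                f'- field "{a.field}" (page {a.page}): '
                f'raw text "{a.raw_text}" -> "{a.value}"'
            )
    return "\n".join(lines)


examples = [
    Annotation("invoice_date", 1, (400, 60, 560, 80), "Date: 31/01/2024", "2024-01-31"),
]
with_examples = build_prompt("Extract invoice_date.", examples)
without_examples = build_prompt("Extract invoice_date.", examples, use_annotations=False)
```

The `use_annotations` flag mirrors the per-run toggle: running the same documents with and without examples gives the two arms of an A/B comparison.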
User: What's the average total across all invoices from Q1?
Assistant: Based on the extracted data from 47 invoices in Q1, the average total is $3,842.50. The highest was $12,400 from vendor_invoice_031.pdf and the lowest was $280 from misc_receipt_012.pdf.
Documents · Extracted Data · Both
Project Chat

Ask questions across
all your documents

Three context modes let you query raw documents, extracted data tables, or both. The AI cites specific documents and renders markdown tables, and you can export entire conversations as PDF.

  • RAG-powered with vector search across document chunks
  • SQL generation for analytical queries over extracted data
  • Streaming responses with markdown rendering
  • Export conversations as PDF, Markdown, or JSON
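
To make the SQL-generation idea concrete, here is a small sketch of the kind of query a model might produce for an analytical question over extracted data. The table name, column names, and sample rows are hypothetical, and SQLite stands in for whatever store holds the extracted data.

```python
import sqlite3

# In-memory table standing in for extracted invoice data (values are made up).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE invoices (document TEXT, invoice_date TEXT, total REAL)")
conn.executemany(
    "INSERT INTO invoices VALUES (?, ?, ?)",
    [
        ("inv_001.pdf", "2024-01-15", 100.0),
        ("inv_002.pdf", "2024-02-20", 300.0),
        ("inv_003.pdf", "2024-05-02", 900.0),  # outside Q1
    ],
)

# The kind of SQL a model might generate for
# "What's the average total across all invoices from Q1?"
generated_sql = """
    SELECT AVG(total), COUNT(*)
    FROM invoices
    WHERE invoice_date >= '2024-01-01' AND invoice_date < '2024-04-01'
"""
average_total, invoice_count = conn.execute(generated_sql).fetchone()
```

Running the aggregate in SQL rather than asking the model to do arithmetic over retrieved text keeps the answer exact for questions like averages and counts.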
Enterprise Ready

Built for teams and organizations

Multi-tenant architecture with role-based access, custom billing, and white-label branding. Deploy on Azure or on-premise.

RBAC & Multi-Org

Owner, admin, member, viewer, and billing roles per organization

White-Label

Custom logo, brand colors, and company name for each organization

Custom Invoicing

Enterprise billing with offline invoicing, no credit card required

Fine-Tuned Models

Bring your own fine-tuned models via LiteLLM for domain-specific accuracy

What teams are saying

“We went from manually reviewing 1,200 contracts over three months to having every escalation clause extracted and queryable in under 48 hours. The sell-side recovery alone paid for the platform 10x over.”

VP of Procurement

Global Mobility & Professional Services

“Our AP team was drowning in invoices from 200+ vendors in completely different formats. DocumentIQ handles all of them with a single field configuration. We eliminated manual data entry entirely.”

Procurement Director

Manufacturing

“The chat feature changed how we work with extracted data. Instead of building reports, dispatchers just ask "which shipments to Dallas are overweight?" and get an instant answer with cited documents.”

Operations Manager

Freight & Logistics

Enterprise capabilities

Built for teams that need more

Two capabilities reserved for our Enterprise tier — designed for the volume, privacy, and integration needs of larger operations.

Enterprise

Your AI gets smarter with every correction — privately

We fine-tune a model on the corrections your team makes. Your data never leaves your tenant. The resulting model is yours alone — competitors don't benefit from your accuracy gains.

  • Per-organization model trained on your feedback_entries
  • Accuracy compounds with your usage, and none of the gains carry over to competitors
  • Strict tenant isolation; auditable training data lineage
Talk to sales about private fine-tuning
Enterprise

Forward an invoice. See it extracted in 30 seconds.

Point a shared mailbox at DocumentIQ. Attachments are ingested automatically, processed, and land in your queue, or are pushed straight to your ERP. No upload step and no behavior change for the people already in the email loop.

  • Dedicated inbound address per project (or forward from your own)
  • Auto-extracts PDFs, DOCs, and inline body text
  • Webhook + ERP push so extracted data lands where you need it
Talk to sales about email intake
Book a 30-minute Enterprise scoping call

Custom SLAs, dedicated support, and other Enterprise capabilities also covered.

Simple, transparent pricing

Start free, upgrade as you grow. No hidden fees.

Free

Get started with basic extraction

$0/mo
  • 2 projects, 50 documents
  • 10 fields per project
  • Basic extraction (GPT-4o Mini)
  • CSV export
  • Community support
Get Started
Most Popular

Pro

For teams processing at scale

$49/mo
  • Unlimited projects & documents
  • All LLM models (GPT-5, Claude, etc.)
  • Project chat assistant (3 modes)
  • PDF annotations & feedback loops
  • Document summaries & analytics
  • Excel, CSV & PDF export
  • Multi-row line item extraction
  • Priority email support
Get Started

Enterprise

For large organizations with advanced needs

Custom
  • Everything in Pro
  • Custom pricing & invoicing
  • Dedicated account manager
  • White-label branding
  • Fine-tuned model support
  • SSO / SAML integration
  • On-premise deployment option
Contact Sales

Frequently asked questions

Everything you need to know about DocumentIQ.

Do I have to define every field from scratch?
No. DocumentIQ ships with 10 pre-built field templates for common document types — commercial invoices, purchase orders, contracts, price escalation clauses, bills of lading, packing declarations, bills of materials, certificates of analysis, FNOL insurance claims, and KYC documents. Apply one with a click and your project is ready to extract immediately. You can also save your own field configurations as custom templates for your organization.
How is DocumentIQ different from traditional OCR?
Traditional OCR converts images to text but doesn't understand what the text means. DocumentIQ sends entire documents to LLMs that understand context, tables, and relationships — extracting specific fields like invoice totals, contract clauses, or line items. It handles varied layouts, scanned documents, and complex tables without custom templates.
What file formats are supported?
DocumentIQ supports PDF, DOC, and DOCX files. PDFs are sent directly to the LLM (including scanned/image PDFs when using models with vision capabilities). Word documents are parsed and their text is extracted automatically.
Which AI models can I use?
DocumentIQ supports multiple LLM providers through LiteLLM, including OpenAI (GPT-5, GPT-4o), Anthropic (Claude), and custom fine-tuned models. Your admin assigns which models are available on your billing plan, each with independent credit rates.
What are annotations and how do they improve accuracy?
Annotations let you draw bounding boxes on a PDF to show the AI where a value appears. You provide both the raw text (what's in the document) and the correct extraction (what the AI should return). These become few-shot examples that dramatically improve accuracy — like teaching a new employee where to look.
Can DocumentIQ extract line items and tables?
Yes. Multi-row extraction automatically detects repeating rows like invoice line items, BOM entries, or contract clauses. There are no row limits — it scales to hundreds of items per document.
How does the chat assistant work?
Each project has an AI chat with three context modes: Documents (searches raw text via RAG), Extracted Data (queries structured results), or Both. The assistant cites specific documents and renders markdown, and you can export conversations as PDF.
Is DocumentIQ suitable for enterprise use?
Yes. It offers a multi-tenant architecture with RBAC, multi-org support, white-label branding, custom offline invoicing, and fine-tuned model support, and it is deployed on Azure with enterprise-grade security.
How much does it cost?
There are three plans: Free ($0/month, 2 projects, 50 documents), Pro ($49/month, unlimited projects and documents, all models, chat, annotations), and Enterprise (custom pricing with white-label branding, SSO, and dedicated support). Credits are consumed per extraction based on the model used.
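
The per-model credit arithmetic can be sketched as below. The model names and rates are placeholders, not published prices, and the sketch assumes one extraction per document; actual rates are assigned by an admin per billing plan.

```python
# Hypothetical credit rates per extraction; real rates are set by an admin per plan.
CREDIT_RATES = {"model-small": 1, "model-large": 5}


def credits_used(jobs, rates=CREDIT_RATES):
    """Sum credits for a list of (model_name, document_count) extraction jobs."""
    total = 0
    for model, doc_count in jobs:
        if model not in rates:
            raise KeyError(f"no credit rate configured for {model!r}")
        total += rates[model] * doc_count
    return total


# 40 documents at 1 credit each plus 10 documents at 5 credits each.
usage = credits_used([("model-small", 40), ("model-large", 10)])
```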

Ready to extract smarter?

Join teams using DocumentIQ to turn unstructured documents into actionable, structured data — in minutes, not hours.