Document Workbench
Prototype prompts, schemas, OCR settings, source locations, tracing, and map-reduce in one authenticated workspace.
AI DOCUMENT OPS
Production document processing for teams that need data, not PDFs.
Extract structured JSON from documents, prove where every field came from, and automate the whole intake workflow with SDKs, jobs, batches, agents, and the new visual integration builder.
CONTROL PLANE
DocuDevs combines a developer API with an operations UI, so teams can prototype, inspect, automate, and govern document processing without rebuilding the same plumbing.
Prototype prompts, schemas, OCR settings, source locations, tracing, and map-reduce in one authenticated workspace.
Build visual flows for document intake, OCR, extraction, routing, and exports, then keep the generated YAML for review.
Run embedded integrations near private files while the cloud control plane manages configs, status, and orchestration.
Process folders of documents, track progress, and download normalized JSON, CSV, or Excel outputs when the run completes.
Describe the outcome you need and let the agent help draft extraction instructions, schema fields, and processing plans.
Use DocuDevs managed OCR and LLMs, or connect your own Azure Document Intelligence, OpenAI, and Azure OpenAI providers.
NEW: INTEGRATION BUILDER
The builder turns common integration work into a visual canvas: watch a folder, read documents, extract with a saved configuration, branch on values, and ship results to the next system.
tasks: - watch: /invoices - ocr: layout - extract: invoice_v3 - post: ERP_WEBHOOK_URL
DEVELOPER SURFACE
The same extraction engine powers API requests, UI jobs, saved configurations, batch runs, and edge flows.
Typed extraction
from docudevs import DocuDevsClient
from pydantic import BaseModel
client = DocuDevsClient(token="your-api-key")
class Shipment(BaseModel):
container_id: str
port_of_loading: str
gross_weight_kg: float
result = client.extract(
"bill-of-lading.pdf",
schema=Shipment,
source_locations=True
)Validated object
Shipment( container_id="MSCU4829910", port_of_loading="Helsinki", gross_weight_kg=21740.0 )
DOCUMENT PIPELINE
Every document run is observable: queued, processed, traced, stored, and ready for the next system.
Start from a prompt, JSON schema, saved configuration, template, or agent drafted extraction plan.
OCR converts layout, tables, figures, spreadsheets, and long reports into model-ready context.
Structured output is validated, enriched with source evidence, and tracked as an async job.
Return JSON to your API, export CSV or Excel, fill templates, or push through an integration flow.
USE CASES
Start with a single extraction and grow into cases, templates, quality checks, batch exports, and integration flows.
START FAST
READY
Start with the full platform, a token pack, and the same API surface you can keep in production.