Skip to main content
SDK

Python SDK

Full-featured Python SDK with async support, batch processing, type hints, and schema validation.

Quick Start

bash
pip install docdigitizer

Features

Async/Await Support

Full async client via AsyncDocDigitizer.

📁

Batch Processing

extract_batch() for concurrent processing.

{}

Type Hints

Fully typed, mypy/pyright compatible.

Schema Validation

Pydantic models or JSON Schema dicts.

🔄

Auto-Retry

Transient error handling with backoff.

📈

Streaming

extract_stream() for live progress updates.

How It Works

1

Install

pip install docdigitizer

2

Extract

Call extract() with file path, URL, or bytes

3

Use Results

Access typed attributes or .json() method

python
from docdigitizer import DocDigitizer

client = DocDigitizer(api_key="dd-YOUR_KEY")
result = client.extract("invoice.pdf")

print(result.vendor.name)    # "Acme Corp"
print(result.total)          # 1537.50
print(result.confidence)     # 0.99
✓ Extracted in 2.3s · 1 credit used

Ready to extract?

Get your API key in 30 seconds. First 50 extractions free.

Questions? → Talk to Us