Developer Tool
CLI Tool
Extract documents from the command line. Perfect for scripts, CI/CD pipelines, and automation.
Quick Start
bash
pip install docdigitizer
docdigitizer auth dd-YOUR_API_KEY
docdigitizer extract invoice.pdf
Features
Single-File Extraction
One command from file to JSON.
Batch Directory
Parallel extraction with a concurrency limit.
Output Formats
JSON (default), CSV, NDJSON.
Pipe-Friendly
Stream to stdout, chain with jq/grep.
CI/CD Integration
Runs headless; reads the API key from the DD_API_KEY environment variable.
Config File
Reads settings from .docdigitizer.yaml in the project root.
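For a headless CI step, it can help to fail early when the key was not injected. The following is a minimal sketch, assuming the CLI reads DD_API_KEY from the environment as the CI/CD Integration feature states and writes JSON to stdout. The function names `require_api_key` and `extract_to` are hypothetical helpers, not part of the tool.

```bash
# Hypothetical helper: stop with a clear message if the CI secret is missing.
require_api_key() {
  if [ -z "${DD_API_KEY:-}" ]; then
    echo "DD_API_KEY is not set; add it as a CI secret" >&2
    return 1
  fi
}

# Hypothetical helper: extract one document and save the JSON that the CLI
# streams to stdout. Only the documented `docdigitizer extract <file>` form
# is used here.
extract_to() {
  local input="$1" output="$2"
  require_api_key || return 1
  docdigitizer extract "$input" > "$output"
}
```

A pipeline step would then call something like `extract_to invoice.pdf invoice.json` after the secret has been exported.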
How It Works
1. Install
pip install or download standalone binary
2. Authenticate
docdigitizer auth <key> once
3. Extract
Point at file or folder
4. Automate
Add to Makefile, GitHub Actions, or cron
bash
# Single file
docdigitizer extract invoice.pdf

# Batch directory
docdigitizer extract ./invoices/ --output ./results/ --format json

# Pipe to jq
docdigitizer extract receipt.jpg | jq '.total'
✓ 500 invoices in 3 minutes
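For the "Automate" step, a cron job usually needs to be safe to rerun. The sketch below is one possible approach, not the tool's built-in batch mode: it loops over a folder with the single-file command shown above and skips documents that already have a result. The function name `extract_new`, the `.pdf` filter, and the `<name>.json` output naming are assumptions made for illustration.

```bash
# Hypothetical helper: extract every PDF in a folder that has no result yet.
# Usage: extract_new <source-dir> <results-dir>
extract_new() {
  local src="$1" dest="$2" file name status=0
  mkdir -p "$dest"
  for file in "$src"/*.pdf; do
    # An unmatched glob stays literal; skip it.
    if [ ! -e "$file" ]; then continue; fi
    name=$(basename "$file" .pdf)
    # A non-empty result file means this document was already processed.
    if [ -s "$dest/$name.json" ]; then continue; fi
    # Write to a temp file first so a failed run never leaves partial JSON.
    if docdigitizer extract "$file" > "$dest/$name.json.tmp"; then
      mv "$dest/$name.json.tmp" "$dest/$name.json"
    else
      rm -f "$dest/$name.json.tmp"
      echo "failed: $file" >&2
      status=1
    fi
  done
  return "$status"
}
```

Called as `extract_new ./invoices ./results` from a cron entry or a Makefile target, a second run only processes files added since the first. The function returns non-zero if any file failed, so the scheduler can flag the run.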