Skip to main content
Developer Tool

CLI Tool

Extract documents from the command line. Perfect for scripts, CI/CD pipelines, and automation.

Quick Start

bash
pip install docdigitizer
docdigitizer auth dd-YOUR_API_KEY
docdigitizer extract invoice.pdf

Features

📄

Single-File Extraction

One command from file to JSON.

📁

Batch Directory

Parallel extraction with concurrent limit.

{}

Output Formats

JSON (default), CSV, NDJSON.

|

Pipe-Friendly

Stream to stdout, chain with jq/grep.

CI/CD Integration

Headless, DD_API_KEY env variable.

📅

Config File

.docdigitizer.yaml in project root.

How It Works

1

Install

pip install or download standalone binary

2

Authenticate

docdigitizer auth <key> once

3

Extract

Point at file or folder

4

Automate

Add to Makefile, GitHub Actions, or cron

bash
# Single file
docdigitizer extract invoice.pdf

# Batch directory
docdigitizer extract ./invoices/ --output ./results/ --format json

# Pipe to jq
docdigitizer extract receipt.jpg | jq '.total'
✓ 500 invoices in 3 minutes

Ready to extract?

Get your API key in 30 seconds. First 50 extractions free.

Questions? → Talk to Us