Deepgram Nova-3 Transcription

Transcribe audio files using Deepgram's Nova-3 model — more accurate and faster than OpenAI Whisper.

Why Deepgram over Whisper

5.26% word error rate (vs ~8-10% for Whisper)
40x faster for batch processing
Built-in speaker diarization (who said what)
Smart formatting (numbers, dates, punctuation)
$200 free credits on signup at deepgram.com

Setup

Store your API key:

echo "YOUR_DEEPGRAM_API_KEY" > ~/.openclaw/secrets/deepgram-api-key.txt

Or set the environment variable:

export DEEPGRAM_API_KEY="your-key-here"

Usage

bash scripts/transcribe.sh /path/to/audio.mp3
bash scripts/transcribe.sh recording.mp3 --out transcript.txt
bash scripts/transcribe.sh recording.mp3 --json --out full.json
bash scripts/transcribe.sh recording.mp3 --language es
bash scripts/transcribe.sh recording.mp3 --model nova-2

Models

Model	WER	Cost/min	Best for
nova-3 (default)	5.26%	$0.0077	Best accuracy
nova-2	~8%	$0.0043	Budget-friendly
whisper-large	~8-10%	$0.0048	Whisper parity

Supported Formats

mp3, wav, m4a, ogg, flac, webm, aiff

Fallback

If DEEPGRAM_API_KEY is not set, use the OpenAI Whisper skill instead (if installed).

deepgram-transcribe

Installation