SkillHub

youtube-voice-summarizer-elevenlabs

v1.0.0

Transform YouTube videos into podcast-style voice summaries using ElevenLabs TTS

Sourced from ClawHub, Authored by Franciscoandsam

Installation

Please help me install the skill `youtube-voice-summarizer-elevenlabs` from SkillHub official store. npx skills add Franciscoandsam/youtube-voice-summarizer-elevenlabs

YouTube Voice Summarizer

Transform any YouTube video into a professional voice summary delivered in under 60 seconds.

What It Does

When a user sends a YouTube URL, this skill: 1. Extracts the video transcript via Supadata 2. Generates a concise AI summary via OpenRouter/Cerebras 3. Converts the summary to natural speech via ElevenLabs 4. Returns an audio file the user can listen to

Requirements

This skill requires a running backend server. Deploy the summarizer service:

git clone https://github.com/Franciscomoney/elevenlabs-moltbot.git
cd elevenlabs-moltbot
npm install
cp .env.example .env
# Add your API keys to .env
npm start

Required API Keys

Service Purpose Get Key
ElevenLabs Text-to-speech https://elevenlabs.io
Supadata YouTube transcripts https://supadata.ai
OpenRouter AI summarization https://openrouter.ai

How to Use

When user sends a YouTube URL:

Step 1: Start the voice summary job

curl -s -X POST http://127.0.0.1:3050/api/summarize 
  -H "Content-Type: application/json" 
  -d '{"url":"YOUTUBE_URL","length":"short","voice":"podcast"}'

Returns: {"jobId": "job_xxx", "status": "processing"}

Step 2: Poll for completion (wait 3-5 seconds between checks)

curl -s http://127.0.0.1:3050/api/status/JOB_ID

Keep polling until status is "completed".

Step 3: Return the audio to user

When complete, the response includes: - result.audioUrl - The MP3 audio URL (send this to the user!) - result.teaser - Short hook text about the content - result.summary - Full text summary - result.keyPoints - Array of key takeaways

Send the user: 1. The teaser text as a message 2. The audio URL so they can listen

Voice Options

Voice Style
podcast Deep male narrator (default)
news British authoritative
casual Friendly conversational
female_warm Warm female voice

Summary Lengths

Length Duration Best For
short 1-2 min Quick overview
medium 3-5 min Balanced detail
detailed 5-10 min Comprehensive

Example Flow

User: "Summarize this: https://www.youtube.com/watch?v=dQw4w9WgXcQ"

  1. Start job:
curl -s -X POST http://127.0.0.1:3050/api/summarize 
  -H "Content-Type: application/json" 
  -d '{"url":"https://www.youtube.com/watch?v=dQw4w9WgXcQ","length":"short","voice":"podcast"}'
  1. Poll status with the returned jobId
  2. When complete, send the audioUrl to the user

Text-Only Summary (No Audio)

For faster, cheaper text-only summaries:

curl -s -X POST http://127.0.0.1:3050/api/quick-summary 
  -H "Content-Type: application/json" 
  -d '{"url":"YOUTUBE_URL","length":"short"}'

Troubleshooting

"Video may not have captions" - The video needs subtitles enabled on YouTube - Auto-generated captions may take time on new videos

Audio URL not working - Ensure BASE_URL in .env is publicly accessible - Check firewall allows traffic on port 3050

Cost Per Summary

Service Cost
Supadata ~$0.001
OpenRouter ~$0.005-0.02
ElevenLabs ~$0.05-0.15
Total ~$0.06-0.17