ybouhjira/claude-code-tts
π Text-to-Speech MCP plugin for Claude Code - hear audio feedback while coding (OpenAI TTS)
Installation
npx claude-code-ttsAsk AI about ybouhjira/claude-code-tts
Powered by Claude Β· Grounded in docs
I know everything about ybouhjira/claude-code-tts. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
Claude Code TTS Plugin
A Text-to-Speech MCP server plugin for Claude Code that converts text to speech using OpenAI's TTS API. Get audio feedback from Claude as you work!

Features
- Deterministic Auto-Speak: Every Claude response is automatically spoken (via Stop hook)
- 6 High-Quality Voices: alloy, echo, fable, onyx, nova, shimmer
- Worker Pool Architecture: Non-blocking queue with concurrent processing
- Mutex-Protected Playback: One audio plays at a time, no overlapping
- Cross-Platform: macOS (afplay), Linux (mpv/ffplay/mpg123), Windows (PowerShell)
- Standalone CLI:
speak-textbinary for direct TTS without MCP
Quick Install
# One-liner installation
curl -fsSL https://raw.githubusercontent.com/ybouhjira/claude-code-tts/main/install.sh | bash
Or install manually:
git clone https://github.com/ybouhjira/claude-code-tts.git ~/.claude/plugins/claude-code-tts
cd ~/.claude/plugins/claude-code-tts
make install
Requirements
- Go 1.21+ (for building from source)
- OpenAI API Key with TTS access
- Audio Player:
- macOS:
afplay(built-in) - Linux:
mpv,ffplay, ormpg123 - Windows: PowerShell (built-in)
- macOS:
Configuration
Set your OpenAI API key:
export OPENAI_API_KEY="sk-..."
Or add to your shell profile (~/.zshrc or ~/.bashrc).
Architecture
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Claude Code β
β β β
β MCP Protocol β
β β β
β ββββββββββββββββββββββββΌβββββββββββββββββββββββββββββββ β
β β TTS MCP Server (Go) β β
β β βββββββββββββββββββββββββββββββββββββββββββββββ β β
β β β Tool Handlers β β β
β β β speak(text, voice) β tts_status() β β β
β β βββββββββββββββ¬ββββββββββ΄ββββββββββββββββββββββ β β
β β β β β
β β βββββββββββββββΌββββββββββββββββββββββββββββββ β β
β β β Worker Pool (2 workers) β β β
β β β βββββββββββ βββββββββββββββββββββββ β β β
β β β β Job βββββΊβ Queue (50 slots) β β β β
β β β β Submit β ββββββββββββ¬βββββββββββ β β β
β β β βββββββββββ β β β β
β β β ββββββββββΌβββββββββ β β β
β β β β Worker 1 β 2 β β β β
β β β ββββββββββ¬βββββββββ β β β
β β βββββββββββββββββββββββββββββββββββββββββββββ β β
β β β β β
β β ββββββββββββββββββββββββββββββΌβββββββββββββββββββ β β
β β β OpenAI TTS API β β β
β β β POST /v1/audio/speech β β β
β β β Model: tts-1 β β β
β β βββββββββββββββββββββ¬βββββββββββββββββββββββββββββ β β
β β β β β
β β βββββββββββββββββββββΌβββββββββββββββββββββββββββββ β β
β β β Audio Player (Mutex Protected) β β β
β β β macOS: afplay β Linux: mpv β Win: PowerShell β β β
β β βββββββββββββββββββββββββββββββββββββββββββββββββββ β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Usage
speak(text, voice)
Convert text to speech and play it aloud.
Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
text | string | Yes | Text to speak (max 4096 chars) |
voice | string | No | Voice to use (default: alloy) |
Available Voices:
| Voice | Description |
|---|---|
alloy | Neutral, balanced |
echo | Male, warm |
fable | British accent |
onyx | Deep male |
nova | Female, friendly |
shimmer | Soft female |
Example:
Use the speak tool to say "Build completed successfully!" with the nova voice.
tts_status()
Get the current status of the TTS system.
Returns:
{
"worker_count": 2,
"queue_size": 50,
"queue_pending": 0,
"total_processed": 15,
"total_failed": 0,
"is_playing": false,
"recent_jobs": [...]
}
Automatic TTS (Deterministic)
This plugin includes a Stop hook that automatically speaks the first sentence of every Claude response. No configuration needed - it just works.
How it works:
Claude responds β Stop hook fires β First sentence extracted β Audio plays
The hook runs in the background and won't block Claude's responses.
speak-text CLI
A standalone binary for direct TTS without going through MCP:
# Basic usage
speak-text "Hello world"
# With voice selection
speak-text -voice onyx "Error occurred"
Located at ~/.claude/plugins/claude-code-tts/bin/speak-text after installation.
Project Structure
claude-code-tts/
βββ cmd/
β βββ tts-server/
β β βββ main.go # MCP server entry point
β βββ speak-text/
β βββ main.go # Standalone CLI binary
βββ hooks/
β βββ auto-speak.sh # Stop hook for deterministic TTS
βββ internal/
β βββ audio/
β β βββ player.go # Cross-platform audio playback
β βββ server/
β β βββ server.go # MCP server & tool handlers
β β βββ worker.go # Worker pool implementation
β βββ tts/
β βββ openai.go # OpenAI TTS client
βββ plugin.json # Plugin metadata + hook config
βββ Makefile # Build automation
βββ install.sh # One-liner installer
Building from Source
# Clone the repository
git clone https://github.com/ybouhjira/claude-code-tts.git
cd claude-code-tts
# Build
make build
# Install to Claude Code plugins
make install
# Run tests
make test
Troubleshooting
"OPENAI_API_KEY environment variable is required"
Set your OpenAI API key:
export OPENAI_API_KEY="sk-..."
"No suitable audio player found on Linux"
Install one of: mpv, ffplay, or mpg123:
# Ubuntu/Debian
sudo apt install mpv
# Fedora
sudo dnf install mpv
# Arch
sudo pacman -S mpv
Audio not playing on macOS
Check that afplay works:
# Test with a sample audio file
afplay /System/Library/Sounds/Ping.aiff
Queue is full
The default queue size is 50. If you're hitting this limit:
- Wait for current jobs to complete
- Check
tts_status()to see pending jobs - The queue will drain as jobs are processed
High latency
- OpenAI TTS API typically takes 1-3 seconds per request
- Audio files must download completely before playing
- Consider keeping messages short for faster feedback
API Costs
This plugin uses OpenAI's tts-1 model:
- Cost: ~$0.015 per 1,000 characters
- Example: "Hello, world!" (13 chars) = ~$0.0002
Contributing
Contributions are welcome! Please see CONTRIBUTING.md for guidelines.
License
MIT License - see LICENSE for details.
Credits
- OpenAI TTS API
- mcp-go - Go MCP implementation
- Model Context Protocol
