📦

Wispr Action

Turn your perfectly transcribed speech into real-world actions

0 installs

Trust: 34 — Low

Other

Ask AI about Wispr Action

I know everything about Wispr Action. Ask me about installation, configuration, usage, or troubleshooting.

0/500

Loading tools...

Reviews

Documentation

🎙️ Wispr Action

Transform your voice dictations into automated actions. Wispr Action monitors your Wispr Flow transcriptions and uses Claude AI to intelligently route commands and execute automated tasks.

Stay updated on AI

Features

Intelligent Command Routing - Claude AI automatically matches voice commands to configured actions
Flexible Actions - Execute local scripts or trigger HTTP requests
Web Dashboard - Intuitive UI for managing commands and testing
Real-time Monitoring - Live status and execution logs
System Tray App - Easy start/stop controls from your menu bar
Parameter Extraction - AI-powered parameter parsing from natural language

Quick Start

Wispr Action requires Wispr Flow and an Anthropic Claude API key.

Installation

Clone this repository and navigate to the directory:

git clone <repository-url>
cd wispr-action

Create a .env file from the provided template:

cp env.example .env

Edit .env and add your Anthropic API key:

ANTHROPIC_API_KEY=your_api_key_here

Run the quickstart script:

./quickstart.sh

The quickstart script will automatically set up your virtual environment, install dependencies, validate your configuration, and launch the application.

Once started, the web dashboard will open at http://localhost:9000, the system tray app will appear in your menu bar, and the monitor will begin watching for voice commands.

Usage

Getting Started with the Default Command

When you first set up Wispr Action, a default welcome command is automatically created to help you test the system. You can trigger it by saying:

"Command, run default command"
"Command, run the default command"
"Command, execute default command"

This will make your computer say "Welcome to Wispr Action! Go ahead and create your first command." using AppleScript. It's a simple way to verify everything is working before creating your own commands.

Managing Commands

Commands are managed through the web dashboard interface. Creating new commands is straightforward through the UI, where you can configure command names, descriptions, parameters, and actions (scripts or HTTP requests). All commands are automatically saved to commands.json, which Wispr Action uses to route and execute your voice commands.

Parameters support several input types (string, number, email, url, boolean) plus an options enum type. When you choose options, provide the allowable values (strings or integers) and the parser will only ever pick from that list—perfect for things like repo names or environment choices.

MCP Clients (Stripe + Linear)

Wispr Action now understands Model Context Protocol servers. This lets you connect tools such as Stripe and Linear, securely store their API keys, and expose each MCP tool as a voice command.

Open the MCP Clients card in the dashboard and click Add MCP Client.
Choose a transport:
- HTTP for remote MCP servers that expose the Streamable HTTP transport (Stripe’s hosted MCP uses https://mcp.stripe.com).
- SSE for legacy MCP servers that still expose /.well-known/mcp over Server-Sent Events (e.g., older Linear deployments).
- StdIO if you run a local MCP server process (command, optional args, working directory, env variables).
Define required secrets (e.g., api_token). Actual secret values are saved to your macOS Keychain via the keyring library—mcp_servers.json only tracks metadata.
Click Save Client, then Save Secret Values, and finally Test Connection. A successful test lists the number of tools exposed by the MCP server.
Every enabled MCP tool is immediately available as a “virtual command” (you can speak to them without creating anything else). When you want more control, open the command editor, switch the Action to MCP Tool, pick the client/tool, and use Import Tool Parameters to populate the parameter table from the tool’s JSON schema.

Example Configurations

Vendor	Transport	Headers / Env	Secret Fields
Linear	SSE (`https://mcp.linear.app/sse`)	`Authorization: Bearer {{api_token}}`	`api_token` (Linear personal API key)
Stripe	HTTP (`https://mcp.stripe.com`)	`Authorization: Bearer {{stripe_api_key}}` (or OAuth once supported)	`stripe_api_key`

Tip: use templating ({{secret_name}}) anywhere a secret should be injected—headers for SSE servers or env vars for StdIO servers.

Example Voice Prompts

“Command, create a Linear ticket for Sahar to finish the mobile table view.”
“Command, open a Stripe test charge for 25 dollars to Sahar’s card.”

Both prompts will route through the MCP tool definitions you expose. When you need custom defaults (e.g., hard-coded team IDs), create a saved MCP command, set the defaults in the command editor, and they’ll be available in voice prompts.

Example Commands

Script Command with Optional Parameters

Name: Process Emails
Description: Run email processing script for a given inbox
Parameters:
  - email (string, required): The inbox address
  - days (number, optional): Days back to process
Action: Run Script
  Path: ~/scripts/process_emails.py
  Args: --email={email} [--days={days}]

Usage:

"Command, run email processor for sahar@gmail.com" (uses default days)
"Command, process emails for sahar@gmail.com from last 30 days" (specifies days)

Note: Wrap optional parameters in square brackets [] in your template. See OPTIONAL_PARAMETERS.md for details.

Script Command with Virtual Environment

For scripts that require a specific Python environment and environment variables:

Name: Refresh Submitted Events
Description: Refresh events using project's virtualenv and .env file
Parameters:
  - calendar_id (number, required): The calendar ID
Action: Run Script
  Path: ~/luma-automator/backend/scripts/refresh_submitted_events.py
  Args: {calendar_id}
  Python Interpreter: ~/luma-automator/backend/venv/bin/python
  Environment File: ~/luma-automator/backend/.env
  Working Directory: ~/luma-automator/backend

Usage: "Command, refresh submitted events for calendar 2"

Note: The optional fields allow you to:

Python Interpreter: Use a specific Python from a virtualenv
Environment File: Load environment variables from a .env file
Working Directory: Set the directory to run the script from

Tip: Use the "Import from launch.json" button to automatically fill in these fields from your VS Code launch.json configuration.

HTTP Command

Name: Play Music
Description: Play music via Spotify API
Parameters:
  - query (string, required): Song or artist name
Action: Call HTTP
  Method: POST
  URL: http://localhost:3000/api/music/play
  Body: {"query": "{query}"}

Usage: "Command, play bohemian rhapsody"

Starting the Monitor

Click "Start Monitor" in the dashboard or system tray
Speak commands with the activation word (default: "command")
The system will automatically parse and execute matching commands

Testing Commands

Use the Test Panel in the dashboard:

Enter a test phrase
Click "Parse Command" to see how it's interpreted
Review extracted parameters
Click "Execute" to run the command

Architecture

Wispr Action operates as an intelligent automation bridge between voice input and action execution:

The monitor component continuously polls the Wispr Flow database for new transcriptions (default: every 1-2 seconds)
When a transcript contains the activation word, the full text is sent to Claude AI along with all enabled commands as available "tools"
Claude analyzes the natural language input, determines which command matches best, and extracts any required parameters
The executor component runs the matched command with the extracted parameters, either executing a local script or making an HTTP request
Results are logged and displayed in real-time through the web dashboard and system logs

Configuration

All configuration is managed through environment variables in the .env file:

# Required
ANTHROPIC_API_KEY=your_key_here

# Optional (with defaults shown)
WISPR_DB_PATH="~/Library/Application Support/Wispr Flow/flow.sqlite"
ACTIVATION_WORD=command
POLL_INTERVAL=1.5
WEB_PORT=9000
CONFIRM_MODE=false
LLM_MODEL=claude-haiku-4-5

Configuration Options:

ANTHROPIC_API_KEY - Your Anthropic API key (required)
WISPR_DB_PATH - Path to Wispr Flow database (quote paths with spaces)
ACTIVATION_WORD - Word that triggers command processing
POLL_INTERVAL - Database polling frequency in seconds
WEB_PORT - Port for the web dashboard
CONFIRM_MODE - Enable confirmation prompts before execution (useful for testing)
LLM_MODEL - Claude model to use (supports all Anthropic models)

License

MIT License - see LICENSE file for details