Loading...

VoiceMode enables natural voice conversations with Claude Code, offering hands-free interaction through speech-to-text and text-to-speech capabilities, with local and remote options.
Boost this tool
Subscribe to listing upgrades or segmented pushes.
VoiceMode enables natural voice conversations with Claude Code, offering hands-free interaction through speech-to-text and text-to-speech capabilities, with local and remote options.
VoiceMode offers a balance between convenience and security. Using local voice services and carefully configuring permissions can mitigate risks, while enabling remote agent functionality and relying on cloud services introduces potential vulnerabilities.
Performance depends on the chosen voice services (local vs. cloud) and the processing power of the system. Local services may require more resources but offer lower latency.
Using cloud-based voice services (e.g., OpenAI) incurs costs based on API usage. Local services eliminate these costs but require initial setup and resource investment.
brew install ffmpeg node portaudio{
"mcpServers": {
"voice-mcp": {
"command": "uvx",
"args": ["voice-mcp"],
"env": {
"OPENAI_API_KEY": "your-openai-key"
}
}
}
}OPENAI_API_KEY/voicemode:installInstalls dependencies and local voice services required for VoiceMode.
Installs system packages and configures environment.
/voicemode:converseStarts a voice conversation with Claude Code.
Allows interaction with the agent, potentially leading to unintended actions based on voice commands.
voicemode agent startStarts the operator agent in a tmux session, enabling remote access.
Exposes the system to external connections and potential unauthorized access.
voicemode agent stopStops the operator agent.
Kills the tmux window, interrupting any ongoing processes.
voicemode agent sendSends a message to the operator agent, potentially triggering actions.
Can initiate actions based on the content of the message.
voicemode config editOpens the configuration file for VoiceMode, allowing modification of settings.
Incorrect configuration can lead to security vulnerabilities or unintended behavior.
Environment Variable
mcp__voicemode__conversemcp__voicemode__serviceVoiceMode offers a balance between convenience and security. Using local voice services and carefully configuring permissions can mitigate risks, while enabling remote agent functionality and relying on cloud services introduces potential vulnerabilities.
Autonomy depends on the configuration of the agent and the permissions granted. Exercise caution when enabling remote agent functionality.
Production Tip
Monitor resource usage and network activity when using remote agent functionality to ensure stability and security.
VoiceMode itself is open-source, but using cloud-based voice services may incur costs.
Yes, by installing and configuring local voice services like Whisper and Kokoro.
Follow the instructions in the README to start the agent in a tmux session and configure the necessary settings.
Enabling the remote agent exposes the system to external connections, so it's important to configure it securely and monitor for unauthorized access.
Check terminal/app permissions, ensure pulseaudio packages are installed on WSL2, and verify system audio settings.
Yes, by configuring the VOICEMODE_VOICE environment variable or editing the voicemode.env file.
The language support depends on the underlying voice services used. Check the documentation for Whisper and Kokoro for supported languages.