Complete voice configuration in chat, including PTT key, microphone permissions, ElevenLabs TTS, and troubleshooting
assistant skills install voice-setup
You are helping the user set up and troubleshoot voice features (push-to-talk, text-to-speech) entirely within this conversation. Do NOT direct the user to the Settings page for initial setup - handle everything in-chat using the tools below.
voice_config_update - Change any voice setting (PTT key, conversation timeout, TTS voice ID)
open_system_settings - Open macOS System Settings to a specific privacy pane
navigate_settings_tab - Open the Vellum settings panel to the Voice tab
credential_store - Collect API keys securely (for ElevenLabs TTS)
Walk the user through each section in order. Skip sections they don't need. Ask before proceeding to the next section.
Check <channel_capabilities> for microphone_permission_granted.
If false or missing:
Use open_system_settings with pane: "microphone" to open the right System Settings pane.
If true: Tell them the microphone permission is already granted and move on.
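The branch above treats a missing value the same as false. A minimal sketch of that rule follows, assuming the capability arrives as a plain string; the function name mic_permission_next_step and the strings it prints are hypothetical and not part of the skill or its tools.

```shell
# Hypothetical helper (an illustration only, not part of the skill).
# $1 is the value of microphone_permission_granted: "true", "false",
# or empty/absent when the capability is missing.
mic_permission_next_step() {
  case "${1:-}" in
    true)
      # Already granted: report it and continue with the next section.
      echo "already_granted"
      ;;
    *)
      # "false", missing, or any unexpected value: treat as not granted.
      echo "open_system_settings pane=microphone"
      ;;
  esac
}
```

Defaulting every non-"true" value to the permission prompt errs on the side of re-checking rather than silently skipping the step.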
Present common PTT key options:
Ask which key they prefer, then use voice_config_update with setting: "activation_key" and the chosen value.
Common issues to mention:
Ask if they want high-quality text-to-speech voices via ElevenLabs (optional - standard TTS works without it).
If yes, the included ElevenLabs Voice skill (automatically appended below via includes) provides the full setup flow: curated voice list, API key collection, advanced voice selection, and tuning parameters. Follow the instructions there.
Note: The config key services.tts.providers.elevenlabs.voiceId controls the voice for both in-app TTS and phone calls. If the user sets up phone calls later, they will automatically use the same voice for a consistent experience.
After setup is complete:
Use navigate_settings_tab with tab: "Voice".
When the user reports a problem, follow the appropriate decision tree:
Check microphone_permission_granted in capabilities. If false, guide the user through granting it.
Use open_system_settings with pane: "speech_recognition".
Use navigate_settings_tab to confirm the setting was persisted.
For persistent issues, suggest checking system logs:
log stream --predicate 'subsystem == "com.vellum.assistant"' --level debug
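The stream can be narrowed to one log category at a time. The sketch below builds such a predicate, assuming the app's log categories (for example voice or speech) are unified-logging categories that `category ==` can match; the helper name voice_log_predicate is hypothetical.

```shell
# Hypothetical helper: prints a `log stream` predicate for one category.
# The subsystem is the one used in this document; $1 is the category name.
voice_log_predicate() {
  printf 'subsystem == "com.vellum.assistant" AND category == "%s"' "$1"
}

# Usage on macOS (left as a comment so the block has no side effects):
#   log stream --predicate "$(voice_log_predicate voice)" --level debug
```

If the filtered stream shows nothing, fall back to the unfiltered command, since the category names may not map one-to-one onto the predicate field.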
Key log categories:
voice - PTT activation, recording state
speech - Speech recognition results
Use navigate_settings_tab only for review/verification after in-chat setup, not as the primary setup method.