Voice-Driven Prompting with Handy
Speed up your prompting workflow by speaking commands instead of typing - fully offline and private.
⏱️ Estimated time: 5–10 minutes
Typing long, detailed prompts can slow you down - especially when you know exactly what you want to say. In this final module you’ll set up Handy, an offline speech-to-text tool, so you can speak your prompts naturally and let Copilot do the rest. This ties together everything you’ve learned: voice-prompt a skill invocation, query your MCP servers, or dictate notes to Obsidian - all hands-free.
What is Handy?
Handy is a free, open-source, offline speech-to-text tool. It uses OpenAI’s Whisper model running locally on your machine - no audio ever leaves your device.
Install Handy
Download the latest release from handy.computer or use a package manager:
macOS
brew install --cask handy
Windows
winget install cjpais.Handy
After installing, grant microphone and accessibility permissions when prompted.
First launch is slow. Handy needs to download a Whisper model on first run (100 MB–3 GB depending on model size). This only happens once. Subsequent launches are fast.
Initial Configuration
After installing Handy, take a moment to configure it for the best experience:
- Open Handy from your applications menu or system tray
- Set your hotkey: Go to Settings and assign a comfortable key combo (e.g.,
Ctrl+SpaceorCmd+Shift+Space). Pick something that won’t conflict with your editor. - Choose a model: Start with Whisper Medium for a good balance. You can upgrade later.
- Test it: Press your hotkey, say “Hello, this is a test,” and release. The text should appear wherever your cursor is.
Pro tip: Set the hotkey to something you can press with one hand while keeping the other on your mouse - this makes dictation feel natural during your workflow.
How It Works
- Press your hotkey to start recording
- Speak your prompt naturally - full sentences work best
- Release the hotkey - Handy transcribes with Whisper and pastes into the active text field
- Copilot responds to your spoken prompt
It’s that simple. No cloud, no accounts, no subscriptions.
Tips for Better Transcription
- Speak clearly but at a natural pace - you don’t need to slow down
- Use punctuation words like “period”, “comma”, or “new line” if your model supports them
- Pause briefly before and after technical terms to improve accuracy
- Spell out acronyms the first time (e.g., say “M-C-P servers” instead of “MCP servers”) until the model learns your vocabulary
Troubleshooting
| Problem | Solution |
|---|---|
| No transcription output | Check that microphone permissions are granted in system settings |
| Text appears in wrong window | Make sure the target text field has focus before pressing the hotkey |
| Poor accuracy | Switch to a larger model or reduce background noise |
| Hotkey doesn’t work | Check for conflicts with other apps; try a different key combo |
| High CPU usage | Switch to a smaller model or enable GPU acceleration |
Privacy
Handy runs entirely on your computer. No cloud processing, no data collection, no subscriptions. All audio and text stay local. It’s open source (MIT license), making it a privacy-friendly way to speak your Copilot commands.
References: Handy website · Handy GitHub repo · Download page
🎯 Try it yourself: Install Handy, configure a hotkey, and use voice to ask Copilot CLI: “List all the skills I have installed and describe what each one does.” Then try a multi-step voice prompt: “Find all markdown files in this project and create a summary of their contents.”
🎉 Workshop Complete!
Congratulations! You’ve completed the full workshop. Here’s what you’ve accomplished:
- ✅ Installed and authenticated Copilot CLI
- ✅ Connected MCP servers (WorkIQ, MSX, Power BI)
- ✅ Integrated Obsidian for note-taking
- ✅ Installed your first Agent Skill
- ✅ Learned to create custom skills
- ✅ Set up voice-driven prompting
Recommended Next Steps
- Daily driver: Start using Copilot CLI for your everyday tasks - the more you use it, the more workflows you’ll discover to automate
- Build team skills: Create custom skills for your team’s most common workflows and commit them to your repos
- Connect more data: Add MCP servers for the tools your team uses daily
- Explore the marketplace: Browse skills.sh for community-built skills
- Share feedback: Help improve the tools by reporting issues and suggesting features
What’s next? Start using Copilot CLI in your daily workflow! Create custom skills for your team’s specific needs, connect additional MCP servers, and explore the skills.sh marketplace for community-built skills.