SpacialVoice is the SpacialMind dictation app. Hold a hotkey, say the line, release — the transcript appears in whatever window already had focus. Editor, terminal, Slack, browser. No window switch, no copy-paste, no flow break.
Two engines under the same key. Local Whisper on-device for when the audio must not leave the machine. Hosted Whisper for 99+ languages and faster turnaround. You pick per session.
01 The loop
One global hotkey, available everywhere. The audio stream stays warm so capture begins in the first frame — no "ready?" pause.
Hold the hotkey anywhere on your desktop. The default is Alt+Space; change it in Settings.
A persistent stream is already open, so capture starts in the first audio frame.
SpacialVoice transcribes, applies your dictionary, and types the result at the cursor.
Every transcript is logged locally with words and WPM. Recopy anything with one click.
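The words and WPM figures in the history log can be derived from the transcript text and the time the hotkey was held. The sketch below is one plausible way to compute them; the struct, field names, and function names are assumptions for illustration, not SpacialVoice's actual schema.

```rust
use std::time::Duration;

/// One history row: the transcript plus the stats shown in the log.
/// Hypothetical shape, not the app's real schema.
#[derive(Debug, Clone, PartialEq)]
pub struct HistoryEntry {
    pub text: String,
    pub words: usize,
    pub wpm: f64,
}

/// Count whitespace-separated words in a transcript.
pub fn word_count(text: &str) -> usize {
    text.split_whitespace().count()
}

/// Words per minute for `words` spoken over `spoken`.
/// Returns 0.0 for a zero-length recording to avoid dividing by zero.
pub fn words_per_minute(words: usize, spoken: Duration) -> f64 {
    let secs = spoken.as_secs_f64();
    if secs == 0.0 {
        return 0.0;
    }
    words as f64 * 60.0 / secs
}

/// Build the row that would be written to the local history.
pub fn log_entry(text: &str, spoken: Duration) -> HistoryEntry {
    let words = word_count(text);
    HistoryEntry {
        text: text.to_string(),
        words,
        wpm: words_per_minute(words, spoken),
    }
}
```

Five words spoken over two seconds come out at 150 WPM under this definition.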
02 Engines
Privacy when you need it, multilingual reach when you don't. The switch is one click — your hotkey, history, and dictionary stay the same.
on-device
Six ggml models, from a 75 MB Tiny to a 3.1 GB Large-v3. Runs through whisper.cpp inside the Tauri shell. No network calls, no telemetry, no audio leaving your machine.
cloud
Streamed through the SpacialMind API to a hosted Whisper Large-v3 cluster. Auto-detect across 99+ languages with sub-second end-to-end latency. Included with Pro.
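To make the "one click" switch concrete, here is a minimal sketch of a session whose engine can change while the hotkey and dictionary stay the same. Every type and field name is hypothetical; the real settings model may look quite different.

```rust
/// The two engines described above. Variant and field names are illustrative.
#[derive(Debug, Clone, PartialEq)]
pub enum Engine {
    /// whisper.cpp on-device; `model` is a ggml model name such as "tiny".
    Local { model: String },
    /// Hosted Whisper Large-v3 behind the SpacialMind API.
    Cloud,
}

impl Engine {
    /// Only the cloud engine ever needs the network.
    pub fn needs_network(&self) -> bool {
        matches!(self, Engine::Cloud)
    }
}

/// Per-session settings: the engine can change, the rest stays put.
#[derive(Debug, Clone, PartialEq)]
pub struct Session {
    pub hotkey: String,
    pub dictionary: Vec<(String, String)>,
    pub engine: Engine,
}

impl Session {
    /// Return a copy of this session with a different engine and
    /// the hotkey and dictionary untouched.
    pub fn with_engine(&self, engine: Engine) -> Session {
        Session { engine, ..self.clone() }
    }
}
```

Keeping the engine as a single field is what makes the switch cheap in this sketch: nothing else in the session has to know which engine is active.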
03 Surfaces
SpacialVoice doesn't require integration. It uses the same OS-level keyboard path as your real keyboard, so it works in the app you already use.
VS Code, Cursor, Zed, JetBrains, Vim — any text input surface.
Inject commands, commit messages, and shell snippets straight into your shell.
Slack, Discord, Notion, Gmail, Linear — talk through the long ones.
Forms, GitHub PR descriptions, comments. If the cursor blinks, it works.
04 Dictionary
Define replacement rules for the things Whisper can't know. SpacialVoice applies them after transcription, before injection — so the typed text matches how you actually write code.
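As a rough illustration of what "after transcription, before injection" could look like, the sketch below applies spoken-to-written rules to a transcript word by word. The `Rule` type, the matching strategy (case-insensitive, whole words, first matching rule wins), and the sample rules are all assumptions; the app's real rule format is not documented here.

```rust
/// A replacement rule: what Whisper tends to hear -> what you want typed.
/// Hypothetical type for illustration.
pub struct Rule {
    pub spoken: String, // stored lowercase, matched word by word
    pub written: String,
}

impl Rule {
    pub fn new(spoken: &str, written: &str) -> Rule {
        Rule {
            spoken: spoken.to_lowercase(),
            written: written.to_string(),
        }
    }
}

/// Apply rules to a transcript. At each word position the first rule whose
/// phrase matches (ignoring case) is used. Runs of whitespace collapse to a
/// single space, and punctuation attached to a word prevents a match.
pub fn apply_dictionary(transcript: &str, rules: &[Rule]) -> String {
    let words: Vec<&str> = transcript.split_whitespace().collect();
    let lower: Vec<String> = words.iter().map(|w| w.to_lowercase()).collect();
    let mut out: Vec<String> = Vec::new();
    let mut i = 0;
    while i < words.len() {
        // Look for a rule whose phrase starts at word `i`.
        let hit = rules.iter().find_map(|rule| {
            let phrase: Vec<&str> = rule.spoken.split_whitespace().collect();
            let end = i + phrase.len();
            if !phrase.is_empty()
                && end <= words.len()
                && lower[i..end]
                    .iter()
                    .map(String::as_str)
                    .eq(phrase.iter().copied())
            {
                Some((rule.written.clone(), phrase.len()))
            } else {
                None
            }
        });
        match hit {
            Some((written, consumed)) => {
                out.push(written);
                i += consumed;
            }
            None => {
                out.push(words[i].to_string());
                i += 1;
            }
        }
    }
    out.join(" ")
}
```

With the hypothetical rules `jason -> JSON` and `use effect -> useEffect`, the transcript "parse the Jason in Use Effect" would be typed as "parse the JSON in useEffect".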
05 Built for it
One Rust crate per concern: capture, engine, hotkey, injection. Same code on all three OSes; native bindings only where required.
Local mode uses the same hyper-optimized engine ggerganov ships, wired in through whisper-rs. Hardware acceleration where the device has it.
API keys live in the OS keychain. Audio is held only in RAM until transcription finishes. History is local SQLite, on the disk you already trust.
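The "one crate per concern" split can be pictured as one trait per boundary, with a small piece of glue that runs on each press and release. The sketch below is an assumption about the shape, not the actual crate layout: the trait names are invented, the hotkey concern is left out for brevity, and the stand-in types exist only so the example runs without audio hardware.

```rust
/// Capture concern: yields the samples recorded while the hotkey was held.
pub trait AudioCapture {
    fn record(&mut self) -> Vec<f32>;
}

/// Engine concern: turns samples into text (local or cloud).
pub trait Transcriber {
    fn transcribe(&self, samples: &[f32]) -> String;
}

/// Injection concern: types text at the cursor.
pub trait TextInjector {
    fn type_text(&mut self, text: &str);
}

/// Glue for one press-and-release of the hotkey.
pub fn dictate(
    capture: &mut dyn AudioCapture,
    engine: &dyn Transcriber,
    injector: &mut dyn TextInjector,
) -> String {
    let samples = capture.record();
    let text = engine.transcribe(&samples);
    injector.type_text(&text);
    text
}

// --- Stand-ins so the sketch runs without a microphone or a model ---

/// Hands back a pre-filled buffer once, then is empty.
pub struct FakeMic(pub Vec<f32>);
impl AudioCapture for FakeMic {
    fn record(&mut self) -> Vec<f32> {
        std::mem::take(&mut self.0)
    }
}

/// Reports how many samples it received instead of transcribing.
pub struct FakeEngine;
impl Transcriber for FakeEngine {
    fn transcribe(&self, samples: &[f32]) -> String {
        format!("{} samples heard", samples.len())
    }
}

/// Collects "typed" text in a string instead of sending key events.
#[derive(Default)]
pub struct Typed(pub String);
impl TextInjector for Typed {
    fn type_text(&mut self, text: &str) {
        self.0.push_str(text);
    }
}
```

Because `dictate` only sees the traits, swapping the local engine for the cloud one, or one platform's injector for another, would not touch the glue in this design.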
06 Questions
SpacialVoice is a desktop dictation app for builders. Hold a hotkey, talk, and the transcript types itself into the app you're already focused on. Choose local Whisper for offline privacy or hosted Whisper for 99+ languages; it's your call per session.
In local mode, your audio goes nowhere: it is captured through cpal, transcribed in-process by whisper.cpp, and discarded. In cloud mode, audio is sent over HTTPS to the SpacialMind API and discarded after transcription. There is no third option.
SpacialVoice works anywhere the OS accepts keyboard input. It uses Win32 SendInput on Windows, Quartz events on macOS, and wtype / xdotool on Linux. That is the same path the keyboard already takes, so editors, terminals, chat apps, and browsers all just work.
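A minimal sketch of how the per-platform injection path might be chosen at runtime. The enum and function names are hypothetical. On Linux, the sketch assumes wtype is used under Wayland and xdotool under X11 (which matches what those tools target), and it treats a set `WAYLAND_DISPLAY` variable as a rough Wayland check.

```rust
/// Injection paths named in the text above.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Backend {
    Win32SendInput,
    QuartzEvents,
    Wtype,
    Xdotool,
}

/// Pick a backend from an OS name (same strings as `std::env::consts::OS`)
/// and, on Linux, whether the session is Wayland.
pub fn pick_backend(os: &str, wayland: bool) -> Option<Backend> {
    match (os, wayland) {
        ("windows", _) => Some(Backend::Win32SendInput),
        ("macos", _) => Some(Backend::QuartzEvents),
        ("linux", true) => Some(Backend::Wtype),
        ("linux", false) => Some(Backend::Xdotool),
        _ => None, // unsupported platform in this sketch
    }
}

/// Rough Wayland check: compositors normally set WAYLAND_DISPLAY for clients.
pub fn is_wayland_session() -> bool {
    std::env::var_os("WAYLAND_DISPLAY").is_some()
}

/// Backend for the machine this code is running on.
pub fn backend_for_this_machine() -> Option<Backend> {
    pick_backend(std::env::consts::OS, is_wayland_session())
}
```

Taking the OS name as a parameter keeps the selection logic testable on any machine; only `backend_for_this_machine` reads the real environment.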
On-device transcription is included with the Basic plan. Cloud transcription (99+ languages) is included with Pro and Ultra. Download the installer from the buttons above, sign in, pick an engine, done.
Six ggml models are available locally, spanning the Tiny, Base, Small, Medium, and Large-v3 sizes (with English-only and multilingual variants where applicable). You download them on demand from inside the app; nothing is bundled. Cloud uses Whisper Large-v3 with multilingual auto-detect.
Any microphone works. Built-in laptop mics are fine; a USB headset is better. CPU inference with the Small model fits comfortably on modern laptops; Medium and Large benefit from a GPU. The cloud engine doesn't depend on your hardware.
Local mode works offline: once a model is downloaded, you never need the network again. Cloud mode needs a connection.
SpacialVoice runs on Windows 10+, macOS 12+, and Linux (X11 and Wayland). It is built on Tauri 2 with the same Rust core on every platform. macOS builds are notarized; Windows installers are NSIS .exe.
07 Access
Local Whisper is included with Basic. Hosted multilingual Whisper unlocks at Pro.
Basic: local dictation. $20/mo (Save 20%)
Pro: adds the cloud engine. $50/mo (Save 20%)
Ultra: top-tier limits. $100/mo (Save 20%)
SpacialVoice slots into the SpacialMind toolkit. Use it on its own or alongside the rest of the family.
Vibe-coding workspace where terminals, tasks, code context, and AI agents share one focused desktop room.
Model Context Protocol server that gives your AI editor the ability to create projects, manage tasks, and configure agents.
Agent-first desktop IDE with a multi-panel workspace and plan-based task execution — purpose-built for vibe coding.
Download SpacialVoice and pipe your voice into anywhere a keyboard works.