Speak English, send Spanish: dictation that's translation-native

use-case

Every dictation tool on the market can turn your Spanish into Spanish text. Point one at a German speaker and you get German on the screen. That part is solved. The thing almost nobody does is the move you actually need when you're writing across a language barrier: speak in the language you think in, and have finished text show up in the language your reader speaks — as one motion, not two.

That gap is small to describe and exhausting to live inside. If your client reads only Spanish and you think in English, every message is a detour. You draft in your head, you reach for a translator, you paste, you copy the result back, you fix whatever the formatting mangled on the way through. Multiply that by a day of replies and you start writing less than you mean to. The barrier isn't language. It's the friction stacked on top of it.

davr closes that gap by treating translation as part of dictating, not as a separate errand. You pick the output language once, you talk in yours, and the text that drops into your email, your chat, or your CRM is already in theirs. Speak English, send Spanish. No second tab, no paste shuffle, no app-switch.

The detour nobody budgets for

Picture the actual sequence. You're answering a supplier in Mexico City, and your Spanish is good enough to read but slow to write under pressure. So you write the reply in English first, because that's where you're fast. Then you open a translator in another window. You paste your draft. You wait. You copy the Spanish back. You switch to your email, paste again, and now the line breaks are wrong and a stray quotation mark turned into something weird, so you clean that up too. Then the supplier replies, and you do the whole loop again.

Each round trip costs maybe ninety seconds. That sounds trivial until you count the round trips. A customer-support agent handling bilingual tickets, a project manager coordinating an offshore team, a nurse messaging a family who speaks Tagalog — these people don't make the detour once. They make it dozens of times a day, and the cost isn't only minutes.

The quieter cost is that you flatten yourself. When writing in a second language is slow, you write shorter. You drop the warmth. The email that would have had a friendly opener in your own language becomes a terse block of instructions, because the friction taxes every extra word. The reader on the other end doesn't experience your competence or your kindness — they experience the residue of your tooling. Across a working relationship, that adds up to something real.

And the standard fixes don't actually fix it. General-purpose voice translators — Google Translate's conversation mode, the various Spanish-English voice apps — are genuinely good at turning speech in one language into text in another. (Air Apps roundup) But they live in their own window. They don't type into the app you're actually working in. You still have to ferry the result across to your email or your ticketing tool by hand. The translation got easier; the detour stayed.

What "translation-native" actually means

Here's the distinction that matters, and it's easy to miss because the marketing language blurs it.

Most dictation tools are built to transcribe within the language you speak. You talk in Spanish, you get Spanish. You talk in Japanese, you get Japanese. That's the design goal, and the good ones are excellent at it. Wispr Flow, for instance, supports a large set of languages (Wispr language research) — but its guidance to multilingual users is to narrow the set you have active, because the tool is optimizing for faithfully transcribing the one language you're speaking, not for converting it into a different one. (Wispr multi-language docs) If you want output in another language, the path is a separate step after the fact: highlight the text you already dictated and issue a "translate to French" command on it. (Wispr features) That works, and it's a reasonable design. But notice the shape of it — you dictate, then you translate. Two moves. The translation is a thing you do to text that already exists.

Translation-native dictation inverts that. The output language is set before you speak, as a property of how the words come out. You never produce the intermediate same-language draft at all. You think in English, you talk in English, and Spanish is what reaches the page — because converting to the target language is part of the same dictate-to-output flow that injects ordinary text into your apps. There's no highlight step, because there's nothing to highlight; the first text that appears is already the text you wanted to send.

That's what davr does. Dictate in one language, output in another — Spanish, French, Japanese, and across roughly 40 languages, every one of them valid as the target you output into. The translated text is injected into whatever app is focused, exactly the way plain dictation is. Same hotkey, same motion, different language out the other side.

How you turn it on

The piece that makes this practical is that it isn't a separate app or a tool you alt-tab into — it's a setting. In davr, you turn translation on in the Audio section of settings and pick your output language from the dropdown (there's a tray shortcut for switching languages when you do it often). Once it's set, talking is talking. You press your hotkey, you speak English, and the Spanish appears where your cursor already was.

It's worth seeing this as one instance of a broader pattern rather than a single trick. davr's whole design puts an instruction between your voice and the text that comes out — and translating to another language is just one of those instructions. The same hotkey-to-text machinery that outputs another language can also answer a question or reshape your tone, which is the ask-Claude-from-a-hotkey idea applied to language. Translation isn't a bolt-on; it's one setting of a general dial.

There's a companion move for text you already have. If a paragraph is already sitting in your editor — something you wrote earlier, or pasted, or received — you can highlight it and change its language by voice using Transform Text. So the two cases are covered from both ends: dictate fresh and have it come out translated, or take existing text and convert it in place. Either way you stay in the document you're working in.

The honest framing, for anyone weighing this against a tool they already use: if you want a head-to-head, the davr vs Wispr Flow comparison goes deeper. The short version is that this is a difference in where translation sits in the workflow — before you speak versus after you've spoken — and that difference is the whole point of this page.

A note on where your words go

Translation means your speech becomes text, and that text passes through a language model to come out in the target language. So it's worth being precise about the data path, because vague privacy claims are worse than none.

davr gives you two ways to run the pipeline. You can bring your own API key — connect your own OpenAI and Anthropic accounts, and your audio and text flow through your provider account rather than through a middleman. That path is uncapped and private in the specific sense that matters: there's no davr-operated server sitting in the middle of your translations, and you fund the API usage directly at cost. Keys are stored in Windows Credential Manager, not in plaintext on disk.

Be clear-eyed about the limits. davr's on-device privacy is two separate toggles. Local runs Whisper on your machine, so the raw audio of you talking never goes to OpenAI. Privacy Mode turns off the Claude cleanup step, so that text never goes to Anthropic. Flip both on and the ordinary dictation pipeline stays entirely on your machine. Translation, though, is a different animal: converting English speech into polished Spanish is a language-model call by definition, so that step does go to whichever provider you've pointed davr at. With your own key, that's your own account; with the managed option, it's davr's metered service. There's no fully-offline translation path — the model has to see the words to render them in another language — and we'd rather tell you that plainly than imply translation happens in a vacuum. What you do get to choose is whose model sees them.

Who this is for

This stops being a feature and starts being a relief for a few specific people.

The second-language professional. You work in a company whose language isn't your first, and you spend real energy every day translating your own competence into prose that doesn't quite sound like you. Speaking in your strongest language and having the team's language come out closes the gap between how sharp you are and how sharp you read.

The expat doing local-language admin. Landlords, school forms, the bank, the utility company — life abroad is a stack of correspondence in a language you're still climbing. Dictating in English and sending the local language turns an afternoon of dread into a few spoken sentences.

The global or remote team. When the working language isn't everyone's first, the people writing in their second language quietly carry more load. A tool that lets each person speak in their strongest language and output the shared one spreads that load more evenly, and the messages get warmer because they're no longer rationed by friction.

The bilingual family. A grandparent who reads only Spanish, a grandchild who thinks in English, a parent in the middle relaying both directions. Speaking naturally and sending text the other generation can read is, in the smallest and most important way, what the tool is for.

Try it

If you write to anyone who reads a different language than you think in, the detour through a translator tab is a step you've stopped noticing. davr removes it by making translation part of dictating: set your output language, speak in yours, and finished text in theirs drops straight into the app you're already in — across roughly 40 languages, any of them a valid output target, no second window.

Connect your own OpenAI and Anthropic key and dictation is free — $0. Want the AI modes that handle translation without wiring up your own keys? Try them free for 14 days, no credit card. Either way, the test is simple: write your next cross-language message by talking, in your own language, and watch it arrive in your reader's.

Download davr — Free