May 16, 2026

Dictation for Writers: How to Draft a Blog Post by Talking to Your Mac

Professional writers have a secret: most of them do not enjoy first drafts. The blank page is the hardest part. Staring at a cursor, typing a sentence, deleting it, retyping it — this is the friction that separates “I want to write” from “I wrote.” Dictation removes that friction by letting you speak a draft into existence. This post is about how to make that process produce something publishable.

Why writers need dictation more than anyone

Developers use dictation for PR descriptions and Slack messages. Executives use it for emails. But writers — the people whose job is literally producing text — are the most underserved group in the dictation market. Most dictation tools are designed for short-form communication: quick messages, brief notes, rapid replies. Writers need something different: sustained, structured, long-form dictation that produces drafts you can actually edit.

The good news is that on-device dictation has reached the quality level where this is possible. The models are accurate enough for extended sessions. The cleanup pipelines are sophisticated enough to turn speech into prose. What is missing is not the technology — it is the workflow. This post is the workflow.

The writer's dictation workflow

There are three stages to producing a blog post by dictation. Skipping any of them produces garbage. Doing all three produces something that reads like you wrote it — because you did.

Stage 1: The verbal outline (2-3 minutes)

Before you dictate a single sentence of the actual post, talk through the structure. Out loud. Like you are explaining the post idea to a friend over coffee. This serves two purposes: it clarifies your thinking, and it gives you a scaffold to follow when you start the real dictation. Here is what a verbal outline actually sounds like:

“Okay, the post is about dictation for writers. The hook is that writers are underserved by dictation tools because most tools are built for short-form. Then I want to do the workflow in three stages — outline, first draft, cleanup. Then a section with real examples showing raw transcription vs cleaned up prose. Then a section on editing, because dictation changes how you edit. Then maybe a section on common problems writers encounter and how to fix them. Then a conclusion.”

That took 20 seconds to dictate. It is not prose. It is a map. But now you know where you are going, and you will not get lost in the middle of a paragraph wondering what your point was.

Stage 2: The first-draft brain dump (15-25 minutes)

Now you talk through the actual post. Follow your outline, but do not be rigid. If a better point occurs to you, go there. The goal is volume and coherence — get the ideas out of your head in the right order. Do not stop to edit. Do not rephrase. Do not worry about word choice. If a sentence comes out awkward, keep going. You will fix it in editing. Right now, you are not a writer — you are a speaker who happens to be recording text.

Key technique: pause between paragraphs. A dictation tool with paragraph detection will break the text naturally, but giving it clear verbal cues helps. Say “new paragraph” explicitly, or pause for a full breath. The cleaner the structure is in the raw transcription, the less editing you do later.

Stage 3: The cleanup pass (10-15 minutes)

This is where the magic happens — and where most writers give up on dictation because they skip it. Raw transcription is messy. It has filler words, run-on sentences, verbal tics, and places where you lost your train of thought mid-sentence. A good dictation tool with a cleanup pipeline handles the mechanical fixes: filler word removal, punctuation, capitalization, paragraph structure. But you still need a human pass for the things the pipeline cannot know: tone consistency, factual accuracy, transitions between sections, and that one sentence where you said “it's like, you know, the thing where” and the model had no idea what to do with it.

Real example: from speech to publishable draft

Let me show you what this actually looks like. I dictated the following section for a different post. Here is the raw transcription, the post-cleanup version, and the final edited version — all three stages.

Raw transcription

“so the thing about on-device dictation is that like people think it's a compromise right they think oh you know I'm giving up accuracy to get privacy but actually that framing is wrong because the question is not really about accuracy versus privacy it's about what kind of tool you want to use every day um a cloud dictation tool has to phone home every time you press the hotkey and that creates latency and also a dependency on internet and also a dependency on the company still existing in five years and an on-device tool just works it works on a plane it works in a coffee shop with bad wifi it works forever because the model is on your machine and nobody can take it away from you so the trade-off is not really accuracy for privacy it's convenience for control”

After automatic cleanup

People think on-device dictation is a compromise — that you are giving up accuracy to get privacy. But that framing is wrong. The question is not accuracy versus privacy. It is about what kind of tool you want to use every day.

A cloud dictation tool phones home every time you press the hotkey. That creates latency. It creates a dependency on internet. It creates a dependency on the company still existing in five years.

An on-device tool just works. It works on a plane. It works in a coffee shop with bad WiFi. It works forever because the model is on your machine and nobody can take it away.

The trade-off is not accuracy for privacy. It is convenience for control.

Final edited version

Most people frame on-device dictation as a compromise: you trade accuracy for privacy. That framing misses the point. The real question is what kind of tool you want to depend on.

Cloud dictation requires a round-trip to a server every time you press the hotkey. That means latency. It means you cannot work offline. And it means your tool stops working the day the company behind it shuts down.

On-device dictation works anywhere — on a plane, in a coffee shop with spotty WiFi, in five years when the vendor has been acquired or pivoted. The model is on your machine. Nobody can revoke it.

The trade-off is not accuracy for privacy. It is convenience for control.

Notice the progression: the raw transcription is a single run-on thought. The automatic cleanup splits it into paragraphs, removes filler words, and fixes basic sentence structure. The final edit tightens the language, improves word choice, and makes it read like deliberate writing. Total time from “I have an idea” to publishable paragraph: about 4 minutes.

How dictation changes the editing process

Editing dictated prose is different from editing typed prose. When you type a draft, you have already done micro-editing at the sentence level — you fixed word choices and restructured phrases as you went. The editing pass is detail-oriented because the draft is already mechanically clean.

When you dictate a draft, you have done none of that. The editing pass has to do more work. But the draft itself is often better structured — your spoken paragraphs have a natural flow that typed paragraphs, assembled sentence by sentence, often lack. Editing becomes less about fixing structure and more about tightening language. It is a different kind of work, and many writers find it faster.

The sweet spot: dictate the first draft, then do one thorough editing pass. Do not try to make the dictation perfect. Do not edit as you dictate. Let the two modes — speaking and editing — stay separate. Speaking is for getting ideas out. Editing is for making them good. When you try to do both at once, you do neither well.

Common problems writers hit (and how to fix them)

“I feel self-conscious talking to my screen”

Everyone feels this. It goes away. Your brain needs about three dictation sessions to stop treating the microphone as an audience. Treat it like a phone call to yourself. Wear headphones. Close your eyes if it helps. The self-consciousness is a temporary barrier, not a permanent one.

“My dictated sentences sound stiff and unnatural”

You are probably “writing out loud” — composing sentences in your head as if you were typing them, then reading them into the microphone. Stop doing that. Speak like you are explaining the topic to a smart friend. Use contractions. Start sentences with “and” and “but.” Be conversational. The cleanup pipeline handles the formality; your job is to sound human.

“The transcription gets my technical terms wrong”

On-device models are trained on general web text, not your domain vocabulary. If you write about specialized topics, build a custom vocabulary list. Most importantly, enunciate unfamiliar terms slightly more deliberately — the model is doing its best with acoustics, and “epigenetics” sounds a lot like “epic genetics” if you mumble it. For critical terms, you can always type them in the editing pass.

“I lose my train of thought mid-sentence”

This is normal. When it happens, stop dictating. Release the hotkey. Take a breath. Gather the thought. Then start again. Better to have a clean break than a trail of verbal confusion that you have to untangle later. The outline helps here — if you lose your place, glance at it and pick up the next point.

Why on-device dictation matters for writers

Writers working on unpublished material — book manuscripts, confidential articles, ghostwritten content under NDA — have a legitimate reason to care about where their voice data goes. Sending a book draft through a cloud dictation service means a third-party server receives the complete contents of your unpublished work. For most professional writers, that is not acceptable.

On-device dictation solves this. The transcription runs locally on your Mac, the audio is processed on your machine and discarded, and a local cleanup pass tidies the transcript before the result lands in your clipboard (or at your cursor, if you opt into auto-paste). There is no intermediary. For writers working on sensitive or competitive material, this is not optional — it is the only way dictation can fit into their workflow.

The bottom line

Dictation will not write the post for you. It will not make you a better writer. What it will do is remove the single biggest source of friction between you and a finished draft: the physical act of typing. For writers who can speak faster than they type — which is nearly all of them — that friction removal is transformative.

The workflow is: outline verbally, dictate a full first draft without editing, run cleanup, do one thorough editing pass. Keep the speaking and editing modes separate. Accept that the first few sessions will feel awkward. Within a week, you will have a new skill that produces drafts in a fraction of the time.

The blank page is not going to beat itself. But you can outrun it by talking.

Read: How I Write 3x Faster Using Dictation (As a Developer Who Hates Voice Assistants) →
Read: Fixing RSI — How I Switched From Typing to Dictation Without Going Insane →