Day 42 · 2026.06.29

Writing & Expression: Writing for the EarWrite for the ear · The breath of a sentence · Speech vs prose · The audiobook era

BigCat's Writing

You think readers take in your words with their eyes. In fact a voice in their head is listening. A sentence that's hard to say is tiring to read; a long sentence with no pause leaves them gasping. Today we inspect prose with a different organ—the ear: how to write for sound, how to give a sentence breath, how speech differs from prose, and how to write so you aren't lost in an age when more and more words are heard, not read.

Principle 01

Write for the Ear: Readers Hear Your Sentences

Write for the Ear
Sound · Subvocalization
The Principle in One Line

As a reader reads silently, a voice in their head sounds out the words. A sentence that's hard to say is tiring to read. The cheapest, most effective check after a draft: read it aloud—wherever your tongue trips is where to fix it.

"The sound of the language is where it all begins and what it all comes back to." — Ursula K. Le Guin, Steering the Craft (1998)
Why It Works

Cognitive science calls it subvocalization: while reading, the brain silently "pronounces" the words. So the ear instantly catches what the eye lets slide—repeated sounds, jammed transitions, a sentence too long to say in one breath. The eye skips, fills in, forgives; the ear does not—it just reports, honestly, "this is awkward." Reading aloud is the most honest editor you carry with you.

Revision in Action
The follow-up work on the status of the implementation of the relevant solution to said problem requires further clarification of ownership. We need to be clear on how this gets fixed and who owns it. (Say the original aloud: the chain of "of … of … of" strangles the breath. The rewrite reads in one.)
The implementation of the optimization of the allocation of resources requires consideration. We need to decide how to allocate resources. (Four "-tion … of" links jam the tongue; the rewrite reads in one breath.)
When to Use + Common Mistakes
  • ✓ Almost everything: emails, docs, talks, blog posts—anything read into a head
  • ✗ Never reading a draft aloud, just staring at the screen (the eye has gone numb to the awkwardness)
  • ✗ Mistaking "reads smoothly" for "less professional"—fluency and rigor don't conflict
Key References

Ursula K. Le Guin, Steering the Craft · William Zinsser, On Writing Well (on sound & rhythm) · Wang Xiaobo, My Literary Mentors (on the music of prose)

This Week's Exercise + Reflection

Take any paragraph you wrote today, stand up, and read it aloud all the way through. Mark—don't fix—every spot where you stumble, run out of breath, or have to reread. Count the marks. Reflection: why can the ear catch flaws the eye waved through? Does that mean "understood" and "sounds right" are two different things?

Principle 02

The Breath of a Sentence: Vary the Length

The Breath of a Sentence
Rhythm · Sentence length
The Principle in One Line

All long sentences and the reader can't breathe; all short ones and it's a machine gun. Alternate long and short, and the prose breathes—it makes music. Put your heaviest point in your shortest sentence, so it lands on the beat.

"This sentence has five words. Here are five more words. Five-word sentences are fine. But several together become monotonous. ... I vary the sentence length, and I create music." — Gary Provost, 100 Ways to Improve Your Writing (1985)
Why It Works

Long sentences carry complex relations—conditions, turns, layered qualifiers; short sentences create pause and emphasis. A paragraph of all long sentences blurs into mush; all short ones reads like a checklist, not a human. Real rhythm is using a long sentence to carry the reader far, then a short one to pin them down. The short sentence is the landing, the downbeat, the thud.

Monotonous: every line the same
long sentence ……
long sentence ……
long sentence ……
long sentence ……
Breathing: long and short alternate
a long sentence that carries the reader far ……
A short one.
Shorter still.
then another long one opens the scene back up …
Stop.
Draw each sentence's length as a bar—equal lengths are a metronome; variation is music
Revision in Action
We reviewed the whole incident and found the root cause was a cache stampede, which itself came from a flaw in the expiry policy, so we decided to rewrite the expiry logic and add warm-up to make sure it never recurs. We reviewed the whole incident. The root cause was a cache stampede—a flaw in the expiry policy. So we rewrote the expiry logic and added warm-up. It won't happen again. (Long, short, long, short—the last four words land hard.)
The system is slow, and the reason it's slow is that the query has no index, which means every request ends up scanning the entire table. The system is slow. That query has no index. Every request scans the whole table. We fixed it.
When to Use + Common Mistakes
  • ✓ Any prose meant to be read aloud or into the mind, especially endings, punchlines, points of emphasis
  • ✗ A run-on comma chain where the reader suffocates with no landing in sight
  • ✗ Overcorrecting to all-short sentences—punchy, yes, but stripped of the ease of carrying a thought through a turn
Key References

Gary Provost, 100 Ways to Improve Your Writing · Verlyn Klinkenborg, Several Short Sentences About Writing · Wang Xiaobo (on the rhythm of long and short)

This Week's Exercise + Reflection

Take a paragraph you wrote and note the word count beside each sentence. If the numbers cluster (say, all 20–30 words), act: break out a few short sentences so the numbers rise and fall. Read both versions. Reflection: a short sentence feels "heavy" because the pause itself speaks—so does a pause (a period, a paragraph break) count as a wordless "word"?

Principle 03

Speech vs Prose: The Audience Can't Rewind

Speech vs Prose
Speech · Spoken style
The Principle in One Line

A reader can look back; a listener can't. A script written for the ear needs: short sentences, few subordinate clauses, plain words, one idea per sentence, and the nerve to repeat the key line. Good prose that reads fine on the page can lose a listening audience the moment it's spoken.

"A speech is poetry: cadence, rhythm, imagery, sweep. A speech reminds us that words, like children, have the power to make dance the dullest beanbag of a heart." — Peggy Noonan, On Speaking Well (1998)

(Noonan was Ronald Reagan's chief speechwriter.)

Why It Works

Over two thousand years ago, Aristotle's Rhetoric Book III already split the written style (lexis graphike) from the spoken/debating style, noting each needs its own craft. The difference is the medium: prose lives by the eye—you can look back, skim, recheck; a speech lives by the ear—linear, irreversible, gone if missed. Nested clauses, long qualifiers, obscure jargon are fine on the page, but in the ear they're like asking the listener to maintain a call stack—go one level too deep and it overflows. So a speech flattens the information: one idea per sentence, key words recurring, signposts left for the ear.

Revision in Action
(prose) In light of the growth figures that, owing to channel restructuring and shifts in the external environment, failed to meet the established targets last quarter, we deem it necessary to reassess strategy. (speech) Last quarter, we missed our target. Why? The channels changed. The market changed. So today, we rethink the plan. (Broken into short sentences, a question to guide the ear—the audience keeps up.)
(prose) Given the growth figures that, owing to channel adjustments and changes in the external environment, fell short of target last quarter, we believe it is necessary to reassess our strategy. (speech) Last quarter, we missed our target. Why? The channels shifted. The environment shifted. So today, we rethink the strategy.
When to Use + Common Mistakes
  • ✓ Speeches, the opening of a presentation, video scripts, podcast narration, conference-call talking points
  • ✗ Reading a written report verbatim—rigorous on paper, a maze of clauses in the ear
  • ✗ Fearing "repetition" and refusing to restate the key line—prose avoids repetition, speech relies on it to be remembered
Key References

Peggy Noonan, On Speaking Well · Aristotle, Rhetoric, Book III · Lincoln's Gettysburg Address (272 words, every line built to be heard)

This Week's Exercise + Reflection

Pick a piece of your written prose (a report, an email) and rewrite it to be "said standing up": break the long sentences, swap in plain words, repeat one key line twice. Read both versions aloud and feel the gap. Reflection: same meaning, two versions—what's the difference? And what "eye-only" conveniences does that gap reveal your prose was quietly leaning on?

Principle 04

The Audiobook Era: Writing to Be Heard

The Audiobook Era
Audio · Linear
The Principle in One Line

More and more words are heard, not read: audiobooks, podcasts, voice on the commute, AI narration. Prose that lives by being heard needs to: move linearly, not lean on layout, keep names easy to say, and use numbers that can be spoken. Every visual cue—bold, bullets, "see the table above"—dies in the ear.

"Sound exists only when it is going out of existence." — Walter J. Ong, Orality and Literacy (1982)

This is the nature of audio: spoken and already flowing away—the listener has no page number to flip back to.

Why It Works

A listener has no page, no table of contents, no way to flip back. Prose organizes by space (headings, indents, tables); audio has only one thread—time. So you replace visual structure with spoken signposts: verbal recaps ("those were the three points"), explicit numbering ("first … second …"), recurring key words. AI narration also mercilessly exposes another flaw—obscure characters, acronyms, overlong numbers: "QPS rose to 12,847" comes out as noise; far better is "throughput roughly doubled." Prose a machine can read smoothly is usually clearer prose.

Revision in Action
As the table below shows, all three metrics improved (see the bolded section above); p99 dropped from 847ms to 112ms. All three metrics improved—let me highlight one. Our slowest batch of requests used to take nearly a second; now it's about a tenth of that. Over seven times faster. (Drop "table below" and "bolded," and turn the number into something the ear can catch.)
See Fig. 3; the K8s HPA reduced p99 from 847ms to 112ms. Auto-scaling cut our worst-case wait from about eight-tenths of a second to about a tenth. Roughly seven times faster.
When to Use + Common Mistakes
  • ✓ Anything that may be consumed by TTS, audiobooks, podcasts, or voice assistants
  • ✗ Leaving "see the table below," "as noted above," "(see footnote)" in an audio script—the ear has nowhere to "see"
  • ✗ Piling up acronyms and numbers precise to the last digit—pure noise when a machine reads them
Key References

Walter J. Ong, Orality and Literacy · Nancy Duarte, Resonate (verbal signposts and repetition) · Robert McKee, Story (designing a linear experience)

This Week's Exercise + Reflection

Take a paragraph you wrote and listen to it via your phone's "speak screen" feature. Note every spot: mispronounced words, acronyms and numbers that become noise, dead "see the figure above" references. Fix each into something the ear can catch. Reflection: if a passage gets confusing under AI narration, is the problem really that "the machine doesn't get it"—or did it just expose where your prose was muddy to begin with?

Going Deeper
Doesn't "write for the ear" clash with "save the reader time"—spoken style is often longer and more repetitive?
On the surface, yes; in fact, no. The "wordiness" of spoken style comes in two kinds. One is meaningless filler ("you know," "and so on")—cut it. The other is redundancy in service of understanding—repeating the key line, guiding questions, verbal recaps. They trade words for the ease of "no need to look back." A reader saves time by rereading; a listener, who can't reread, saves time through these signposts. So the test is still "for whom": words added so the listener won't get lost are an investment; words added so your mouth runs smoothly are fat. Prose chases value per word; audio chases a signpost at every step—both aim to spare the reader effort, just in different places.
Is sentence rhythm the same mechanism in Chinese and English?
Same principle (variation makes music), different mechanism. English rhythm rides on syllables and stress—the audible heavy and light; Hemingway's short sentences hit partly through the pounding of monosyllables. Chinese is monosyllabic blocks, so its rhythm lives more in pauses (punctuation, clause breaks) and the parity of sound-groups—the steadiness of a four-character phrase, the symmetry of parallelism, the lilt of a "3-3-4" cadence. So in English you often tune word length and stress placement; in Chinese, clause length and four-character/parallel structures. Underneath it's one thing: give the sentence rise and fall, don't let it become a metronome.
If AI can read any text aloud, does a writer still need to fuss over "the ear"?
More than ever. AI solved voicing, not sounding good or being clear. A machine can pronounce "QPS 12,847," but it's still noise; it can read a tangled long sentence, and the listener still gets lost. In other words, TTS made "being heard" the default fate while leaving the whole question of "worth hearing" to the prose itself. And AI narration is a mirror that exposes flaws: it won't auto-chunk, fill in, or forgive the way a human reader does—wherever you were muddy, it reads the muddiness faithfully. So in the AI era "writing for the ear" isn't obsolete; it has gone from a few speakers' craft to every writer's basic skill.
Repetition is a virtue in a speech and a vice in prose—why does the same move flip?
Because the two media give the reader different "memory." A reader has "external storage"—the words sit on the page, forget one and you flip back, so repetition is redundancy. A listener has only "internal memory"—sound flows away and is gone, so repetition writes the key point into memory; it's a necessary refresh. King's eight "I have a dream"s look slightly excessive on the page but in the ear are nails, hammered one by one into history. The move isn't good or bad in itself—whether the reader can look back decides whether repetition is a burden or a gift.
How does this "ear" craft apply across prose, speech, and video?
Prose: the ear is mainly an editor—read aloud to find awkwardness and tune long/short rhythm, but the product still serves the eye and may keep visual structure. Speech: the ear is the only channel, so go fully spoken—break clauses, ask questions, repeat, leave pauses; after drafting you must stand up and say it once. Video/audio: add a layer of picture-and-sound coordination, keeping what the eye sees from fighting what the ear hears, and an audio script must purge every visual reference. One thread runs through all three: the more linear and unrewindable the medium, the higher the payoff of writing for the ear—prose is elective, speech is required, pure audio is do-or-die.