Bloated: how chat made you fat

By Iain,

A hand holding a fountain pen drawing squiggles on a sheet of paper, on a teal background

The pitch for writing with a language model is that it saves you time: you describe the memo, the model produces it and 90 seconds later you have four pages (okay, maybe forty) instead of a blank document. Someone still has to read those pages though. The model did not remove that work. It just moved it downstream to your colleagues or suppliers, and on the way it produced more than any human would have ever bothered to write.

A person writing badly under time pressure tends to write too little. They run out of patience before they run out of points. A model has the opposite failure. It never runs out of patience and it has no deadline. So it gives you the paragraph. Then it gives you the caveat on the paragraph, the worked example beneath the caveat, and a closing section that restates everything above it in case you skipped to the end. Everything and the kitchen sink goes in. The model has no view on what you could safely leave out, because forming that view is a task it was never trained to do.

Why the machine pads

The padding is not a flaw in any one model. It is what the training rewards. Modern assistants are tuned on preference data, where a human assessor looks at two answers and picks the better one. Put a thorough answer next to a terse one and an assessor skimming for completeness will choose thorough almost every time. Length reads as a proxy for effort. Caveats read as care. The hedge is never marked wrong, because a sentence that commits to nothing cannot be contradicted.

When researchers at UT Austin and Princeton took RLHF apart in 2023, they found that most of the reward gain came down to a single variable, the answer getting longer, and that a reward linked to length alone reproduced most of the improvement the fine-tuning was supposed to deliver. Train on enough of those judgments and you get a system that has learned, correctly, that more, not less, is safer and better.

Brevity is evidence that someone decided

A short document is hard to write because brevity requires making difficult choices. To cut a paragraph you have to know it does not earn its place, and to know that you need a view about who the reader is and what they are specifically looking for. Every omission is a small act of judgment. This is why a tight memo is rare and faintly intimidating to receive. It is the output of someone having worked out exactly what to exclude.

A model dumping everything into the answer is telling you, plainly, that it has not done this. It has not decided, because deciding means risking being wrong about what mattered, and the safe move is to hand over the lot and let you sort it out. The discernment becomes the new work. What lands in your inbox looks finished but is in fact merely unedited, which is a worse state than a rough human draft, because at least the rough draft knows it is rough. What we are now seeing routinely is voluminous unedited model outputs being passed on verbatim, masquerading as the finished product.

None of this is new

The complaint is older than the technology. In 1940, with considerably more at stake than a status update, Winston Churchill sent the War Cabinet a memo titled “Brevity”, objecting that the papers everyone had to read were far too long, that this wasted time, and that the discipline of setting out the important points concisely was an aid to clear thinking in its own right. He had spotted the mechanism. Bad length is more than tiring to read. It is a sign the writer has not finished thinking.

The American military codified bottom line up front, the rule that the main point goes in the opening line and the background waits its turn, because a commander should not have to reach page four to find out what the memo wants or suggests. Barbara Minto handed consultants the pyramid, a single governing thought up top with the support stacked underneath. And Amazon banned slide decks from its decision meetings in favor of a maximum six-page narrative read in silence at the start of the meeting.

Every one of these techniques was a response to humans being verbose. The machine has not introduced a new disease. It has just automated an old one, stripped out the friction that used to limit it, and handed everyone a powerful pump to fire this bloat out into the ether.

If slop describes generic, low-effort generative output, perhaps bloat is its textual counterpart, slowing you down as you wade through a 20-page document that should have been two. Netted out across author and reader, any efficiency gains are lost.

The fix is a wall, not a wish

You will not get a leaner document by asking the model to be concise and hoping for the best. Concise is a direction, not a constraint, and the prose quickly drifts back to safe and long inside a paragraph. The lever that works is a structure imposed from outside. Decide the form before it writes a one-pager or a memo, and give it the ceiling as a hard number. Make it state the recommendation in the first line, so the bloat has nowhere to hide. Push the supporting detail into an appendix recipients are free to ignore.

The word count is beside the point. What a ceiling does is force the model into a decision it would otherwise dodge, because once space runs short something has to be left out, and leaving things out is the whole point.

It helps to remember the time you save generating a document is not free. It is borrowed from every person who has to read it, at interest, and the longer the distribution list the worse the rate of return. A model will always offer you seconds. It will always have another paragraph ready, another caveat, another section that could go in. The discipline, the entire discipline, is being willing to push the plate away while there is still food on it.

More from the blog

  • Apple’s bicycle without a chain

    By Iain,

    Steve Jobs described the computer as a bicycle for the mind. Apple Intelligence so far is more like a bicycle with no chain. The frame is gorgeous, and the engineering is extraordinary, but you cannot get far with it.In early 2025, Xe Iaso published a piece that landed like a brick through a window in the Apple developer community. The argument was simple and damning: Apple had built the holy grail of trusted compute with Private Cloud Compute, a genuinely unprecedented piece of security infrastructure, only to fill it with half-baked notification summaries and an image generator that produce…

  • Weeknotes vol. 17: business, schmizness

    By Garrett,

    Hello and happy casual weeknotes Friday.I stopped writing these about a year ago when I began the transition into consulting (solving fun and challenging problems), and to say a lot has changed since then would be the understatement of the century.In summary: Iain joined full time, we’re helping people solve operational problems and optimize their work across pretty much all aspects of business, and we’re having a lot of fun doing it. Iain has his masters in AI for Business, which has pushed me to go down the biggest rabbit hole I’ve been down since HTML/CSS in college (and we know where that…

  • The ten trillion dollar gamble

    By Iain,

    In November 2025, on stage at the Wall Street Journal’s Tech Live event, the chief financial officer of OpenAI was asked how her company planned to honor roughly $1.4 trillion in compute contracts on $13 billion of revenue. Sarah Friar said she was looking to assemble a network of banks, private equity, and a federal “backstop” or “guarantee.” By the following evening, she had posted to LinkedIn explaining that “backstop” had muddied the point, that what she meant was something more like a public-private partnership, and that the United States government has been “incredibly forward-leaning” …

  • Never talk about goblins

    By Iain,

    Buried in a JSON file that OpenAI posted to GitHub recently, inside the configuration for its newest coding agent, sits an instruction that reads like a footnote written by someone losing their composure. “Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query.” The line appears more than once. Whoever wrote it wanted to be sure the model understood.Most readers, including ones who follow AI closely, may be unaware of what a “base instruction” is, where it lives, or why anyone a…

  • Read the frickin’ manual: the end of the user interface

    By Iain,

    When a new sales rep joins a company, whilst manuals and process documents probably exist somewhere in varying degrees of obsolescence, in practice the rep just asks where the opportunity stage field lives. Someone on the team shows them. A few months in, the same rep is teaching the next hire. The institutional knowledge of how to use the CRM reproduces itself like a folk song.But agents do not pick up folk songs. They read the documentation, the tool description, the parameter schema, and whatever else has been put in front of them in machine-readable form. If a piece of operating knowledge…

All blog posts