
My brain refuses to work at the same speed all day. Some tasks I zip through on autopilot. Others need me to actually sit down, close my tabs, and think. I used to get annoyed at that inconsistency — like, why can't you just be consistent?
Then I noticed Claude doing something similar. And unlike me, it has a dial for it.
If you've opened claude.ai recently with Opus 4.8 selected, you might have spotted a small control sitting next to the model selector. Easy to miss, easy to ignore. I almost did. But I've been watching it for a few weeks now, and here's what I've noticed: matching it to what you're actually asking for makes a real difference — not always in the quality of the answer, but in the feeling of the response. Whether it's snappy and direct, or slow and thorough.
That's what Claude Opus 4.8 effort control does. And I want to write about it not as a technical feature, but as a small decision I make before I start typing.
The idea is simpler than it sounds. When Claude responds to you, it can spend different amounts of "thought" on the answer — deciding when to reason step-by-step and when to respond more directly. The effort control lets you choose how much effort Claude puts into a response. On higher settings, Claude thinks more frequently and more deeply. On lower settings, it responds faster and uses rate limits more slowly.

In claude.ai and Cowork, the control sits next to the model selector. The default is high. You can bump it up to "Extra" or "Max." In Claude Code and the API, the same dial has five named levels: low, medium, high, xhigh, max. What claude.ai calls "Extra" maps to xhigh in Claude Code.
It's worth knowing that the default — High — is what you've always been getting. This is Anthropic making the existing behavior visible, and giving you the ability to turn it up or down.
The way I've come to understand it: effort isn't about Claude being "more capable" at high settings, or "lazy" at low ones. It's about matching the depth of reasoning to the kind of question being asked.
Research on how people process information backs this up intuitively. Cognitive load theory has long shown that not all tasks benefit from maximal mental engagement — simple tasks can actually be harmed by over-thinking, while complex multi-step problems need structured reasoning to reach a reliable answer. Claude's effort setting manages that same principle.
Here's how I've started mapping my questions to effort levels.
"What's the capital of Iceland?" "Can you rephrase this sentence?" "Give me a one-paragraph summary of X."
These don't need deep reasoning. They need a quick, accurate, direct response. At medium or even low effort, Claude handles these well — and responds faster. I don't mean seconds vs. minutes, but there's a snappiness that feels right when I'm just grabbing a quick answer. Using high effort here is like hiring an expert consultant to look up something in a dictionary. The answer is correct, but it's a lot of setup for a simple lookup.
"Help me think through whether I should restructure my content calendar for Q3." "What are the tradeoffs between these three approaches?"
These land somewhere in the middle. There are real considerations to weigh, but they're not unknowable — they just need some structure. Medium to high effort tends to work well here. Claude lays out the relevant angles, doesn't over-elaborate, and gets to a useful recommendation without writing a dissertation.

"Review this 2,000-word article draft and flag anything that feels logically off." "Help me plan a complex six-part project."
This is where high effort — or extra — starts to earn its keep. The task genuinely has enough moving parts that surface-level pattern-matching could miss something. Having Claude think more carefully before responding means fewer things to catch in revision.
The intuition is pretty simple: more thinking helps when the cost of a shallow answer is higher than the cost of waiting slightly longer.
When I'm genuinely not sure which direction to go — on a topic, a framing, a decision — I want Claude to sit with the complexity rather than snap to the most obvious answer. Higher effort (or extra) means it's more likely to notice the tension, name it, and give me something I can actually work with.
Anything that involves sequencing — "here's what to do first, then this, and if X happens then Y" — benefits from effort control Claude at high or extra. These tasks require internal consistency across the response. The plan needs to hold together, and lower effort settings can produce plans that look fine but have gaps when you actually try to follow them.
When I ask Claude to rewrite something sensitive — a message I need to get right, a paragraph where tone matters — high effort produces noticeably better results. It reads more carefully, holds more context, and makes fewer "technically fine but misses the point" substitutions. The difference isn't dramatic, but for something where I'll read the output ten times anyway, it matters.
The flip side: there are plenty of cases where Claude Opus 4.8 thinking at full depth is genuinely overkill.
"Give me the main points from this." "One-line description of each of these five topics."
At medium effort, these come back fast and clean. At high effort, the response sometimes adds nuance I didn't ask for — more complete, but not actually more useful for what I needed.
"Give me ten names for a blog post about morning routines."
I want volume and variety, not deep deliberation. Medium or even low effort is fine here. Claude brainstorms freely without second-guessing each suggestion, which is exactly what I want.
Quick notes, social captions, reminder messages. These don't need much thought — they need to be done. Claude effort setting at medium handles them well. High effort can produce a more polished response, but "more polished" isn't the goal when I'm batching through quick tasks.
Here's what I've been sitting with. The effort control isn't just a technical lever — it's a prompt for me to think about what I'm actually asking for before I type it. The Claude AI effort dial, in practice, is really a dial for my own attention.
There's a reason that research on AI-human collaboration has started paying attention to how task framing affects both output quality and user satisfaction. The effort I put into defining the task shapes the effort that's actually useful from the model. A vague question at high effort often produces a thorough answer to the wrong thing. A well-framed question at medium effort can be exactly right.

What I've started doing is treating the effort selector as a small moment of intention-setting. Not a process — just a second where I ask myself: does this need deep thinking, or does it need to be done? That pause has made my AI use feel less like opening a search engine and more like working with something that adapts to the task.
It's subtle. But subtle is often what sticks.
The effort parameter controls how eager Claude is about spending tokens when responding — trading off between response thoroughness and efficiency. In simple terms: it tells Claude whether to reason carefully through your question or respond more directly. The control is available in claude.ai next to the model selector, on all plans.
The official effort levels are: low, medium, high (the default), xhigh (shown as "Extra" in claude.ai), and max. The complete documentation is on the Anthropic effort page, worth a quick look if you want the technical detail.

Start from what you're trying to do. Quick lookups, summaries, and drafts → medium. Analysis, planning, careful writing → high. Complex multi-step reasoning where getting it right really matters → extra. You don't need to think of it technically — just ask yourself whether the task needs quick execution or careful thought.
No — this is worth holding onto. Higher Claude thinking strength means more reasoning before the response. For simple tasks, that extra reasoning can make the answer feel over-complicated without adding real value. Higher effort is appropriate when the task genuinely requires deep reasoning. Low and medium are right for most routine tasks.
When speed and directness matter more than depth. Quick questions, bulk tasks, brainstorming, lightweight drafts. Use low effort when you're optimizing for speed — quick lookups, simple classification, high-volume tasks where marginal quality improvements don't justify the extra time. For most everyday AI habits, medium is probably the right default.
Previous posts: