
Hey, I'm Anna.I didn't plan to "use Codex and Opus together." I just wanted my little weekend script to stop breaking every time I renamed a folder. I opened a new chat out of mild annoyance, not ambition. And then something quiet clicked: when I let a reasoning‑first model shape the plan and a code‑first model do the typing, the work felt… lighter. Not faster at first, but less mentally heavy. Enough to keep going.

I'm not building a system or chasing perfect automation. Most days, I'm smoothing small frictions: a script that files PDFs, a nudge for a weekly review, a one‑off parser so I don't do the same copy‑paste dance again.
Here's what I noticed, after a few short runs across a handful of tasks: pairing a strong reasoning model (Claude Opus via the Anthropic console) with a code‑centric model (what many of us still shorthand as "Codex," though OpenAI's original Codex is retired, these days it's a code‑forward GPT or Copilot) felt natural. Not a grand workflow. More like passing a note across the table.
Opus handled the fuzzy parts: constraints, weird edge cases, "what am I missing?" It didn't rush me. The code model handled the literal parts: function signatures, import paths, unit tests. When I tried to make either do both, I spent more time correcting tone shifts, either overthinking in code, or under‑thinking in planning.

I'm stating the obvious, but it matters: tiny projects have phases even when you do them in one sitting.
Opus was best before and after. The code model was best during. When I respected those roles, I noticed fewer "why did I do it that way?" moments later.
I kept this simple: one chat with Opus for planning, one thread with the code model for building, then back to Opus for a sanity pass. No fancy orchestration, no plugins I had to babysit.
A real example: I had a folder of monthly statements named semi‑chaotically (Jan_2025, 2025-02, "March 2025 final"). I wanted a script to normalize names and move files into a year/month structure without touching any PDFs that were already tagged "reconciled."
What happened:
None of this felt "automated." It felt like pairing with two coworkers who are good at different things and don't mind short, specific asks.
Our Macaron is designed to help you solve the problem of dealing with such fragmented tasks: Every note, every small script reminder, can be directly transformed into an actionable next step. Even temporary ideas won't be left in folders or chat records.
Try Macaron for free now!

That's it. I tried pasting entire folder trees and long rants: it didn't help. Tight inputs, better outputs.
The chaos shows up when I blur roles or lose the thread. A few things that kept me sane:
Also, small note on reality: OpenAI's classic Codex is gone: what I'm calling "Codex" here is just the code‑forward model in tools like GPT or GitHub Copilot.

Plenty of days, I don't split it. If I'm writing a tiny bash alias or renaming ten files, the code model alone is fine. If I'm sense‑making, drafting a weekly note, deciding what "done" looks like for a mini‑project, Opus alone is calmer.
I'll keep using Codex and Opus together, casually. When the plan is fuzzy, the code is literal, and my attention is thin, the handoff helps. I'm curious whether this stays true the next time I forget to save a dry‑run and have to explain to myself why three files mysteriously vanished.