Blog

2026

Self-Harness: Harnesses That Improve Themselves
Self-Harness is best read as evidence-driven harness versioning: mine failed traces, propose bounded scaffold edits, and promote only changes that survive regression tests.
10min read · June 15, 2026
2026 · self-harness · harness-engineering · agents · research
Codex Source Dive (I): the agentic loop is a runtime boundary
TLDR: A Codex turn is not one model call. It is a managed execution window where user input, tool calls, tool results, cancellation, compaction, and final answers are ordered by the runtime.
11min read · June 11, 2026
2026 · codex-source-dive · agent-runtime · agents · systems
Codex Source Dive (II): goals are runtime state, not prompts
TLDR: A Codex Goal is a thread-level long-running task state machine. It stores objective, status, budget, usage, resume state, and continuation gates instead of relying on one remembered prompt.
11min read · June 11, 2026
2026 · codex-source-dive · agent-runtime · agents · systems
Codex Source Dive (III): subagents are a thread tree
TLDR: A Codex subagent is not a background model call. It is a persistent child thread with identity, inherited runtime policy, forked context, mailbox communication, capacity limits, and resume behavior.
13min read · June 11, 2026
2026 · codex-source-dive · agent-runtime · agents · systems
A reading stack for the AI-coding era
TLDR: As code generation gets cheaper, the scarce skills move toward judgment: architecture, strategy, systems thinking, safety, measurement, customer truth, organization, and cooperation.
7min read · June 10, 2026
2026 · ai-coding · reading-list · reading
The Mythical Man-Month: Chapters 13-End
TLDR: In the second half of The Mythical Man-Month, Brooks closes the argument around integration, milestones, documentation, no silver bullet, incremental development, and the architect role.
20min read · June 10, 2026
2026 · mythical-man-month · software-engineering · reading · systems
RLM: Recursive Language Model
TLDR: RLM's real insight is not recursion as a slogan. It moves long context out of the Transformer window and into an external environment that the model can inspect, slice, search, and delegate over.
8min read · June 10, 2026
2026 · recursive-language-model · agent-systems · reading · agents
Apodex-1.0: deep research as multi-agent verification
TLDR: Apodex-1.0 is most interesting as a verification-centric agent-system design: independent subagents explore, a shared report pool accumulates evidence, and verifier agents audit claims from outside the worker trace.
3min read · June 9, 2026
2026 · deep-research · agent-systems · reading · agents