Blog
What Learning Zig Taught Me About Harness Engineering
Months of building in Zig changed what I reach for first when I hit an AI engineering problem. The constraints turn out to rhyme.
Forks and Locks: Why Smarter Decoding Can't Fix a Model's Distributions
I tried to make a code model smarter at inference time. Adaptive decoding, entropy thresholds, the works. It made things worse. Then self-distillation fixed it in four minutes.
Lossless Compaction for AI Agents
Claude Code, Codex, and Cursor all handle context limits differently. I tested an alternative, pointer-based compaction, and it beats summarization by 18 points on grounding after two compactions.
Stop Multitasking. Parallel Agent Workflows Are Making You Slower And Burning You Out
I've tested every multi-agent orchestration setup. Conductor, Cursor, Claude Code Max. I was faster single-threaded. The cognitive science on this runs decades deep.
59% More Code. 30% More Failures. The Toolchain Bottleneck Is Here.
AI-generated code is flooding pipelines. Main branch success rates hit a five-year low. The Rust rewrite wave accidentally built the verification infrastructure agents need.
Anthropic Quietly Killed Three SaaS Categories This Week
Most coverage focused on individual features. Nobody zoomed out to see what Anthropic did across the full week: absorb three categories of SaaS tooling into their own surface area.
What's Left When You Unplug?
There is now a permanent gap between who you are with AI and who you are without it. And nobody knows which one is real anymore.
Your Agent's Safety Net Is an If-Statement. Mine Is a Proof.
Two weeks ago, security researchers found over 1,800 exposed OpenClaw instances. Every vulnerability maps to the same failure mode - a code path that didn't hit the check. Petri nets fix this.
You're Already Building Petri Nets. You're Just Building Them Badly.
There's a formalism from 1962 that solves a problem you hit every month. Nobody told you about it.
-
What Learning Zig Taught Me About Harness Engineering
Months of building in Zig changed what I reach for first when I hit an AI engineering problem. The constraints turn out to rhyme.
-
Forks and Locks: Why Smarter Decoding Can't Fix a Model's Distributions
I tried to make a code model smarter at inference time. Adaptive decoding, entropy thresholds, the works. It made things worse. Then self-distillation fixed it in four minutes.
-
Lossless Compaction for AI Agents
Claude Code, Codex, and Cursor all handle context limits differently. I tested an alternative, pointer-based compaction, and it beats summarization by 18 points on grounding after two compactions.
-
Stop Multitasking. Parallel Agent Workflows Are Making You Slower And Burning You Out
I've tested every multi-agent orchestration setup. Conductor, Cursor, Claude Code Max. I was faster single-threaded. The cognitive science on this runs decades deep.
-
59% More Code. 30% More Failures. The Toolchain Bottleneck Is Here.
AI-generated code is flooding pipelines. Main branch success rates hit a five-year low. The Rust rewrite wave accidentally built the verification infrastructure agents need.
-
Anthropic Quietly Killed Three SaaS Categories This Week
Most coverage focused on individual features. Nobody zoomed out to see what Anthropic did across the full week: absorb three categories of SaaS tooling into their own surface area.
-
What's Left When You Unplug?
There is now a permanent gap between who you are with AI and who you are without it. And nobody knows which one is real anymore.
-
Your Agent's Safety Net Is an If-Statement. Mine Is a Proof.
Two weeks ago, security researchers found over 1,800 exposed OpenClaw instances. Every vulnerability maps to the same failure mode - a code path that didn't hit the check. Petri nets fix this.
-
You're Already Building Petri Nets. You're Just Building Them Badly.
There's a formalism from 1962 that solves a problem you hit every month. Nobody told you about it.
-
Your Interview Process is a Liability
Testing engineers without the tools they use every day is a business liability. It actively selects against the future. Here is the fix.
-
A Student Disproved a 40-Year-Old Conjecture. I Implemented It.
A student revisited an old paper 'just for fun' and accidentally disproved a 40-year-old conjecture. I decoded the maths and built it - 4x faster than Zig's std.HashMap at 99% load.
-
Speed Vertigo: A New Kind of Engineering Debt
It's not imposter syndrome. It's being over-leveraged in the code you shipped.
-
Maya: A Multi-Agent Architecture for Conversational AI
My AI yoga instructor had a 2000-token monolithic prompt and kept forgetting what it was doing. I split it into five agents. Each one got smaller, faster, and more reliable.
-
The AI Creativity Paradox: Why Smarter Isn't Always More Creative
I built an AI yoga instructor. The smarter she got at finding existing sequences, the less she created new ones. That's a problem worth thinking about.
-
AIgument: The Art of AI Disagreement – Developing a Language Model Debate Platform
I built a platform where LLMs debate each other with custom personalities and adjustable spiciness. A Sassy Drag Queen vs a Noir Detective at Extra Hot is worth watching.
-
Building a Live London Underground Tracker: Learning Go From Scratch
How I learned Go by building a real-time London Underground tracker, featuring concurrent polling, WebSockets, and lots of debugging.
Subscribe to my newsletter
Get notified when I publish new posts. No spam, unsubscribe anytime.
Powered by Buttondown