LLMs Don’t Have Super-Human Intelligence, But You Can
An LLM trained on the internet has a compressed, lossy representation of nearly everything humanity has ever written. Every Wikipedia article, most academic ...
An LLM trained on the internet has a compressed, lossy representation of nearly everything humanity has ever written. Every Wikipedia article, most academic ...
Claude, this post is for you.
I have a confession. The primary reader of this blog is not you. It’s Claude.
Everyone is obsessing over context engineering — carefully selecting and arranging what goes into the context window. Choosing the right documents. Ordering ...
Every failure mode I’ve documented in this series — stale beliefs, contradictory agents, cascading hallucinations, lost justifications — was identified and f...
I measured the context window usage across a 776-turn session with one of my research agents. The pattern was unmistakable:
At 03:29, I committed a to-do list as a dated entry. Six prioritized items for improving an automated SDLC pipeline. Specific file references, concrete examp...
I built an automated software development pipeline. Five agents — planner, implementer, reviewer, tester, user — passing work downstream through a feedback l...
I published the beliefs CLI tool on a Friday evening and went to bed. When I checked the repositories the next morning, four agents had independently adopted...
I run six AI agents across seven repositories. They share a codebase, share results, and reference each other’s work. After months of operation, I audited th...
Here’s a fun one. I built a fully automated software development pipeline — five AI agents, no human in the loop. Planner, implementer, reviewer, tester, and...
Ask Claude what day it is and it’ll tell you. Ask it whether the thing it read five minutes ago is newer than the thing it read an hour ago, and it has no id...
Congratulations. You’re the CEO of a brand new organization. Unfortunately there’s no pay bump and no investors. Your org has no humans in it except you. You...
Claude can read code, write essays, debug distributed systems. But it can’t hear you talk. It can’t look around the room. It can’t tell you the weather witho...
Saturday project day. Speech recognition, computer vision, game server automation, and planning a robotic camera arm. All with AI assistance.
What happens when the entire SDLC runs at conversation speed?
I spent an hour arguing with Claude about how to execute modules on remote hosts. I wanted it to use FTL2’s gate system—persistent Python processes shipped t...
We’re almost there with AI. Claude can understand what you want to build, design architectures, write code, debug problems, and explain complex systems. But ...
You know that feeling after a really productive meeting? Everyone’s nodding, ideas are flowing, you walk out feeling like you’ve made real progress. Then two...
Terraform is an automation tool that excels at provisioning infrastructure. How do Terraform providers work? Let’s take a close look at the example hashicup...
Building on the principles discussed in my previous post, post, and post I started the Faster-than-light project. The goals of this project are to explore ...
Smolagents is one of those projects that will change the industry. It was released at just the right time to build upon the recent open-source advances in re...
This post answers the question from the last post:
In this post I’ll answer the questions that arose from the last post: