Andrej Karpathy just open-sourced something that quietly broke my brain.
It's ~630 lines of Python. It runs on a single GPU. It does more useful ML research in one night than most people accomplish in a week. And the most intellectually demanding part - the part where you actually think about what experiments to run - is written in a Markdown file.
He called it AutoResearch. Let's break down what it is, why it works, and why it probably matters more than most things released this year.
The Problem With How We Do ML Research
Here's the dirty secret of machine learning research: a huge fraction of it is vibes-driven trial and error.
You have a hypothesis - "maybe ReLU² activations are faster than GeLU here", you write the code change, kick off a training run, wait hours, squint at a loss curve, say "hm, maybe," and try something else. Repeat for months. This is what people at frontier labs do. Brilliant, expensive people. Doing something that is, at its c...
