12/13/2024

Thread

Tweet 1

The only benchmark that matter is is the agent writing itself. If it's not then it doesn't work.

---

@paulgauthier's "aider wrote x% of this release" is the most important number in the field

---