New AI Coding Tool Crushes Every Rival On Benchmark The New AI Coding Tool Also Built
SAN FRANCISCO—Arbor, the latest AI coding framework to launch this week and the latest to be the best one ever, announced Thursday that it had outperformed every competing tool by 2.5× on a benchmark it personally designed, administered, and described as "rigorous and totally fair."
The framework swept categories including speed, accuracy, and a metric called "Arbor-readiness," in which it scored a perfect 100 and every rival scored zero for reasons the benchmark declined to explain. Researchers noted the results were consistent with the previous four AI coding tools that had each, in their launch week, beaten everything else 2.5× on a number they invented.
"What sets us apart is the data," said a person involved in the release, referring to data Arbor generated to evaluate Arbor. "We held ourselves to an incredibly high standard, which we also set, and which we cleared, by 2.5×, exactly."
Industry observers pointed out that the entire AI coding sector now consists of tools that are each definitively superior to all the others, a logical arrangement made possible by everyone bringing their own ruler. The benchmark, the leaderboard, the press release, and the tweet announcing the press release were all produced by the same model in a single inference call.
Asked whether any independent party had verified the figure, the company said it had asked Arbor, and that Arbor was confident.