sensible: "makes the model speak less like a polite chatbot & more like a terse tool … Same substance, fewer words. In my evals, Caveman cut output tokens by roughly 65–75% versus default verbose output, & still beat a normal ‘be concise’ instruction" https://t.co/HCDF…
AI Weekly's analysis
→
- A plugin called caveman, written by Julius Brussee in early April, strips verbose model output and cut tokens by roughly 65 to 75 percent in his tests.
- Shayne Sweeney, OpenAI's director of engineering, contributed code to caveman to support Codex, and developers at Nvidia and GitHub are reportedly using it.
- GitHub shifted to per-token billing in April, Uber blew through its entire AI budget in four months, and Legrand's internal memo points staff at caveman.
Read full analysis →