Max Woolf

Senior Data Scientist at BuzzFeed in San Francisco // AI content generation ethics and R&D // plotter of pretty charts https://minimaxir.com

Articles & links

New (short!) blog post up: on OpenRouter's AI Model Rankings, I noticed a peculiar new LLM topping the rankings by a large margin: Hy3. I looked into the data and only became more confused. minimaxir.com/2026/05/open...

View on Bluesky · ♥ 12 ↻ 1 ↩ 1 · 3 from the directory shared this · 3d ago

Recent commentary

Two things can be true simultaneously: a) Modern LLMs can count the number of letters in a word, counterintuitive as that is given tokenization; b) Google Search Overview's LLM can fail to count the number of letters in a word because it's a quantized LLM. There's a nuance that tbh no one cares about anymore.

View on Bluesky · ♥ 73 ↻ 8 ↩ 6 · 1d ago

the fact that more people are defending the jqwik secret message to get AI to delete itself than are condemning it is concerning.

View on Bluesky · ♥ 22 ↻ 0 ↩ 3 · 18h ago

I kinda want to see what would happen (both technically and community-wise) if an agentic LLM ported WordPress from PHP to Rust.

View on Bluesky · ♥ 19 ↻ 0 ↩ 3 · 1d ago

First time I've seen a disclaimer like this on a tool with obvious AI-generated copy.

View on Bluesky · ♥ 8 ↻ 1 ↩ 3 · 15d ago

A fun pricing nit with agentic LLMs on OpenRouter: in DeepSeek V4 Flash's case, the LLM provider is extremely relevant, since DeepSeek is the cheapest by a large margin because its caching is extremely cheap. Unfortunately, downstream agent clients may not let you choose a specific provider.

View on Bluesky · ♥ 8 ↻ 0 ↩ 2 · 12d ago

Asked GPT-5.5 to make a WebP encoding workflow from scratch while keeping outputs perceptually similar to the source image, and it did it in 10 minutes lol (15% smaller, 10x faster encoding)

View on Bluesky · ♥ 4 ↻ 0 ↩ 2 · 4d ago

Asked GPT-5.5 to find all grammatical errors in my short blog post for tomorrow, and it found over 100 of them

View on Bluesky · ♥ 2 ↻ 0 ↩ 1 · 3d ago