Max Woolf

Senior Data Scientist at BuzzFeed in San Francisco // AI content generation ethics and R&D // plotter of pretty charts https://minimaxir.com

Articles & links

son of a bitch blog.google/innovation-a...

DiffusionGemma: 4x faster text generation blog.google
AI Weekly's analysis
  • DiffusionGemma generates 256 tokens per forward pass using bidirectional attention, reaching 1,000+ tokens/sec on a single H100 GPU.
  • With only 3.8B active parameters during inference and an 18GB VRAM footprint when quantized, it runs on consumer hardware without server-grade resources.
  • Google recommends DiffusionGemma only for speed-critical workloads like in-line editing and code infilling, not for applications requiring maximum quality.
View on Bluesky · ♥ 226 ↻ 23 ↩ 4 · 6 from the directory shared this · 8d ago

Recent commentary

Two things can be true simultaneously: a) Modern LLMs can count the number of letters in a word despite the counterintuition of tokenization b) Google Search Overview's LLM can fail to count the number of letters in a word because it's a quantized LLM. There's a nuance that tbh no one cares about anymore.

View on Bluesky · ♥ 73 ↻ 8 ↩ 6 · 22d ago

It's both funny and logical that OpenRouter is now the place for running Chinese OSS LLMs without jumping through hoops.

View on Bluesky · ♥ 40 ↻ 0 ↩ 3 · 4d ago

the fact that more people are defending the jqwik secret message to get AI to delete itself than are condemning it is concerning.

View on Bluesky · ♥ 24 ↻ 0 ↩ 5 · 20d ago

I kinda want to see what would happen (both technically and community-wise) if an agentic LLM ported WordPress from PHP to Rust.

View on Bluesky · ♥ 19 ↻ 0 ↩ 3 · 21d ago

The one objectively good thing about the Claude Fable 5 launch is that it implies OpenAI will release GPT 5.6 in a couple days.

View on Bluesky · ♥ 15 ↻ 0 ↩ 1 · 9d ago

First time I've seen a disclaimer like this on a tool with obvious AI-generated copy.

View on Bluesky · ♥ 8 ↻ 1 ↩ 3 · 35d ago

The new Siri AI features are clearly using some sort of search engine (Google), so I wonder how strong Apple's privacy guarantees are.

View on Bluesky · ♥ 11 ↻ 0 ↩ 2 · 10d ago

A fun pricing nit with agentic LLMs on OpenRouter: in DeepSeek V4 Flash's case, the choice of LLM provider is extremely relevant, because DeepSeek itself is the cheapest by a large margin thanks to its extremely cheap caching. Unfortunately, downstream agent clients may not let you choose a specific provider.

View on Bluesky · ♥ 8 ↻ 0 ↩ 2 · 33d ago

Asked GPT-5.5 to make a WEBP encoding workflow from scratch while keeping outputs perceptually similar to the source image and it did it in 10 minutes lol (15% smaller, 10x faster encoding)

View on Bluesky · ♥ 4 ↻ 0 ↩ 2 · 24d ago

asked GPT 5.5 to find all grammatical errors in my short blog post for tomorrow and it found over 100 of them

View on Bluesky · ♥ 2 ↻ 0 ↩ 2 · 23d ago
