antirez.bsky.social

Reproducible bugs are candies 🍭🍬 I like programming too much for not liking automatic programming.

Recent commentary

What Europe should do right now: 1. Call all the European researchers working on AI and return them back with same salary (or they can stay but switch career). 2. Fill EU places having GPUs with money, and put those people there. 3. AI partnerships with China + India.

View on Bluesky · ♥ 149 ↻ 23 ↩ 7 · 5d ago

Another important thing: Chinese models are not strong because they distill US models. Distillation of models via API is *impossible*. If somebody tells you the contrary, they don't understand machine learning:

View on Bluesky · ♥ 96 ↻ 11 ↩ 11 · 3d ago

Today I had an harder than usual question for my local model (security). With SSD streaming now DwarfStar can run DeepSeek v4 PRO at 4.15 t/s, and this was more than enough to get a detailed reply. I already feel "safer" than before in my AI future. M5 max 128GB, model 433GB.

View on Bluesky · ♥ 77 ↻ 3 ↩ 4 · 2d ago

DwarfStar now supports SSD streaming in the DGX Spark and Strix Halo, not just in Metal. You can run the Q4 quants at decent speed, and even DeepSeek v4 PRO at low speed, or you can run Q2 Flash if you have less than 128GB.

View on Bluesky · ♥ 50 ↻ 3 ↩ 4 · 3d ago

Simple way to evaluate an AI frontier lab ethics / company culture: attitude towards the user base in the time spans they have the best model.

View on Bluesky · ♥ 47 ↻ 5 ↩ 1 · 8d ago

While Fable is an amazing model, don't get to excited: it is great, but still has the usual failure models of the other good LLMs we saw in the past, including GPT 5.5. If you look at Anthropic, Opus -> Fable was a huge jump. If you look at the field, GPT 5.5 -> Fable is incremental.

View on Bluesky · ♥ 45 ↻ 1 ↩ 4 · 8d ago

OpenAI may delay GPT6 (or even 5.6) before making sure could not be blocked like Fable. Or they could play it smart, publishing only the benchmarks that show the improvements on certain area, providing a very censored model in the cyber-security side, and cross their fingers.

View on Bluesky · ♥ 15 ↻ 0 ↩ 1 · 1d ago

In antirez.bsky.social's orbit

Center = antirez.bsky.social. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.