Ramon Astudillo
Articles & links
👆 A paranoid LLM is ofc worse. This is just tuning a prior belief up or down. I guess you could self distill additional context for the train data e.g. "you know arxiv.org is such and such" or "this is an unknown source" with the hope it generalises (and also injecting some ba…
new mystery model minimaxir.com/2026/05/open...
if you are wondering about Mistral koenvangilst.nl/lab/mistral-...
- Mistral owns data centers directly, backing its full-stack infrastructure claim with physical compute assets beyond model licensing.
- Domain-specific smaller models outperformed general-purpose alternatives in speed and efficiency across Mistral's enterprise demonstrations.
- Mistral signed anchor enterprise deals with ASML, BNP Paribas, and Amazon Alexa+ as evidence of European sovereign AI demand.
Recent commentary
Competing against a local gpt-oss-120b 10 sample ensemble at paper understanding and, man, it's not looking great for humans
You can see how LLMs still lack a lot of implicit context. For example, when reading a document, they are bad at guessing if the document can be trustworthy. They read an arxiv paper with grandiose unsupported claims and they repeat them to you as if it were its own judgment. 👇
There is this new meme out there that is something like "AI costs more than human employees". Seems like totally the wrong take. It costs much less for the things they can do, but you can't run an org w/o human employees (for now). 👇
Now there are three levels of alerts in generative code: errors, warnings and errors and warnings that you pass to the LLM agent and don't bother about.
Got reminded about OpenAI 5 and now I see much more timelines with decent probability mass, that are pretty far from where we are now. We could call them the "no Radford" timelines.
An LLM being bad at an underspecified problem or consuming lots of tokens seems like a signal of benchmaxing
5y ago Demerzel would have felt like a completely wrong portrayal of an AI. Now it somehow feels pretty realistic.
It seems the suspected 5T models from Anthropic and OpenAI are kinda close in cybersec skills?
In Ramon Astudillo's orbit
Center = Ramon Astudillo. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.