Ai2

Breakthrough AI to solve the world's biggest problems.
› Join us: http://allenai.org/careers
› Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm

Articles & links

Everything is open—download the MolmoMotion weights, inspect the training data, & customize for your applications. ✏️ Blog: allenai.org/blog/molmo-m... 🤗 Models: huggingface.co/collections/... 📊 Data: huggingface.co/datasets/all... 📄 Paper: allenai.org/papers/molmo...

View on Bluesky · ♥ 6 ↻ 0 ↩ 0 · 1d ago

MolmoAct 2 now also officially supports @hf.co’s LeRobot platform. Teams already working in the LeRobot ecosystem can drop the model into their existing setup without retooling. 🤗 Learn more: buff.ly/OX8tBTZ

View on Bluesky · ♥ 1 ↻ 0 ↩ 1 · 21d ago

We're releasing a dataset of 14K HuggingFace models, datasets, papers, & codebases linked by 51K evaluations, fine-tunings, & references, plus the ArtifactLinker code. We hope it helps others find SOTA eval results. 💻 Code: buff.ly/yccdtCd 📊 Data: buff.ly/d9sF7T6

View on Bluesky · ♥ 5 ↻ 0 ↩ 0 · 27d ago

Available now in the same sizes as v1: Nano, Tiny, Base. Open weights, open training code. If you're running v1 and v1.1 works for your task, expect significant speedups during fine-tuning & inference. 🤗 Models: buff.ly/ZpZqvTv 🔗 Blog: buff.ly/oktVOjF

View on Bluesky · ♥ 6 ↻ 0 ↩ 0 · 30d ago

Recent commentary

LLMs are no longer created w/ human data alone. They rely on other models to generate & filter data, evaluate outputs, & guide dev work. So what is a modern LLM built on? Olmo 3 → 89 model + 183 dataset dependencies; Nemotron 3 → 273 model + 560 dataset dependencies. We made ModSleuth to trace this. 🧵

View on Bluesky · ♥ 53 ↻ 11 ↩ 1 · 7d ago

Building an LLM means evaluating it over & over as it changes. Tweak a hyperparameter or scale the model up, & every new checkpoint sends you back through the same benchmarking loop. We're releasing olmo-eval, a workbench built for this kind of iterative model development. 🧵

View on Bluesky · ♥ 8 ↻ 3 ↩ 1 · 6d ago

MolmoAct 2 artifacts have been downloaded 400K+ times in under 1 month. Today we're opening up the full code & training data: everything you need to fine-tune or build on our fully open robotics foundation model. 🧵

View on Bluesky · ♥ 9 ↻ 1 ↩ 1 · 21d ago

Learn how @thinkaisquared.bsky.social & Domyn used Olmo, our family of fully open language models, to build their own models for regulated industries like finance, healthcare, & the public sector. 🧵

View on Bluesky · ♥ 2 ↻ 1 ↩ 1 · 7h ago

In Ai2's orbit

[Follow graph: Ai2 at the center; accounts Ai2 follows on the left (green edges); accounts that follow Ai2 on the right (blue edges); mutual follows at the top (orange edges, slightly larger).]