YY Ahn

Prof @ School of Data Science, University of Virginia. Formerly at IU. Networks, data science, and machine learning. https://yyahn.com 🚲🚲🚲

Articles & links

This paper looks cool: arxiv.org/abs/2605.23901 "We propose the Shannon Scaling Law, a unified theoretical framework that models LLM training as information transmission over a noisy channel ... The Shannon Scaling Law consistently outperforms classical scaling laws ..."

arxiv.org
View on Bluesky · ♥ 10 ↻ 1 ↩ 0 · 3d ago

Sutton's "Bitter Lesson" keeps showing up in new corners. Latest example on data filtering for pretraining: arxiv.org/abs/2605.19407 "... with enough compute, the best data filter is no data filter." -- Btw, Welch Labs has a nice video on the "bitter lesson": www.youtube.com/w…

arxiv.org
View on Bluesky · ♥ 4 ↻ 0 ↩ 0 · 5d ago