This paper looks cool: arxiv.org/abs/2605.23901 "We propose the Shannon Scaling Law, a unified theoretical framework that models LLM training as information transmission over a noisy channel ... The Shannon Scaling Law consistently outperforms classical scaling laws ..."
Who's Who of AI
YY Ahn
Prof @ School of Data Science, University of Virginia. Formerly at IU. Networks, data science, and machine learning. https://yyahn.com
🚲🚲🚲
What they're sharing
Articles & links
Sutton's "Bitter Lesson" keeps showing up in new corners. Latest example on data filtering for pretraining: arxiv.org/abs/2605.19407 "... with enough compute, the best data filter is no data filter." -- Btw, Welch Labs has a nice video on the "bitter lesson": www.youtube.com/w…