"Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs" arxiv.org/abs/2402.03927
YY Ahn
Articles & links
This paper looks cool: arxiv.org/abs/2605.23901 "We propose the Shannon Scaling Law, a unified theoretical framework that models LLM training as information transmission over a noisy channel ... The Shannon Scaling Law consistently outperforms classical scaling laws ..."
Sutton's "Bitter Lesson" keeps showing up in new corners. Latest example on data filtering for pretraining: arxiv.org/abs/2605.19407 "... with enough compute, the best data filter is no data filter." -- Btw, Welch Labs has a nice video on the "bitter lesson": www.youtube.com/w…
In YY Ahn's orbit
Center = YY Ahn. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.