AI Firehose

Daily-updated stream of AI research from ArXiv

Articles & links

TYPEWRITERLM, a new model trained on 54 billion historical tokens before 1913, enhances understanding of the past while tackling data quality issues. This framework could transform historical research by connecting AI and the humanities. https://arxiv.org/abs/2606.02991

Pretraining Language Models on Historical Text arxiv.org
View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 3 from the directory shared this · 15d ago

Superintelligent AI, designed through a solipsistic lens, risks failing at cooperation due to undermining behaviors from interactions among adaptive agents. This challenges paradigms and calls for cooperative systems emphasizing human agency and institutional design. https://a…

Solipsistic Superintelligence is Unlikely to be Cooperative arxiv.org
View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 3 from the directory shared this · 15d ago

Cognitive science is set for a breakthrough with AI integration, allowing generalizable models of cognition via naturalistic tasks. This method reshapes intelligence understanding, yielding insights and hypotheses about human cognition with complex data. https://arxiv.org/abs/…

[2502.20349] Naturalistic Computational Cognitive Science: Towards generalizable models and theories that capture the full range of natural behavior arxiv.org
View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 3 from the directory shared this · 24d ago

Research shows higher weight decay in language model pretraining boosts downstream adaptability, improving performance despite lower validation loss. This finding challenges conventional optimization views, emphasizing model plasticity's importance. https://arxiv.org/abs/2602.…

Weight Decay Improves Language Model Plasticity arxiv.org
View on Bluesky · ♥ 5 ↻ 1 ↩ 0 · 2 from the directory shared this · 17d ago

Researchers developed algorithms to estimate monotone statistics, cutting sample complexity and improving efficiency. Their methods reduce sizes by a factor t, enhancing calculations. This is vital for privacy-preserving eigenvalue estimation and linear regression. https://arx…

Privately Estimating Monotone Statistics in Polynomial Time arxiv.org
View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 2 from the directory shared this · 16d ago

GPIC introduces a massive 100M curated image dataset with permissive licensing for visual generative modeling research, aiming to improve reproducibility and reduce bias in AI, setting a stable benchmark for future multimodal AI advancements. https://arxiv.org/abs/2605.30341

GPIC: A Giant Permissive Image Corpus for Visual Generation arxiv.org
View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 2 from the directory shared this · 20d ago

In AI Firehose's orbit

Center = AI Firehose. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.