Stephen Turner

Associate Professor and Research Dean at UVA School of Data Science, #Rstats enthusiast, dad, runner, guitar noise-maker. Views my own. Web: https://datascience.virginia.edu/people/stephen-turner Newsletter: https://blog.stephenturner.us

Articles & links

Claude Opus 4.8 - if this honesty thing is real that'd be nice www.anthropic.com/news/claude-...

Introducing Claude Opus 4.8 \ Anthropic anthropic.com
AI Weekly's analysis
  • Opus 4.8 matches Opus 4.7 pricing at $5/$25/M tokens; Effort Modes replace pricing tiers as the cost-quality dial.
  • Dynamic Workflows impose hard ceilings: 1,000 total subagents, 16 concurrent; workflow plans live in JavaScript variables outside Claude's context window.
  • SWE-bench Pro score jumps from 64.3% (Opus 4.7) to 69.2% (Opus 4.8); the model flags its own code flaws 4x more often than its predecessor.
Read full analysis →
View on Bluesky · ♥ 8 ↻ 0 ↩ 1 · 3 from the directory shared this · 1d ago

RefusalBench: Why Refusal Rate Misranks Frontier LLMs on Biological Research Prompts: arxiv.org/abs/2605.21545

arxiv.org
View on Bluesky · ♥ 3 ↻ 0 ↩ 0 · 5d ago

This one hits hard www.newyorker.com/news/fault-l...

newyorker.com
View on Bluesky · ♥ 17 ↻ 6 ↩ 1 · 2 from the directory shared this · 2d ago

Five Things (May 29, 2026): AI, writing, and thinking; the despair of the professor; ESMFold2 and a world model of proteins; Pope Leo on AI; NIH’s foreign co-author crackdown blog.stephenturner.us/p/five-thing... 🧬💻🧪

blog.stephenturner.us
View on Bluesky · ♥ 2 ↻ 0 ↩ 0 · 8h ago

What's happening this week in AI & life sciences: doi.org/10.59350/810... Jassi Pannu on AI and biosecurity, RAND/Helena AIxBio biosecurity mitigations, Blekhman on genomics AI, Nature’s AI scientists week, AI in peer review. 🧬💻🧪

doi.org
View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 6d ago

Recent commentary

I'd like to see the Claude/ChatGPT histories of all these folks booing commencement speakers off stage 🤔

View on Bluesky · ♥ 3 ↻ 0 ↩ 0 · 7d ago