Scott McGrath

Biomedical Informatics PhD • CITRIS Health @UC Berkeley • FAMIA • Focusing on Informatics and AI in medicine • Linfield U. Grad • Missoula MT https://smcgrath.phd

Articles & links

Interesting perspective on one student's experience of spending their entire college career in the AI era. #AcademicSky

nytimes.com
AI Weekly's analysis
  • ChatGPT's 2022 debut triggered cascading honor code violations and policy reversals that persisted across all four years of one college cohort.
  • Faculty policy swung between banning and requiring AI tools within single academic years, leaving students unable to follow consistent rules.
  • A documented social rift formed within the Class of 2026 between students who adopted AI tools and those who refused throughout their degree.
View on Bluesky · ♥ 3 ↻ 0 ↩ 0 · 8 from the directory shared this · 11d ago

Claude Opus 4.8 is out! It adds a major push for precision, making it four times less likely than Opus 4.7 to let flaws in code pass unremarked. Early testers note it proactively flags uncertainties and shaky assumptions in data.

Introducing Claude Opus 4.8 \ Anthropic anthropic.com
AI Weekly's analysis
  • Opus 4.8 matches Opus 4.7 pricing at $5/$25/M tokens; Effort Modes replace pricing tiers as the cost-quality dial.
  • Dynamic Workflows impose hard ceilings: 1,000 total subagents, 16 concurrent; workflow plans live in JavaScript variables outside Claude's context window.
  • SWE-bench Pro score jumps from 64.3% (Opus 4.7) to 69.2% (Opus 4.8); the model flags its own code flaws 4x more often than its predecessor.
View on Bluesky · ♥ 7 ↻ 1 ↩ 0 · 3 from the directory shared this · 1d ago

Academic publishing is facing a major crisis with AI slop. Journal editors are being flooded with AI-generated submissions that are almost impossible to detect. It is getting harder for human reviewers to filter out the noise.

AI-generated research papers are overwhelming peer review | The Verge theverge.com
AI Weekly's analysis
  • AI-generated academic papers now regularly pass journal peer review, evading both human reviewers and automated AI-detection tools.
  • Scientists identify mandatory data-sharing and reproducibility checks as the only remaining procedural safeguards capable of catching AI-fabricated research.
  • The crisis extends beyond arXiv's hallucinated-citation problem to affect broad peer-reviewed publishing across multiple scientific disciplines.
View on Bluesky · ♥ 2 ↻ 1 ↩ 0 · 3 from the directory shared this · 14d ago
Scott McGrath reposted
Ethan Mollick @emollick.bsky.social

I wrote a new post on what we need to keep human and what to hand over to AI, with forays into experiments in education, consulting, and the latest controversy over literary prizes. www.oneusefulthing.org/p/choosing-t...

oneusefulthing.org View on Bluesky →

Anthropic and OpenClaw have pushed the tech sector into the long-awaited age of AI agents. Programmers using Claude Code Opus 4.5 report a 90x surge in code output, though security reviews show these autonomous subagents remain highly unpredictable risk factors.

wired.com
View on Bluesky · ♥ 1 ↻ 2 ↩ 0 · 2 from the directory shared this · 3d ago

A massive gap exists between physician AI awareness and clinical adoption. A new study shows that while 80% of 1,049 doctors surveyed expect AI to improve care, only 28% have used it. #MedSky #MedAI

nature.com
View on Bluesky · ♥ 3 ↻ 0 ↩ 1 · 14d ago

Sneak peek at new Siri app reveals Apple’s plans to take on ChatGPT and more techcrunch.com/2026/05/28/s...

techcrunch.com
View on Bluesky · ♥ 1 ↻ 0 ↩ 1 · 1d ago

School leaders are lagging on giving teachers formal guidance for using AI tools in the classroom. According to Gallup data, around 8 in 10 K-12 educators say they have received zero institutional direction on how to apply the tech to their daily workflows. #EduSky

axios.com
View on Bluesky · ♥ 2 ↻ 0 ↩ 1 · 2d ago

The true threat of AI to the labor market lies in a difficult workplace transition rather than sudden mass unemployment. While over 40% of employees now use generative AI, early data reflects incremental productivity gains rather than widespread role replacement.

technologyreview.com
View on Bluesky · ♥ 3 ↻ 1 ↩ 0 · 3d ago

Recent commentary

An unanticipated danger of ambient AI: converting the statement “female mail man” into “Patient is a 26 year old biological male identifying as a female”. #MedSky

View on Bluesky · ♥ 117 ↻ 45 ↩ 5 · 11d ago

Just finished recording my last lecture for an Introduction to AI for Clinical Students class that I’m teaching in two weeks. 30 lectures spread out over 4 weeks! Really interested in how it is received. #MedEd #MedSky

View on Bluesky · ♥ 9 ↻ 0 ↩ 2 · 3d ago

Editing times for pediatric admission notes plummeted from 48.5 to 10.8 minutes with ambient AI. Across 127k hospital notes, the tool slashed cognitive burden during initial ED & ward encounters, but it offered no time savings for heavily templated daily progress notes. #amplify2026 #medsky

View on Bluesky · ♥ 3 ↻ 1 ↩ 1 · 9d ago

Keynote talk from Dr. Lee, advancing healthspan with AI and Agentic AI. #amplify2026

View on Bluesky · ♥ 3 ↻ 0 ↩ 1 · 10d ago

Setting up for the Clinical Informatics Keynote: Advancing Healthspan with AI and Agentic AI: Transforming How We Care, Discover, and Share. Over 1118 people in attendance here in Denver! #amplify2026

View on Bluesky · ♥ 5 ↻ 0 ↩ 0 · 10d ago

Setting up in workshop #CI07, Building AI Agents for Healthcare: A Practical Introduction Using Microsoft Copilot Studio. Here are some nice visualizations of medical AI agent use cases. #Amplify2026 #MedSky

View on Bluesky · ♥ 3 ↻ 0 ↩ 1 · 11d ago

Walking through how the speakers sort their risk categories for considering approval of Generative AI tools in a clinical setting. #Amplify2026 #Medsky

View on Bluesky · ♥ 2 ↻ 0 ↩ 1 · 11d ago

Standard IT evaluations fall apart for generative AI. There is a lack of gold standards for subjective outputs like clinical notes, and background vendor updates cause model drift. Epic's AI discharge summary tool matches human quality but produces more errors. #Amplify2026 #MedSky

View on Bluesky · ♥ 2 ↻ 0 ↩ 1 · 11d ago

LLMs alone are blind to today's lab results and proprietary clinical protocols. RAG bridges this gap by chunking institutional data into searchable numeric vectors. It doesn't make the model smarter, but grounds it in specific, cited documents. #Amplify2026 #MedSky

View on Bluesky · ♥ 3 ↻ 0 ↩ 0 · 11d ago
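
The retrieval step described in the RAG post above can be sketched roughly as follows. This is a minimal illustration, not the workshop's implementation: the two protocol snippets, their IDs, and every function name are hypothetical, and the word-count `embed` function is a stand-in for the learned embedding model a real system would use.

```python
import math
import re
from collections import Counter

# Hypothetical institutional snippets; text and IDs are invented for illustration.
DOCUMENTS = {
    "sepsis-protocol": "Sepsis protocol: draw lactate and blood cultures, "
                       "then start antibiotics within one hour.",
    "potassium-protocol": "Potassium replacement protocol: recheck serum "
                          "potassium four hours after each dose.",
}


def chunk(text, size=8):
    """Split text into consecutive chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text):
    """Toy embedding: a sparse word-count vector (assumption: stands in
    for a real embedding model, which would return dense floats)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(documents):
    """Chunk every document and store (doc_id, chunk_text, vector) entries."""
    return [
        (doc_id, piece, embed(piece))
        for doc_id, text in documents.items()
        for piece in chunk(text)
    ]


def retrieve(index, question, k=1):
    """Return the k chunks most similar to the question, with their source IDs."""
    query = embed(question)
    ranked = sorted(index, key=lambda entry: cosine(query, entry[2]), reverse=True)
    return [(doc_id, piece) for doc_id, piece, _ in ranked[:k]]


def build_prompt(question, hits):
    """Ground the question in retrieved chunks, each tagged with its source ID."""
    context = "\n".join(f"[{doc_id}] {piece}" for doc_id, piece in hits)
    return f"Answer using only the cited context.\n{context}\nQuestion: {question}"


index = build_index(DOCUMENTS)
hits = retrieve(index, "When should serum potassium be rechecked?")
print(build_prompt("When should serum potassium be rechecked?", hits))
```

The model itself is unchanged in this sketch; only the prompt gains retrieved, source-tagged text, which is the sense in which RAG grounds the model rather than making it smarter.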

Over 1,250 FDA-authorized AI medical devices are on the market, but only 9% have post-deployment surveillance plans. The recent NHLBI workshop shows patient use is outrunning clinical guidance. #amplify2026 #medsky

View on Bluesky · ♥ 2 ↻ 0 ↩ 0 · 9d ago