Jonathan Stray

Knowing things is a solved problem. Getting along is not. Working on AI, media, and inter-group conflict @CHAI_Berkeley. Got here from computational journalism.

Articles & links

Recent commentary

Humanity's ability to know, reason, judge, and act well is the foundation of science, democracy, crisis response, & management of AI itself. AI poses serious risks to that foundation. New paper on epistemic risks by 30 experts calls for attention and proposes solutions. Link in thread.

View on Bluesky · ♥ 34 ↻ 19 ↩ 3 · 9d ago

What could it mean for an AI to be "politically neutral”? And can we measure it? New paper + dataset. We propose a definition that applies to any type of conflict on any topic: a neutral response should maximize approval on both sides of an issue, while keeping that approval balanced. 1/🧵

View on Bluesky · ♥ 4 ↻ 1 ↩ 2 · 12d ago

Seeing a flurry of evals and startups promising to test the mental health effects of AI. Literally all of them test what the model says in various conditions... none of them measure actual outcomes on actual people. A big gap, fixable with privacy-preserving experiments.

View on Bluesky · ♥ 5 ↻ 0 ↩ 1 · 23d ago

AI safety typically assumes one well-meaning user. I'm working on the case where two of them are at war.

View on Bluesky · ♥ 2 ↻ 0 ↩ 0 · 4d ago

In Jonathan Stray's orbit

Center = Jonathan Stray. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.