Who's Who of AI

Jonathan Stray

66 trust · researcher · @jonathanstray.bsky.social · 880 followers

Why they matter

Researcher with a public record across AI research and culture, work & education.

AI signals: 4
Sources: 4
Discussions: 21
Latest signal: 16h ago

Knowing things is a solved problem. Getting along is not. Working on AI, media, and inter-group conflict @CHAI_Berkeley. Got here from computational journalism.

What they're sharing

Articles & links

Hi! Read your paper, very interesting. Thought about it, and I'm not sure I find the computational complexity proof convincing because it bites only for learning all possible distributions D -- human-like D is plausibly easier. I found this, makes argument more rigorously. ar…

Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible arxiv.org

AI Weekly's analysis →

Guerzhoy argues van Rooij et al.'s 2024 proof that AGI via machine learning is intractable rests on an unjustified assumption about data distributions.
The same proof structure, applied consistently, would show ImageNet classification is intractable, yet that task demonstrably works.
Three barriers block any such proof: defining human-like behavior precisely, accounting for inductive bias, and specifying relevant data subsets.

View on Bluesky · ♥ 0 ↻ 0 ↩ 1 · 2 from the directory shared this · 34d ago

There's a long list of content-based signals we might want to rank on. We are limited by the classifiers we actually have. BUT we are working on LLM scoring with arbitrary prompts! The challenge is performance, but we have a cunning plan docs.google.com/document/d/1...

LLM-based scoring for GreenEarth docs.google.com

View on Bluesky · ♥ 3 ↻ 1 ↩ 2 · 11d ago

Interesting! But there's no human testing here. "Quality" was evaluated only using automated metrics (BLEU, COMET) and while classifiers could tell human vs LLM apart, that doesn't tell us which one is preferred by humans. Meanwhile, www.nature.com/articles/s41...

AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably - Scientific Reports nature.com

View on Bluesky · ♥ 0 ↻ 0 ↩ 1 · 13d ago

Well, there are examples in the paper and the dataset is open source! github.com/humanCompati...

View on Bluesky · ♥ 0 ↻ 0 ↩ 1 · 32d ago

Our definition turns “neutral” into something empirically testable, generalizes to any conflict, and is grounded in political theory. And it really does find better answers that everyone can agree on. Preprint arxiv.org/abs/2605.28911 Dataset github.com/HumanCompati... /FIN

Political Neutrality as Balanced Approval: A Large-Scale Human Evaluation of AI Responses arxiv.org

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 52d ago

This may be the first widely deployed AI conflict mediation system. It's used by Chinese citizens in Hangzhou to resolve disputes with businesses. We're going to be seeing a lot more of this. www.sohu.com/a/932941980_...

AI调解员：国产大模型赋能消费纠纷化解，效率提升25% | 法律AI应用 sohu.com

View on Bluesky · ♥ 8 ↻ 2 ↩ 0 · 2 from the directory shared this · 16h ago

You may remember we did a 10,000 person experiment testing AI-powered healthier feed algorithms. Well, now we’ve built it as a product for BlueSky/ATProto. We are currently recruiting pre-release users to test it out and give us feedback. Want to try it? survey.qualtrics.com/j…

Qualtrics Survey | Qualtrics Experience Management survey.qualtrics.com

View on Bluesky · ♥ 4 ↻ 2 ↩ 0 · 20d ago

Their own posts

Recent commentary

Humanity's ability to know, reason, judge, and act well is the foundation of science, democracy, crisis response, & management of AI itself. AI poses serious risks to that foundation. New paper on epistemic risks by 30 experts calls for attention and proposes solutions. Link in thread.

View on Bluesky · ♥ 34 ↻ 19 ↩ 3 · 49d ago

Fusion power is gonna go a lot like AI. Empty promises for decades, then suddenly here faster than anyone can adjust to.

View on Bluesky · ♥ 11 ↻ 0 ↩ 1 · 12d ago

What could it mean for an AI to be "politically neutral"? And can we measure it? New paper + dataset. We propose a definition that applies to any type of conflict on any topic: a neutral response should maximize approval on both sides of an issue, while keeping that approval balanced. 1/🧵

View on Bluesky · ♥ 4 ↻ 1 ↩ 2 · 52d ago

The AI models of today are the worst they will ever be. And yet, pretty much every "AI will never..." claim has now been shattered. I don't understand people who still bet against AI. How much more evidence do you need that these machines are going to be smarter than us in every way?

View on Bluesky · ♥ 4 ↻ 1 ↩ 1 · 6d ago

Seeing a flurry of evals and startups promising to test the mental health effects of AI. Literally all of them test what the model says in various conditions... none of them measure actual outcomes on actual people. A big gap, fixable with privacy-preserving experiments.

View on Bluesky · ♥ 5 ↻ 0 ↩ 1 · 63d ago

Is this a good logo for the GreenEarth feed? It's a healthier, user-controllable, open-source, LLM-powered, transparent feed we're building -- now in alpha testing. Try it? Tell us what you think! bsky.app/profile/did:...

View on Bluesky · ♥ 3 ↻ 1 ↩ 0 · 22d ago

I want to make sure AI doesn't incite human conflict. It's sometimes hard to explain what I do, but that's the core of it -- it won't happen automatically. And we're making progress! Both theoretically, and in field experiments that test how AI alters human relationships.

View on Bluesky · ♥ 2 ↻ 1 ↩ 0 · 27d ago

AI safety typically assumes one well-meaning user. I'm working on the case where two of them are at war.

View on Bluesky · ♥ 2 ↻ 0 ↩ 0 · 44d ago

Hello I am in Montreal for the week! Anyone interested in AI safety I should meet here?

View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 28d ago
