Who's Who of AI

Shauli Ravfogel

162 trust researcher @shauli.bsky.social · 771 followers

AI research NLP & language

Why they matter

Researcher with public evidence across AI research, NLP & language.

AI signals: 0
Sources: 0
Discussions: 0
Latest signal: —

View every signal from Shauli Ravfogel →

Faculty fellow at NYU CDS. Previously: PhD @ BIU NLP.

What they're sharing

Articles & links

↻ Shauli Ravfogel reposted

@hakwan.bsky.social

LLM introspection revisited. if we do the controls properly we may not have strong enough evidence just yet arxiv.org/html/2605.26...

Can LLMs Introspect? A Reality Check arxiv.org

AI Weekly's analysis →

NYU researchers argue current evidence is insufficient to establish that large language models display strong metacognitive monitoring of their internal states.
After random relabeling that removes semantic correlations, model performance on biofeedback tasks falls close to the majority-class baseline.
In a three-way steering test adding a 'gaslight' input-level condition, models fail to reliably distinguish input-level from activation-level interventions.

Read full analysis →

View on Bluesky →

12/ We just think the evidence so far doesn't quite support a strong interpretation of the introspection findings in previous work. And, as always, extraordinary claims require extraordinary evidence. Paper: arxiv.org/abs/2605.26242

Can LLMs Introspect? A Reality Check arxiv.org

View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 55d ago

Their own posts

Recent commentary

1/ Can LLMs introspect, i.e., reason about their internal states? Recent work claims LLMs notice when their "thoughts" get tampered with, and can report the content. We took a closer look and think it's too early to say that. Work led by Shashwat Singh, with @tallinzen.bsky.social and me. A thread 🧵

View on Bluesky · ♥ 5 ↻ 0 ↩ 1 · 55d ago

Their network

In Shauli Ravfogel's orbit

Center = Shauli Ravfogel. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.