Shauli Ravfogel

Faculty fellow at NYU CDS. Previously: PhD @ BIU NLP.

Articles & links

12/ We just think the evidence so far doesn't quite support a strong interpretation of the introspection findings in previous work. And, as always, extraordinary claims require extraordinary evidence. Paper: arxiv.org/abs/2605.26242

Can LLMs Introspect? A Reality Check arxiv.org
View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 10d ago

Recent commentary

1/ Can LLMs introspect, i.e., reason about their internal states? Recent work claims LLMs notice when their "thoughts" get tampered with, and can report the content. We took a closer look and think it's too early to say that. Work led by Shashwat Singh, with @tallinzen.bsky.social and me. A thread 🧵

View on Bluesky · ♥ 5 ↻ 0 ↩ 1 · 10d ago

In Shauli Ravfogel's orbit

Center = Shauli Ravfogel. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.