Who's Who of AI

David Bau

279 trust researcher @davidbau.bsky.social · 2,317 followers

AI research

Why they matter

Researcher with public evidence across AI research.

AI signals: 3
Sources: 3
Discussions: 0
Latest signal: 10d ago

View every signal from David Bau →

Interpretable Deep Networks. http://baulab.info/ @davidbau

What they're sharing

Articles & links

You can play the contest too, at mazesofmenace.ai. Just point your coding agents at the repo github.com/davidbau/te... Nobody has cracked it yet: the field is wide open.

GitHub - davidbau/teleport-contest: The Teleport Coding Challenge — port NetHack 5.0 from C to JavaScript with bit-exact parity. Fork to enter. github.com

View on Bluesky · ♥ 2 ↻ 0 ↩ 2 · 10d ago

But AI lie detection is hard and remains a central research challenge. Recent research suggests that simple probes can pick up on neural "tells" that reveal when it is lying, even when the output looks clean. anthropic.com/research/pr... arxiv.org/abs/2502.03407

Simple probes can catch sleeper agents anthropic.com

View on Bluesky · ♥ 0 ↻ 0 ↩ 1 · 53d ago

Detecting Strategic Deception Using Linear Probes arxiv.org

View on Bluesky · ♥ 0 ↻ 0 ↩ 1 · 53d ago

Detecting Strategic Deception Using Linear Probes arxiv.org

View on Bluesky · ♥ 0 ↻ 0 ↩ 1 · 53d ago

I recently spoke with Yascha Mounk about how researchers look inside AI to understand how it is thinking. Here is the podcast: writing.yaschamounk.com/p/david-bau-2

David Bau on How—and Whether—Artificial Intelligence Thinks writing.yaschamounk.com

View on Bluesky · ♥ 6 ↻ 2 ↩ 1 · 2 from the directory shared this · 44d ago

Also check out the previous interview I had with Yascha about AI, which more of primer, here: writing.yaschamounk.com/p/david-bau

David Bau on How Artificial Intelligence Works writing.yaschamounk.com

View on Bluesky · ♥ 1 ↻ 1 ↩ 0 · 2 from the directory shared this · 44d ago

You can play the contest too, at mazesofmenace.ai. Just point your coding agents at the repo github.com/davidbau/te... Nobody has cracked it yet: the field is wide open.

Mazes of Menace — The Teleport Coding Challenge mazesofmenace.ai

View on Bluesky · ♥ 2 ↻ 0 ↩ 2 · 10d ago

Is it possible to write 100,000 lines of code well, if you do not read it? Let's go Hunting Zombies! davidbau.com/archives/20... In this post I dive into the code of two AI agent contestants in the Teleport coding challenge to learn their secrets. Very fun. And also instructive.

davidbau.com Hunting Zombies davidbau.com

View on Bluesky · ♥ 7 ↻ 0 ↩ 2 · 10d ago

Their own posts

Recent commentary

"You're right to call me on that!" Can you catch an AI in the act of lying? Register below to enter our AI lie-detection contest. AI lies are a big problem. The frontier labs have all worked hard to fight AI deception. They all try to monitor their AIs for it.

View on Bluesky · ♥ 6 ↻ 2 ↩ 1 · 53d ago

Their network

In David Bau's orbit

Center = David Bau. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.