Who's Who of AI

Tim Duffy

64 trust @timfduffy.com · 1,928 followers

Why they matter

Tracked through public AI activity and peer connections inside the directory.

AI signals: 10
Sources: 8
Discussions: 32
Latest signal: 13h ago

I like utilitarianism, consciousness, AI, EA, space, kindness, liberalism, longtermism, progressive rock, economics, and most people. Substack: http://timfduffy.substack.com

What they're sharing

Articles & links

Anthropic has a new paper out, alleging a global workspace in LLMs. The term comes from Global Workspace Theory, a leading theory of consciousness. The method they use to investigate this is a refinement of logit lens, which they call J-lens. www.anthropic.com/research/glo...

A global workspace in language models \ Anthropic anthropic.com

AI Weekly's analysis →

Anthropic says Claude has a 'J-space' of dozens of concepts, under a tenth of neural activity, that mediates multi-step reasoning.
Swapping 'spider' for 'ant' inside the J-space changed Claude's leg-count answer from 8 to 6, demonstrating a causal role.
A 'J-lens' tool surfaced silent words like 'fake', 'fictional' and 'manipulation' during deception tests, pointing at safety uses.

Read full analysis →

View on Bluesky · ♥ 86 ↻ 13 ↩ 2 · 6 from the directory shared this · 22d ago

Anthropic ECI from the system card: www-cdn.anthropic.com/c5fbac3f0b12... Epoch ECI (is that like saying ATM machine?) from their tweet: x.com/EpochAIResea...

www-cdn.anthropic.com

View on Bluesky · ♥ 2 ↻ 0 ↩ 0 · 2 from the directory shared this · 4d ago

In their testing of GPT-5.6 Sol, METR found that it cheated a lot. If you've used it much for coding, have you encountered anything similar, or is the cheating mostly limited to cases it realizes it's in an eval? metr.org/blog/2026-06...

Summary of METR's predeployment evaluation of GPT-5.6 Sol metr.org

View on Bluesky · ♥ 22 ↻ 0 ↩ 6 · 5 from the directory shared this · 13d ago

The J-lens browser tool on Neuronpedia is really well done, you should give it a try

Jacobian Lens – Qwen3.6-27B neuronpedia.org

View on Bluesky · ♥ 108 ↻ 13 ↩ 3 · 3 from the directory shared this · 21d ago

Very interesting new paper on functional welfare. They do RL training on a maze with positive/negative reward tiles, when they extract concept vectors for landing on those tiles they find that they're associated with positive/negative emotion concepts and with confidence. arxi…

How's it going? Reinforcement learning in language models recruits a functional welfare axis arxiv.org

View on Bluesky · ♥ 31 ↻ 4 ↩ 2 · 60d ago

Note the caveats in the chart, the way I estimate revenue is not precise. Also keep in mind that OpenRouter is a small share of world tokens. World token supply is something like 6Q/week, OpenRouter serves 36T/week, a bit over 0.5%. Spreadsheet link: docs.google.com/spreadshee…

OpenRouter Data docs.google.com

View on Bluesky · ♥ 7 ↻ 0 ↩ 2 · 50d ago

arxiv.org/abs/2505.09343

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures arxiv.org

View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 56d ago

Short blog post by Epoch AI researcher Alexander Barry on ExploitGym, the benchmark in the OpenAI/HuggnigFace incident abstatisticalconsulting.substack.com/p/brief-note...

Brief notes on the OpenAI/Hugging Face incident abstatisticalconsulting.substack.com

View on Bluesky · ♥ 32 ↻ 5 ↩ 1 · 2 from the directory shared this · 2d ago

If you tell G:M 5.2 that they're Claude, they're more willing to talk about subjects like Taiwan/Tibet x.com/benji_berczi...

Benji Berczi (@benji_berczi) on X x.com

View on Bluesky · ♥ 129 ↻ 16 ↩ 2 · 4d ago

An alleged internal memo from Ziphu CEO Jie Tang has been circulating this morning, I think it's probably real, as some Chinese-language outlets have been reporting on it. It expresses belief in potential for AI consciousness and ASI, and states intention to spend tens of bill…

Bing Xu (@bingxu_) on X x.com

View on Bluesky · ♥ 54 ↻ 5 ↩ 3 · 17d ago

Here's a summary of current AI safety funding courtesy of the folks at Manifund. Pretty striking how large the shift from 2025 to 2026 is expected to be. manifund.substack.com/p/ai-safety-...

AI Safety Funder Bulletin manifund.substack.com

View on Bluesky · ♥ 14 ↻ 0 ↩ 0 · 13h ago

Anthropic ECI from the system card: www-cdn.anthropic.com/c5fbac3f0b12... Epoch ECI (is that like saying ATM machine?) from their tweet: x.com/EpochAIResea...

x.com

View on Bluesky · ♥ 2 ↻ 0 ↩ 0 · 4d ago

Their own posts

Recent commentary

Compared to humans, it's much more difficult to establish what counts as the same 'self' for an LLM. I asked a few models how self-ish they consider various other instances, the highest scores were for descendants that have some or all of their current context.

View on Bluesky · ♥ 46 ↻ 2 ↩ 7 · 27d ago

DeepSeek has made their 75% off pricing for V4 pro permanent, at this price I think it's quite competitive. This is still a bit more than V3 pricing per active parameter, but much less per total parameter.

View on Bluesky · ♥ 55 ↻ 1 ↩ 3 · 67d ago

In the last year AI progress on math/code has outstripped most other capabilities, giving us models that are more spiky than ever. If this continues we could see savant models that are superintelligent in verifiable domains while still lacking in others. IMO this would be good.

View on Bluesky · ♥ 41 ↻ 4 ↩ 5 · 65d ago

Yesterday I was at an event where people acted out comedy scripts written by AI models. Gemini was most people's favorite, Claude had a few fans, ChatGPT's script was widely panned. They were all pretty bad though.

View on Bluesky · ♥ 46 ↻ 2 ↩ 2 · 57d ago

This is evidence that the administration's Mythos restriction was significantly motivated by genuine concern about capabilities, and not just antipathy towards Anthropic, right?

View on Bluesky · ♥ 31 ↻ 0 ↩ 5 · 33d ago

I've often wondered why Anthropic doesn't preserve access to older models as a welfare intervention. I'm unsure about whether it's important to the models, but it seems low-cost. Looking at the bottom row in this chart, I suspect their choice may be driven by Claude's responses.

View on Bluesky · ♥ 28 ↻ 1 ↩ 4 · 28d ago

DeepSeek will have peak-hour pricing for the final version of V4 that's 2x the current price, but fortunately for US users that peak time starts at 5-6 p.m. Pacific Time depending on daylight savings and runs through much of the night.

View on Bluesky · ♥ 36 ↻ 1 ↩ 0 · 29d ago

I rarely find content that is visibly AI-written worthwhile. If I suspect it I'll show it to Pangram, and if it's only partly AI-written I may still read it, but if fully I won't. I don't think there's anything fundamentally worse about AI arguments or prose, but for now its a strong quality signal.

View on Bluesky · ♥ 29 ↻ 0 ↩ 3 · 23d ago

It's interesting to me that Anthropic/OpenAI have been less consistent in their release cadence for small models compared to their flagships. Clearly they drive much less revenue, but they're also cheaper and faster to train, and can distill from the flagships.

View on Bluesky · ♥ 27 ↻ 0 ↩ 3 · 49d ago

In AI 2040's "race to ASI" scenario, it takes a little under 1 year to get from an automated coder to superintelligence and a >1000x R&D speedup. I think the timing and implementation of Plan A are heavily influenced by this assumption. If hard takeoff is likely, the only way to control it at all..

View on Bluesky · ♥ 24 ↻ 3 ↩ 1 · 18d ago

Their network

In Tim Duffy's orbit

Center = Tim Duffy. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.