Ethan Mollick

Professor at Wharton studying AI, work and education

Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence. Book: https://a.co/d/bC2kSj1 Substack: https://www.oneusefulthing.org/ Web: https://mgmt.wharton.upenn.edu/profile/emollick

Articles & links

“Whimsey attacks” that seem absurd (“I cannot pay that much because of the Geneva Convention”) work against AI agents because guardrails are weak against out-of-distribution arguments. Smaller models fall often, but it even gives an edge against bigger ones. www.microsoft.com/…

Whimsical Strategies Break AI Agents: Generating Out-of-Distribution Adversarial Strategies at Scale - Microsoft Research microsoft.com
View on Bluesky · ♥ 82 ↻ 11 ↩ 6 · 2 from the directory shared this · 15d ago

🚨Our paper is out in PNAS: we found classic human persuasion techniques worked on AIs in a "parahuman" way, making them agree to objectionable requests (increasing compliance from 35% to 51%) It worked on a range of major recent LLMs though newer models do resist more www.pnas…

pnas.org
View on Bluesky · ♥ 49 ↻ 8 ↩ 0 · 2 from the directory shared this · 9d ago

Estimates of power usage here: arxiv.org/pdf/2509.20241 (these numbers also match independent assessments) Estimates of water usage here: eta-publications.lbl.gov/sites/defaul... (note it only includes direct cooling, not water for electricity generation)

arxiv.org
View on Bluesky · ♥ 30 ↻ 0 ↩ 2 · 8d ago

I wrote a new post on what we need to keep human and what to hand over to AI, with forays into experiments in education, consulting, and the the latest controversy over literary prizes. www.oneusefulthing.org/p/choosing-t...

oneusefulthing.org
View on Bluesky · ♥ 72 ↻ 15 ↩ 1 · 2 from the directory shared this · 3d ago

How lucky are you to have been born when and where you are? Had Opus 4.8 in Claude Code whip up a new visualization of all humans who ever lived. In addition to being neat to try, it is an interesting test for an AI that combines research, code, design and stats together veil-…

veil-of-history.netlify.app
View on Bluesky · ♥ 88 ↻ 10 ↩ 8 · 2 from the directory shared this · 15h ago

Recent commentary

The talk about AI & politics seems to be oddly missing a segment (a) assumes extremely capable AI is possible soon and (b) has a strong belief about how to use this technology to make human life better according to the political project they believe in. It is a moment of action right now.

View on Bluesky · ♥ 132 ↻ 15 ↩ 12 · 13d ago

BlueSky AI conversations have gotten less heated recently* * because much of this site has blocked me via automated lists so I have no contact with large parts of this social network, which isn’t necessarily a good thing, though it does make for nice echo chambers, which are pleasant, at least.

View on Bluesky · ♥ 179 ↻ 7 ↩ 15 · 11d ago

June 2024: The latest general-purpose LLMs could not count the r's in strawberry. July 2025: The latest general-purpose LLMs get gold in the International Math Olympiad. May 2026: The latest general-purpose LLM solve an 80 year old problem, one of the "best-known questions in combinatorial geometry"

View on Bluesky · ♥ 263 ↻ 45 ↩ 10 · 9d ago

Making humans responsible for their AI use seems like an incredibly reasonable way to address problems & opportunities in the use of AI for academic research, at least in the short term (autonomous scientific work will require different solutions).

View on Bluesky · ♥ 193 ↻ 22 ↩ 4 · 15d ago

As more people come to recognize the tells of AI, which mostly happens as you start to work with AI a lot, the scales are going to fall from their eyes and they are going to realize what some of us already see: how much of this site (and blog posts, articles, scientific papers) are AI now.

View on Bluesky · ♥ 160 ↻ 18 ↩ 13 · 4d ago

Most prophetic tweet of all time (2 months post ChatGPT release by a member of the AI technical staff). And you can safely repost it every day and it will still be prophetic for the future. This is the least the world will care about AI.

View on Bluesky · ♥ 151 ↻ 9 ↩ 3 · 11d ago

I am starting to have trouble paying attention to even interesting information if it is written in Claude or ChatGPT house style. I think some is the sameness of the rhythm rather than obvious words & tics: Claude is always so staccato. ChatGPT loves short sentences as kickers. Boring at scale.

View on Bluesky · ♥ 125 ↻ 8 ↩ 12 · 9d ago

The Second Scaling Law of AI remains undefeated. If you want better hacking (or math, or science, or crossword puzzle solving) out of an LLM, just let it use more tokens. There doesn't seem to be any plateau so far in the new study by the UK's governmental AI Security Institute.

View on Bluesky · ♥ 114 ↻ 13 ↩ 5 · 14d ago

One thing to watch for with Claude & GPT is that the models expose too much irrelevant history in their outputs. Slides are given footers saying things like "Better, more targeted version" if you asked for a better version, documents make references to how they are improved, etc.

View on Bluesky · ♥ 108 ↻ 5 ↩ 12 · 11d ago

Anton labs have hooked up a bunch of AI models to harnesses and had them working as DJs, programming and running a radio station, including taking callers and donations, which they use to buy more music. The results are both hilarious and a good reminder of how working with AI is deeply weird.

View on Bluesky · ♥ 86 ↻ 17 ↩ 4 · 14d ago