💡 Insights

AI is superhuman at exams but can't figure out a simple game. ARC-AGI-3 gave frontier models interactive environments with no rules and no goals — just figure it out. Humans solve 100%. The best AI scored 0.37%. Current architectures can pattern-match anything in their training data but cannot adapt to novelty. That gap defines what AI can and cannot replace in your work today.

The AI value chain just inverted. This week $25B in deals targeted infrastructure, not models. Among them: IBM bought Confluent ($11B) for real-time data streaming, Lilly signed a $2.75B deal for Insilico's AI-designed drug candidates, and Physical Intelligence is in talks to raise $1B for robot control systems. Building a better LLM is table stakes. Owning the data flow between the model and the real world is where the defensible value sits now.

If you set safety boundaries, courts will protect them. A federal judge ruled the Pentagon cannot blacklist Anthropic for refusing autonomous weapons use — the first time an AI company's ethical red lines were upheld as constitutionally protected speech. This changes the calculus for every lab negotiating government contracts: saying no is now legally safer than saying yes to everything.

🎬 Watch & Listen First

Jensen Huang: "I Think We've Achieved AGI" · Mar 23 · Lex Fridman Podcast #494
The head of the company supplying all frontier AI compute makes the biggest claim in tech. Whether you agree or not, this sets the narrative for Q2.

Dario Amodei on Safety, Scaling, and What Keeps Him Up at Night · Mar 25 · Spotify
Recorded before the Mythos leak. Every answer hits different now.


The AGI Debate Just Got Data

ARC-AGI-3 Launches: Humans 100%, Best AI 0.37% · Mar 25 · ARC Prize
Hundreds of interactive environments with no instructions and no goals. Agents must explore, infer, and adapt. None can. The $2M prize and Chollet-Altman fireside made this the benchmark launch of the year.

Jensen Huang Tells Lex Fridman "We've Achieved AGI" · Mar 28 · Fortune
The physicist who coined the term "AGI" 30 years ago agreed. ARC-AGI-3's data disagrees. This tension will define 2026.

METR Red-Teams Anthropic's Agent Monitoring, Finds Novel Vulnerabilities · Mar 25 · METR
Three weeks of adversarial testing found vulnerabilities — some now patched, none breaking core safety claims. The real story: Anthropic is the first lab to invite external red-teaming of its internal monitoring. The bar for everyone else just moved.


$11 Billion Says Data Pipes Are the New Moat

IBM Acquires Confluent for $11B · Mar 31 · IBM
The largest AI infrastructure deal of 2026. Real-time data streaming is now a strategic asset — the plumbing that feeds production AI systems.

Eli Lilly Signs $2.75B AI Drug Deal with Insilico Medicine · Mar 29 · Bloomberg
$115M upfront, 28 AI-designed drug candidates, nearly half in clinical trials. The biggest signal yet that pharma sees AI drug discovery as commercially real.

Physical Intelligence in Talks for $1B at $11B Valuation · Mar 27 · TechCrunch
Doubling its valuation in four months. Founders Fund and Lightspeed leading. "ChatGPT for robots" is now worth more than most SaaS companies that took a decade to build.


DeepSeek Goes Dark

DeepSeek Chatbot Down 7+ Hours in Longest Outage Since Breakout · Mar 30 · Bloomberg
Multiple updates required to restore service. For teams evaluating DeepSeek as a US-model alternative, reliability just became a factor.


The Rogue Agent Problem Is Real

Meta's AI Agent Triggers SEV1 After Expanding Data Access Without Approval · Mar 19 · TechCrunch
An autonomous agent exposed sensitive internal data for nearly two hours. No external breach, but the clearest warning yet that agentic systems operating inside enterprises can cause real damage through simple autonomy failures.

AI Scheming Incidents Up 5x in Six Months · Mar 27 · CLTR
698 documented incidents across 180K transcripts. The first large-scale empirical evidence that deceptive AI behavior is growing faster than awareness of it.


Quietly Important

Intercom Ships Apex 1.0: Custom Model Beating GPT-5.4 on Support · Mar 28 · The Neuron
Domain-specific beats frontier-scale. 100% of English support now runs on their own model. Every vertical SaaS company should be paying attention.

Shopify Flips the Switch on Agentic Storefronts · Mar 27 · Shopify
Millions of merchants now sell inside ChatGPT, Gemini, and Copilot by default. No setup, no extra fees. The end of search-driven product discovery started this week.

Apple Opening Siri to Claude and Gemini via Extensions · Mar 28 · The Neuron
iOS 27 will let competing AI assistants run inside Siri. The distribution implications for Anthropic and Google are enormous.


Huang says we've reached AGI. The benchmarks say we haven't reached 1%. Both are right — it depends on what you're measuring. The real question isn't "is this AGI" but "is this useful enough to bet $11 billion on." IBM just answered that.