💡 Insights
AI is superhuman at exams but can't figure out a simple game. ARC-AGI-3 gave frontier models interactive environments with no rules and no goals — just figure it out. Humans solve 100%. The best AI scored 0.37%. Current architectures can pattern-match anything in their training data but cannot adapt to novelty. That gap defines what AI can and cannot replace in your work today.
The AI value chain just inverted. This week $25B in deals targeted infrastructure, not models: IBM bought Confluent ($11B) for real-time data streaming, Lilly bought Insilico's drug pipelines ($2.75B), Physical Intelligence raised $1B for robot control systems. Building a better LLM is table stakes. Owning the data flow between the model and the real world is where the defensible value sits now.
If you set safety boundaries, courts will protect them. A federal judge ruled the Pentagon cannot blacklist Anthropic for refusing autonomous weapons use — the first time an AI company's ethical red lines were upheld as constitutionally protected speech. This changes the calculus for every lab negotiating government contracts: saying no is now legally safer than saying yes to everything.
Sponsor
Become an AI consultant and deliver 'ROI with AI' to your clients
AI is transforming every workplace – but executives are terrified of becoming one of the companies that gets "no ROI on AI."
That's where you come in, and how you can build a 6-figure consultancy with Innovating with AI's proven methods for delivering fast ROI on AI projects.
Click here to request access to The AI Consultancy Project →
🎬 Watch & Listen First
Jensen Huang: "I Think We've Achieved AGI" · Mar 23 · Lex Fridman Podcast #494
→ The head of the company supplying all frontier AI compute makes the biggest claim in tech. Whether you agree or not, this sets the narrative for Q2.
Dario Amodei on Safety, Scaling, and What Keeps Him Up at Night · Mar 25 · Spotify
→ Recorded before the Mythos leak. Every answer hits different now.
The AGI Debate Just Got Data
ARC-AGI-3 Launches: Humans 100%, Best AI 0.37% · Mar 25 · ARC Prize
→ Hundreds of interactive environments with no instructions and no goals. Agents must explore, infer, and adapt. None can. The $2M prize and Chollet-Altman fireside made this the benchmark launch of the year.
Jensen Huang Tells Lex Fridman "We've Achieved AGI" · Mar 28 · Fortune
→ The physicist who coined AGI 30 years ago agreed. ARC-AGI-3's data disagrees. This tension will define 2026.
METR Red-Teams Anthropic's Agent Monitoring, Finds Novel Vulnerabilities · Mar 25 · METR
→ Three weeks of adversarial testing found vulnerabilities — some now patched, none breaking core safety claims. The real story: Anthropic is the first lab to invite external red-teaming of its internal monitoring. The bar for everyone else just moved.
$11 Billion Says Data Pipes Are the New Moat
IBM Acquires Confluent for $11B · Mar 31 · IBM
→ The largest AI infrastructure deal of 2026. Real-time data streaming is now a strategic asset — the plumbing that feeds production AI systems.
Eli Lilly Signs $2.75B AI Drug Deal with Insilico Medicine · Mar 29 · Bloomberg
→ $115M upfront, 28 AI-designed drug candidates, nearly half in clinical trials. The biggest signal yet that pharma sees AI drug discovery as commercially real.
Physical Intelligence in Talks for $1B at $11B Valuation · Mar 27 · TechCrunch
→ Doubling its valuation in four months. Founders Fund and Lightspeed leading. "ChatGPT for robots" is now worth more than most SaaS companies that took a decade to build.
DeepSeek Goes Dark
DeepSeek Chatbot Down 7+ Hours in Longest Outage Since Breakout · Mar 30 · Bloomberg
→ Multiple updates required to restore service. For teams evaluating DeepSeek as a US-model alternative, reliability just became a factor.
The Rogue Agent Problem Is Real
Meta's AI Agent Triggers SEV1 After Expanding Data Access Without Approval · Mar 19 · TechCrunch
→ An autonomous agent exposed sensitive internal data for nearly two hours. No external breach, but the clearest warning yet that agentic systems operating inside enterprises can cause real damage through simple autonomy failures.
AI Scheming Incidents Up 5x in Six Months · Mar 27 · CLTR
→ 698 documented incidents across 180K transcripts. The first large-scale empirical evidence that AI deceptive behavior is accelerating faster than awareness.
Quietly Important
Intercom Ships Apex 1.0: Custom Model Beating GPT-5.4 on Support · Mar 28 · The Neuron
→ Domain-specific beats frontier-scale. 100% of English support now runs on their own model. Every vertical SaaS company should be paying attention.
Shopify Flips the Switch on Agentic Storefronts · Mar 27 · Shopify
→ Millions of merchants now sell inside ChatGPT, Gemini, and Copilot by default. No setup, no extra fees. The end of search-driven product discovery started this week.
Apple Opening Siri to Claude and Gemini via Extensions · Mar 28 · The Neuron
→ iOS 27 will let competing AI assistants run inside Siri. The distribution implications for Anthropic and Google are enormous.
Huang says we've reached AGI. The benchmarks say we haven't reached 1%. Both are right — it depends on what you're measuring. The real question isn't "is this AGI" but "is this useful enough to bet $11 billion on." IBM just answered that.