blogs.nvidia.com web signal June 29th 2026

Anthropic Lands Claude on Azure's NVIDIA GB300 Blackwell Ultra

4 sources tracking this story


TL;DR

The GB300 NVL72 rack (72 Blackwell Ultra GPUs, 36 Grace CPUs, 37TB fast memory, 1,440 petaflops FP4) underpins what the reporting describes as Anthropic's first Claude deployment on NVIDIA hardware.
Anthropic designates three model tiers for Foundry: Opus 4.1 for specialized reasoning, Sonnet 4.5 for coding and agents, Haiku 4.5 for cost-efficient scale.
Excel Agent Mode and Copilot Researcher run on Anthropic-managed infrastructure under Anthropic's own ToS, outside the standard Azure data-residency commitment enterprise compliance teams expect.

Editor's note

The GA turns the November 2025 three-way partnership into live production infrastructure. Coverage from the three principals shows deliberate audience segmentation, which exposes a compliance boundary enterprise buyers must navigate: the M365 Copilot integrations (Excel Agent Mode, Copilot Researcher) run on Anthropic-managed infrastructure under Anthropic's own terms of service, separate from Azure's standard data-residency and MACC-billing frameworks. Anthropic's post is the only first-party source to name specific model-use pairings (Opus 4.1 for specialized reasoning, Sonnet 4.5 for coding and agents, Haiku 4.5 for cost-efficiency), giving buyers a cost-vs-capability map that neither the NVIDIA hardware post nor the Microsoft billing narrative provides. Third-party preview coverage adds two details absent from all first-party posts: multi-model chains pairing Claude with Microsoft's Phi-4 reportedly showed 30% accuracy gains in enterprise pilots, and Microsoft's roadmap reportedly includes an on-device Windows runtime called Olympus that would let organizations run Claude inference locally before handing off to the cloud.

For the last year the Anthropic stack has effectively meant 'Claude on AWS', and the NVIDIA stack has effectively meant 'everyone except Anthropic'. The post on NVIDIA's blog on Monday is the first real crack in both of those defaults. Claude is now generally available inside Microsoft Foundry on Azure, running on NVIDIA GB300 NVL72 systems with Quantum-X800 InfiniBand networking, and the reporting describes it as Anthropic's first deployment on NVIDIA hardware.

The foundation is the November 2025 strategic partnership between Microsoft, NVIDIA and Anthropic, but the news this week is the GA itself. Microsoft is pairing the launch with a performance pitch: per the Azure blog post, Claude Sonnet on GB300 generates tokens about 40% faster than on H100 nodes and roughly 15% faster than on B200. That is the kind of latency headroom that starts to matter once you chain sub-agents into a workflow.
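To make the chaining point concrete, here is a rough back-of-the-envelope sketch in Python. It assumes "40% faster" means 1.4x token throughput, which the source does not spell out. The baseline rate, the number of sub-agents and the tokens per step are hypothetical values picked for illustration, not figures from any of the posts.

```python
def time_saved_fraction(speedup_pct: float) -> float:
    """Share of generation time saved when throughput rises by speedup_pct percent."""
    return 1.0 - 1.0 / (1.0 + speedup_pct / 100.0)


def chain_seconds(steps: int, tokens_per_step: int, tokens_per_sec: float) -> float:
    """Generation time for sub-agents that run strictly one after another."""
    return steps * tokens_per_step / tokens_per_sec


# Vendor-reported figures quoted in the article.
GB300_VS_H100_PCT = 40.0
GB300_VS_B200_PCT = 15.0

# Hypothetical workload, chosen only for illustration (not from the source).
BASELINE_TPS = 50.0      # assumed H100 tokens per second
STEPS = 5                # assumed number of sequential sub-agents
TOKENS_PER_STEP = 800    # assumed output tokens per sub-agent

gb300_tps = BASELINE_TPS * (1.0 + GB300_VS_H100_PCT / 100.0)
h100_chain = chain_seconds(STEPS, TOKENS_PER_STEP, BASELINE_TPS)
gb300_chain = chain_seconds(STEPS, TOKENS_PER_STEP, gb300_tps)

# If both claims describe the same workload, B200 sits at about 1.40 / 1.15 of H100.
implied_b200_vs_h100 = (1.0 + GB300_VS_H100_PCT / 100.0) / (1.0 + GB300_VS_B200_PCT / 100.0)

if __name__ == "__main__":
    print(f"time saved per token: {time_saved_fraction(GB300_VS_H100_PCT):.1%}")
    print(f"H100 chain: {h100_chain:.1f}s, GB300 chain: {gb300_chain:.1f}s")
    print(f"implied B200 over H100: {implied_b200_vs_h100:.2f}x")
```

Under these assumptions, a 40% throughput gain trims a little under 29% of generation time, so the five-step chain drops from 80 seconds to roughly 57. The sketch ignores prompt processing, tool calls and network time, all of which would dilute the saving in a real workflow.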

Around the model itself, NVIDIA is shipping two pieces that read more like enterprise plumbing than product. Verified Agent Skills add provenance and security checks at the capability layer, and the Secure Agent Workspace Reference Design is a blueprint for running autonomous agents in a governed environment where identity, network access, credentials and runtime policy are controlled at the infrastructure level rather than left to the application. If you have been stuck on the question of how to let an agent actually do things without giving it the keys to the kingdom, that is the bit worth reading first.
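As a loose illustration of what "controlled at the infrastructure level" can mean, the Python sketch below puts a deny-by-default gate between an agent and the actions it requests. Everything in it is hypothetical: the class names, fields, and example hosts are invented for this sketch and do not come from NVIDIA's reference design or any published API.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional, Tuple


@dataclass(frozen=True)
class WorkspacePolicy:
    """Hypothetical policy record; field names are illustrative only."""
    allowed_identities: FrozenSet[str]
    allowed_hosts: FrozenSet[str]
    allowed_credentials: FrozenSet[str]


@dataclass(frozen=True)
class ActionRequest:
    """What an agent asks the workspace to do on its behalf."""
    identity: str
    host: str
    credential: Optional[str] = None


def authorize(policy: WorkspacePolicy, request: ActionRequest) -> Tuple[bool, str]:
    """Deny by default: every check must pass before the action is allowed."""
    if request.identity not in policy.allowed_identities:
        return False, "unknown identity"
    if request.host not in policy.allowed_hosts:
        return False, "host not on allowlist"
    if request.credential is not None and request.credential not in policy.allowed_credentials:
        return False, "credential not granted"
    return True, "allowed"


# Example policy with made-up names.
policy = WorkspacePolicy(
    allowed_identities=frozenset({"report-agent"}),
    allowed_hosts=frozenset({"api.internal.example"}),
    allowed_credentials=frozenset({"read-only-token"}),
)

if __name__ == "__main__":
    print(authorize(policy, ActionRequest("report-agent", "api.internal.example", "read-only-token")))
    print(authorize(policy, ActionRequest("report-agent", "payments.example", "read-only-token")))
```

The design point is that the agent never holds the allowlists itself. The gate sits outside the application, so a prompt-injected or misbehaving agent can request an action but cannot widen its own permissions.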

The honest caveat is that the 40% number is Microsoft's, measured in Microsoft's environment, on the model Microsoft is selling. It is plausible, since GB300 is a real generational step over H100, but treat it as a vendor benchmark until customer runs catch up. The reporting also does not pin down which exact Claude point releases are GA today, how pricing compares to Bedrock or the first-party API, or how the throughput numbers were normalized for batch size and context length.

The forward-looking read is that Microsoft now has a credible 'Claude or OpenAI, your choice' story inside Foundry, NVIDIA picks up a flagship agentic showcase for GB300, and Anthropic gains a second hyperscaler channel with Azure-grade governance attached. None of these parties needed this deal to survive, which is what makes it a useful tell about where enterprise agent deployments are actually heading.

What others are reporting

Coverage cluster as of 24 hours after publication

Anthropic

Anthropic owns the M365 productivity narrative and is the only principal to publish explicit model-tier guidance (Opus 4.1 / Sonnet 4.5 / Haiku 4.5) with recommended use cases for each.

Claude is available in Microsoft Foundry via serverless deployment, allowing developers to scale while Anthropic manages infrastructure.

Microsoft Azure Blog

Microsoft frames the launch around enterprise procurement: MACC eligibility, serverless billing, Azure-native authentication, and a faster path from agent experimentation to production.

Claude in Microsoft Foundry is now generally available, hosted on Azure, giving teams a faster path from agent experimentation to production.

WindowsNews.AI

Adds preview benchmark data (30% accuracy gains from Claude+Phi-4 chains) and surfaces the Olympus on-device Windows runtime roadmap — two concrete details absent from all first-party posts.

For the first time, organizations can access Claude directly through their Azure console, with the same security posture, identity management, and compliance frameworks they already rely on.

Originally reported by blogs.nvidia.com


Original headline: Anthropic Claude Goes GA on Microsoft Foundry on Azure — First-Ever Deployment on NVIDIA GB300 Blackwell Ultra GPUs With Quantum-X800 InfiniBand