Utopia Tech
Engineering4 min read

Claude in Microsoft Foundry is now generally available

Claude in Microsoft Foundry is the production path enterprises have been asking for: true frontier model choice, Azure-native controls, simplified procurement, and faster time to value. Most enterprise AI projects do not stall because of model quality. They stall because of everything around the model: procurement, governance, networking, and data. Claude in Microsoft Foundry i

UT

Utopia Tech

June 30, 2026 · 4 min read

Share

Claude in Microsoft Foundry is the production path enterprises have been asking for: true frontier model choice, Azure-native controls, simplified procurement, and faster time to value. Most enterprise AI projects do not stall because of model quality. They stall because of everything around the model: procurement, governance, networking, and data.

Claude in Microsoft Foundry is now generally available, hosted on Azure, giving teams a faster path from agent experimentation to production. Enterprises can build with Claude through their existing Azure account , using the authentication, billing, networking, governance, and data controls their teams already trust. Instead of solving for infrastructure, teams can focus on building agentic applications that run their work with Claude, in the environment where they already operate.

This is a real step forward for customers building agentic applications and want to move from AI experimentation to production. Claude brings leading capabilities for coding, agentic workflows, and complex reasoning. Microsoft Foundry brings the enterprise harness to build, evaluate, deploy, and scale those agents on Azure.

Together, they give teams a trusted path to production AI with frontier model quality and the Azure controls they already trust. Today’s announcement builds on the strategic partnership Microsoft, NVIDIA, and Anthropic announced in November 2025 to expand enterprise access to Claude on NVIDIA accelerated computing. Claude runs on NVIDIA Blackwell Ultra systems, connected by InfiniBand networking, bringing the rack-scale AI infrastructure designed for inference performance and efficiency.

Build with Claude through your Azure account Developers can access Claude through the Messages API and use core capabilities including prompt caching, extended thinking, and tool streaming. For teams building agents, Foundry Agent Service uses Claude as the reasoning core to orchestrate multi-step planning, tool use, and task execution across enterprise systems.

Inference is processed in Azure, and customers can choose between Global and US data zones, for teams with data residency requirements. Anthropic operates the inference and is the data processor and SLA provider. Because Claude is available natively through Foundry, teams can work inside the Azure environment they already use.

They can authenticate with Microsoft Entra ID , apply Azure role-based access controls, manage access through existing governance policies, and track usage through familiar Azure management experiences. For high-sensitivity workloads, zero data retention is also available, so prompts and completions are not retained by Anthropic after the API call completes.

For commercial teams, it also simplifies how Claude is purchased and consumed. Claude usage is billed in Claude Consumption Units (CCU), a single, consolidated line on your Azure bill, with MACC drawdown and per-model detail in Foundry unchanged. For many enterprises, that matters as much as model capability.

The barrier to production isn’t only whether a model is powerful enough, it’s whether teams can procure it, govern it, secure it, and operate it at scale inside their existing cloud. With Claude in Foundry, they get frontier capabilities in an Azure environment that aligns with enterprise requirements for security, compliance posture, governance, and data residency.

Running Anthropic’s models on Azure has given us the sustained throughput and reliability our enterprise customers expect. The combination of frontier model quality and enterprise-grade infrastructure is what makes Bolt viable for the Fortune 500. —Gary Ballabio, Vice President, Partnerships, Bolt Customers are already building with Claude in Foundry Enterprises aren’t just running isolated pilots; they’re building production systems and agents that need throughput, reliability, governance, security, and scale.

At NVIDIA, we use autonomous AI agents every day to help our teams move faster and think bigger. Anthropic’s Claude models bring strong reasoning, coding and enterprise capabilities that are valuable for complex technical work. With Claude now available in Microsoft Foundry running on NVIDIA GB300 GPUs, more organizations can run advanced, specialized AI agents with the performance, scale and security needed for production.

—Justin Boitano, Vice President and GM of Enterprise Computing, NVIDIA Our customers describe their tests in plain English, and Momentic runs through the interface to verify everything works before a release ships. We found Claude’s Opus models especially suited to this, and running them on Microsoft Foundry we now serve millions of tokens per minute with the reliability our customers depend on.

—Jeff An, Co-Founder and CEO, Momentic Built for coding, agents, and complex reasoning Claude models are especially well-suited to some of the fastest-growing enterprise AI workloads. For software teams, Claude supports code generation, refactoring, debugging, test creation, and large-scale development workflows. For teams building agents, it powers multi-step reasoning, tool use, planning, and task execution.

For business teams, it supports document-heavy analysis, research synthesis, and complex decision support. In Microsoft Foundry, these capabilities connect to the broader Azure ecosystem . With Foundry Agent Service, teams orchestrate multi-step, goal-driven agents that use Claude as their reasoning core, planning, calling tools, and executing tasks across enterprise systems.

Features like model router enable customers to automatically route queries to the most appropriate Claude model, saving up to 50% while improving user satisfaction . All this governed and monitored by Foundry Control Plane which continuously runs evaluations to ensure agent responses match customer expectations, even blocking responses that violate rules before they reach users.

Originally published at azure.microsoft.com

Share
▸ Want a deeper look?

Talk to an architect about applying this to your stack.

60-minute technical evaluation, no obligation. We'll map the ideas in this article to your environment.

Skip to main content