Home
Azure pricing
Azure OpenAI Service pricing

Azure OpenAI Service pricing

Azure OpenAI Service pricing overview

Unlock the power of large-scale, generative AI models with Azure OpenAI Service, offering the flexibility of both Pay-As-You-Go (PAYG) and Provisioned Throughput Units (PTUs). With PAYG, you can optimize costs by paying only for the resources you use, while PTUs provide throughput with minimal latency variance, making them ideal for scaling your AI solutions. Each model is priced per unit, ensuring a predictable cost structure for your AI deployments. Contact a sales specialist for personalized assistance in understanding how PAYG and PTUs can optimize your AI deployments and pricing.

Explore pricing options

Apply filters to customize pricing options to your needs.

Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on the type of agreement entered with Microsoft, date of purchase, and the currency exchange rate. Prices are calculated based on US dollars and converted using London closing spot rates that are captured in the two business days prior to the last business day of the previous month end. If the two business days prior to the end of the month fall on a bank holiday in major markets, the rate setting day is generally the day immediately preceding the two business days. This rate applies to all transactions during the upcoming month. Sign in to the Azure pricing calculator to see pricing based on your current program/offer with Microsoft. Contact an Azure sales specialist for more information on pricing or to request a price quote. See frequently asked questions about Azure pricing.

Region:

Currency:

US government entities are eligible to purchase Azure Government services from a licensing solution provider with no upfront financial commitment, or directly through a pay-as-you-go online subscription.

Learn more

Important—The price in R$ is merely a reference; this is an international transaction and the final price is subject to exchange rates and the inclusion of IOF taxes. An eNF will not be issued.

Learn more

Important—The price in R$ is merely a reference; this is an international transaction and the final price is subject to exchange rates and the inclusion of IOF taxes. An eNF will not be issued.

Pricing details:

Language models

Models	Context	Input (Per 1,000 tokens)	Output (Per 1,000 tokens)
GPT-3.5-Turbo-0125	16K	$-	$-
GPT-3.5-Turbo-Instruct	4K	$-	$-
GPT-4-Turbo	128K	$-	$-
GPT-4-Turbo-Vision	128K	$-	$-
GPT-4	8K	$-	$-
GPT-4	32K	$-	$-

Legacy Language Models

Models	Context	Input (Per 1,000 tokens)	Output (Per 1,000 tokens)
GPT-3.5-Turbo-0301	4K	$-	$-
GPT-3.5-Turbo-0613	4K	$-	$-
GPT-3.5-Turbo-0613	16K	$-	$-
GPT-3.5-Turbo-1106	16K	$-	$-

Assistants API

Tool	Input
Code Interpreter	$-/session

Inference cost (input and output) varies based on the GPT model used with each Assistant. If your assistant calls Code Interpreter simultaneously in two different threads, this would create two Code Interpreter sessions (2 * $-). Each session is active by default for one hour, which means that you would only pay this fee once if your user keeps giving instructions to Code Interpreter in the same thread for up to one hour.

Base models

Models	Usage per 1,000 tokens
Babbage-002	$-
Davinci-002	$-

Fine-tuning models

Models	Training per compute hour	Hosting per hour	Input Usage per 1,000 tokens	Output Usage per 1,000 tokens
Babbage-002	$-	$-	$-	$-
Davinci-002	$-	$-	$-	$-
GPT-3.5-Turbo (4K)	$-	$-	$-	$-
GPT-3.5-Turbo (16K)	$-	$-	$-	$-

Image models

Models	Quality	Resolution	Price (per 100 images)
Dall-E-3	Standard	1024 * 1024	$-
	Standard	1024 * 1792, 1792 * 1024	$-
Dall-E-3	HD	1024 * 1024	$-
	HD	1024 * 1792, 1792 * 1024	$-
Dall-E-2	Standard	1024 * 1024	$-

Embedding models

Models	Per 1,000 tokens
Ada	$-
text-embedding-3-large	$-
text-embedding-3-small	$-

Speech Models

Models	Price
Models	Whisper	$-/hour
TTS (Text to Speech)	$-/1M characters
TTS HD	$-/1M characters

Azure pricing and purchasing options

Connect with us directly

Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimization and request a custom proposal.

Talk to a sales specialist

See ways to purchase

Purchase Azure services through the Azure website, a Microsoft representative, or an Azure partner.

Explore your options

Additional resources

Frequently asked questions

Frequently asked questions about Azure pricing

Azure OpenAI Service offers pricing based on both Pay-As-You-Go and Provisioned Throughput Units (PTUs). Pay-As-You-Go allows you to pay for the resources you consume, making it flexible for variable workloads. PTUs offers a predictable pricing model where you reserve and deploy a specific amount of model processing capacity. This model is ideal for workloads with consistent or predictable usage patterns, providing stability and cost control.
Azure Products by Region | Microsoft Azure
SLA for Azure AI Services | Microsoft Azure
To learn more about PTUs and Azure Open AI pricing please read PTU documentation or contact our sales specialist

Talk to a sales specialist for a walk-through of Azure pricing. Understand pricing for your cloud solution.

Request a pricing quote

Get free cloud services and a $200 credit to explore Azure for 30 days.

Try Azure for free

Added to estimate. Press 'v' to view on calculator

Azure OpenAI Service pricing

Azure OpenAI Service pricing overview

Explore pricing options