OpenAI: GPT Audio API Pricing
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...
Input / 1M tokens
$2.50
Output / 1M tokens
$10.00
Cost per request (1K in / 500 out)
$0.0075
~$7.50 per 1M tokens mixed
Specifications
Context Window 128K tokens
Max Output Tokens 16,384
Vision Support No
Function Calling No
Streaming Yes
Modality audio
Release Date 2026-01-19
Price history will appear here after the first price change is detected.
More from OpenAI
Input / 1M tokens
$0.60
Output / 1M tokens
$2.40
Context: 128K Streaming
Multimodal
Input / 1M tokens
$5.00
Output / 1M tokens
$30.00
Context: 400K Vision Streaming
Input / 1M tokens
$0.50
Output / 1M tokens
$1.50
Context: 16K Streaming
Input / 1M tokens
$1.00
Output / 1M tokens
$2.00
Context: 4K Streaming
Input / 1M tokens
$3.00
Output / 1M tokens
$4.00
Context: 16K Streaming
Similar Models from Other Providers
Input / 1M tokens
$0.10
Output / 1M tokens
$0.30
Context: 32K Streaming