Audio & Speech Models

Speech-to-text, text-to-speech, and audio processing models.

19 Models
Compare Model Creator Context Length Input Price Output Price Image Generation Video Generation Open Weight
OpenAI Codex Security
OpenAI 1050000 tokens
text: $2.5
text: $15
- - No
Google Gemini 2.5 Flash Native Audio
Google 131072 tokens - - - - No
Google Gemini 2.5 Flash Preview TTS
Google 8192 tokens - - - - No
Google Gemini 2.5 Pro Preview TTS
Google 8192 tokens - - - - No
OpenAI GPT-4o Audio Preview
OpenAI 128000 tokens
audio: $40
text: $2.5
audio: $80
text: $10
- - No
OpenAI GPT-4o Mini Audio Preview
OpenAI 128000 tokens
audio: $10
text: $0.15
audio: $20
text: $0.6
- - No
OpenAI GPT-4o Mini Realtime Preview
OpenAI 16000 tokens
audio: $10
text: $0.6
audio: $20
text: $2.4
- - No
OpenAI GPT-4o Mini Transcribe
OpenAI 16000 tokens
audio: $3
text: $1.25
text: $5
- - No
OpenAI GPT-4o Mini TTS
OpenAI 2000 tokens
text: $0.6
audio: $12
- - No
OpenAI GPT-4o Realtime Preview
OpenAI 32000 tokens
audio: $40
text: $5
audio: $80
text: $20
- - No
OpenAI GPT-4o Transcribe
OpenAI 16000 tokens
audio: $6
text: $2.5
text: $10
- - No
OpenAI GPT-4o Transcribe Diarize
OpenAI 16000 tokens
audio: $6
text: $2.5
text: $10
- - No
OpenAI GPT-5.3 Instant
OpenAI 400000 tokens
text: $1.75
text: $14
- - No
OpenAI GPT-Audio
OpenAI 128000 tokens
audio: $32
text: $2.5
audio: $64
text: $10
- - No
OpenAI GPT-Audio 1.5
OpenAI 128000 tokens
audio: $32
text: $2.5
audio: $64
text: $10
- - No
OpenAI GPT-Audio Mini
OpenAI 128000 tokens
text: $0.6
text: $2.4
- - No
OpenAI GPT-Realtime 1.5
OpenAI 32000 tokens
audio: $32
image: $5
text: $4
audio: $64
text: $16
- - No
OpenAI TTS-1
OpenAI - - - - - No
OpenAI TTS-1 HD
OpenAI 100000 tokens
text: $30
audio: $30
- - No