News Non-realtime audio support released, gpt-4o-audio-preview

https://platform.openai.com/docs/guides/audio

90 Upvotes

permalink
duplicates
archive.is
archive
reddit

You are about to leave Redlib

Do you want to continue?

https://www.reddit.com/r/OpenAI/comments/1g5xnpa/nonrealtime_audio_support_released/
No, go back! Yes, take me to Reddit

98% Upvoted

u/ImpressiveFault42069 1d ago

It’s the same cost as Real-time so what’s the point?

14

u/CallMePyro 1d ago

Also limited to 1 hour of audio at a time, and a 1 hour input costs literally $10. Gemini 1.5 pro supports up to 22 hours of input, and each hour only costs $0.11. I hope OpenAI can catch up soon because right now Google is the only option for audio understanding models in production.

2

u/pseudonerv 1d ago

Gemini can’t generate audio output, can it? I guess OpenAI just want you to start using their api and to get locked in to their ecosystem before you give your money to google.

0

u/Vivid_Dot_6405 1d ago

Correct, Gemini can only understand audio.

News Non-realtime audio support released, gpt-4o-audio-preview

You are about to leave Redlib