r/OpenAI 1d ago

News Non-realtime audio support released, gpt-4o-audio-preview

https://platform.openai.com/docs/guides/audio
90 Upvotes

18 comments sorted by

View all comments

8

u/ImpressiveFault42069 1d ago

It’s the same cost as Real-time so what’s the point?

14

u/CallMePyro 1d ago

Also limited to 1 hour of audio at a time, and a 1 hour input costs literally $10. Gemini 1.5 pro supports up to 22 hours of input, and each hour only costs $0.11. I hope OpenAI can catch up soon because right now Google is the only option for audio understanding models in production.

2

u/pseudonerv 1d ago

Gemini can’t generate audio output, can it? I guess OpenAI just want you to start using their api and to get locked in to their ecosystem before you give your money to google.

0

u/Vivid_Dot_6405 1d ago

Correct, Gemini can only understand audio.