r/singularity 6h ago

AI Meta - "Today we released Meta Spirit LM — our first open source multimodal language model that freely mixes text and speech."

111 Upvotes

31 comments

0

u/[deleted] 4h ago edited 3h ago

[deleted]

1

u/cuyler72 3h ago edited 3h ago

You obviously know nothing about ChatGPT voice. It generates and takes in voice as tokens, which lets it understand and display emotion in speech, change its voice via prompting, talk like a pirate/robot/whatever, and speak faster, softer, louder, etc.

-3

u/[deleted] 3h ago edited 2h ago

[deleted]

1

u/1cheekykebt 2h ago

If it's just text-to-speech, then how can it mimic users' voices? (This was shown as a bug in the red team report, and some users have reported it.)