Viewing a single comment thread. View all comments

kamenpb t1_j7huq7e wrote

I think whoever releases their model as a smartphone app that includes an option for voice synthesis will win.
A model that's slightly more agentic than chatgpt (sparrow seems to be), with a solid voice (think elevenlabs), and available on your phone = winner.
Google seems like they're in a better position to ultimately provide that at scale.
But if their plan this year is to release a slightly better version of AI kitchen, then it's no contest.. people will just keep using ChatGPT.

22

HumanSeeing OP t1_j7hvf56 wrote

Absolutely agreed, that would be amazing and life changing to have such tech!

3

Talkat t1_j7j2pgq wrote

Agreed... However I'm surprised such a thing doesn't exist yet.

OpenAI has whisper which is great an voice to text (and open source)

The text can then be input to GTP3

And then the result spoken via elevenlabs.

This is just plugging some apis together in an app...

Why hasn't this been done

3

citizentim t1_j7je2lq wrote

like, literally, wait a month. The speed at which everything is moving-- I'm sure it's being worked out.

3

Talkat t1_j7mwqjd wrote

My point is more that for an experienced developer it would take them a day. For an inexperienced developer less than a week.

I guess my real question is how come I'm not doing this?!

2

Feebleminded10 t1_j7i8o9n wrote

Yeah i want a customized voice i want an African female voice with a British accent.

2

Talkat t1_j7j2jgs wrote

Better yet you can find your "ideal" voice on YouTube and then clone that specific voice

3

Redditing-Dutchman t1_j7klnz0 wrote

There is only one acceptable voice for an AI assistant and thats Mr Gutsy from Fallout 4 ;)

1