22
votes
OpenAI insists it's not launching a search engine nor GPT-5 on Monday
Link information
This data is scraped automatically and may be incorrect.
- Title
- OpenAI downplays rumors of web search engine, GPT-5
- Published
- May 11 2024
- Word count
- 652 words
Rumor has it they're unveiling some sort of conversational, pure-audio model (for ex, leaked code makes reference to some kind of phone call capability). The current ChatGPT app already has great speech recognition thanks to their Whisper transcription model, and it responds with lifelike vocal synthesis, but the content gets transcribed to text in the middle both ways in order to get processed by GPT-4. An audio-only model would be able to process speech directly and respond in kind, applying the conversational interactivity of text language models to the kind of raw-waveform generation that powers music synthesizers like Jukebox and Udio. It might also imply the ability to recognize and respond to the emotional content of speech -- the speech generator from ElevenLabs, for example, is already spooky-good at being able to imbue spoken text with whatever emotional qualities the text itself conveys. If such a model were good enough and fast enough, it would seem eerily like talking to an actual person. This could be revolutionary for agent-like models -- imagine Google Duplex on steroids. And there's a reason a lot of leakers draw comparisons to Her.
The closest thing out there to solid information on this is from AI Explained. He covered several papers that dropped in the last two weeks that shed some light on recent developments. Also covers MedGemini which is the most badass application of AI I've yet seen.
New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)
there's a huge market for Her type interactions. There are a few "girl / boy friend" chat bots. This is the beginning of the end of dating
I don't know about it being that extreme. Seems more like a way for some people to opt out.
I feel like that market is made up almost entirely of incels. The rest of us will be fine.
In the same vein though, there's probably something coming down the pipe for an AI therapist/confiding model that would be much more general audience. I'm not sure how one would qualify a chatbot for that level of social work but the market is there and it would seem to be where the wind is blowing in regards to capabilities vs demographic vs margin.
But why speculate when we will find out tomorrow? For the clicks, I guess.
I am really impressed by chatgpt's speech recognition, do you know of any android keyboard app that does voice transcription powered by the same engine? ive been using gboard and it's honestly atrocious in comparison
These LLMs would absolutely shine in replacing mobile keyboard typo corrections, and voice to text.