22 votes

OpenAI insists it's not launching a search engine nor GPT-5 on Monday

9 comments

  1. [9]
    Jordan117
    (edited )
    Link
    Rumor has it they're unveiling some sort of conversational, pure-audio model (for ex, leaked code makes reference to some kind of phone call capability). The current ChatGPT app already has great...

    Rumor has it they're unveiling some sort of conversational, pure-audio model (for ex, leaked code makes reference to some kind of phone call capability). The current ChatGPT app already has great speech recognition thanks to their Whisper transcription model, and it responds with lifelike vocal synthesis, but the content gets transcribed to text in the middle both ways in order to get processed by GPT-4. An audio-only model would be able to process speech directly and respond in kind, applying the conversational interactivity of text language models to the kind of raw-waveform generation that powers music synthesizers like Jukebox and Udio. It might also imply the ability to recognize and respond to the emotional content of speech -- the speech generator from ElevenLabs, for example, is already spooky-good at being able to imbue spoken text with whatever emotional qualities the text itself conveys. If such a model were good enough and fast enough, it would seem eerily like talking to an actual person. This could be revolutionary for agent-like models -- imagine Google Duplex on steroids. And there's a reason a lot of leakers draw comparisons to Her.

    14 votes
    1. Amarok
      Link Parent
      The closest thing out there to solid information on this is from AI Explained. He covered several papers that dropped in the last two weeks that shed some light on recent developments. Also covers...

      The closest thing out there to solid information on this is from AI Explained. He covered several papers that dropped in the last two weeks that shed some light on recent developments. Also covers MedGemini which is the most badass application of AI I've yet seen.

      New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)

      8 votes
    2. [4]
      Fin
      Link Parent
      there's a huge market for Her type interactions. There are a few "girl / boy friend" chat bots. This is the beginning of the end of dating

      there's a huge market for Her type interactions. There are a few "girl / boy friend" chat bots. This is the beginning of the end of dating

      3 votes
      1. teaearlgraycold
        Link Parent
        I don't know about it being that extreme. Seems more like a way for some people to opt out.

        This is the beginning of the end of dating

        I don't know about it being that extreme. Seems more like a way for some people to opt out.

        9 votes
      2. [2]
        babypuncher
        Link Parent
        I feel like that market is made up almost entirely of incels. The rest of us will be fine.

        I feel like that market is made up almost entirely of incels. The rest of us will be fine.

        4 votes
        1. moocow1452
          Link Parent
          In the same vein though, there's probably something coming down the pipe for an AI therapist/confiding model that would be much more general audience. I'm not sure how one would qualify a chatbot...

          In the same vein though, there's probably something coming down the pipe for an AI therapist/confiding model that would be much more general audience. I'm not sure how one would qualify a chatbot for that level of social work but the market is there and it would seem to be where the wind is blowing in regards to capabilities vs demographic vs margin.

          2 votes
    3. skybrian
      Link Parent
      But why speculate when we will find out tomorrow? For the clicks, I guess.

      But why speculate when we will find out tomorrow? For the clicks, I guess.

      2 votes
    4. [2]
      RheingoldRiver
      Link Parent
      I am really impressed by chatgpt's speech recognition, do you know of any android keyboard app that does voice transcription powered by the same engine? ive been using gboard and it's honestly...

      I am really impressed by chatgpt's speech recognition, do you know of any android keyboard app that does voice transcription powered by the same engine? ive been using gboard and it's honestly atrocious in comparison

      2 votes
      1. blindmikey
        Link Parent
        These LLMs would absolutely shine in replacing mobile keyboard typo corrections, and voice to text.

        These LLMs would absolutely shine in replacing mobile keyboard typo corrections, and voice to text.

        2 votes