News Technology

OpenAI’s chatbot turns even more human-like, can now talk and see

The company behind the popular ChatGPT service has added new features that let users interact with its artificial intelligence bot using voice and images.

OpenAI, the San Francisco-based artificial intelligence powerhouse, has released a new version of its chatbot that can talk and see. The chatbot, called ChatGPT, can now interact with users using spoken words and respond to images uploaded by them.

The new version of ChatGPT, which was released on Monday, adds two new features that make it more human-like than ever before. First, it can now talk back to users using synthetic voices that reportedly sound more human than other digital assistants. Users can choose from five different voice options, including male and female voices. Second, it can now respond to images uploaded by users. For example, users can send a photo of the inside of their refrigerator, and ChatGPT can suggest dishes they can cook with the ingredients they have.

ChatGPT is powered by a large language model, or LLM, that has learned to generate natural language by analysing billions of words from the internet. With the addition of support for voice, ChatGPT may seem like it’s similar to voice assistants like Siri and Alexa. However, it’s actually different because it’s powered by LLM tech and can therefore handle a wide range of topics and tasks without being pre-programmed. It can write and (now) even read out emails, poetry, term papers, and jokes on the fly.

Use your voice to engage in a back-and-forth conversation with ChatGPT. Speak with it on the go, request a bedtime story, or settle a dinner table debate.

Sound on 🔊 pic.twitter.com/3tuWzX0wtS

— OpenAI (@OpenAI) September 25, 2023

OpenAI said that these features are designed to make ChatGPT more accessible and useful for everyone. It also argues that ChatGPT’s voices are more convincing than others used with popular digital assistants. The tool can be seen as a more natural way of interacting with its chatbot, especially for people who are not comfortable with typing or reading.

Users can get started with voice by heading to Settings > New features on the mobile app, then checking the option toopt into voice conversations.

Meanwhile, the image feature is also super handy. For instance, users can upload a photograph, chart, or diagram, and ChatGPT can provide a detailed description of the image and answer questions about its contents. This could be a useful tool for people who are visually impaired or want to learn more about something they see.

Also read | Here are 7 things that you can do with the iPhone 15 Pro’s brand-new USB-C port

While OpenAI had demonstrated the image tool back in the spring, it said it held off from a public release due to fears of misuse. The company feared that the product could turn into a face-recognition service to quickly identify people in photos, among other things.

The new version of ChatGPT is being made available to everyone who subscribes to ChatGPT Plus, a service that costs $20 a month, and Enterprise, over the next two weeks. However, the voice feature only works on iPhones, iPads, and Android devices. The image feature works on both web and mobile devices.

OpenAI has been releasing its AI tools at a rapid pace in recent weeks. Previously, it unveiled a new version of its DALL-E image generator, which it has integrated into ChatGPT so users can also ask the chatbot to create images for them.

Source:indianexpress.com

Leave a Reply

Your email address will not be published. Required fields are marked *