OpenAI releases new version of ChatGPT with new voice and image input features

OpenAI Releases New Version of ChatGPT, Adds Voice and Image Inputs

Recently.artificial intelligence (AI)research organization OpenAI Announcing a new version of the chatbot ChatGPT, and added picture and voice input features. This update will be available in the next two weeks to ChatGPT Plus subscribers are rolling out, and for others, will soon be able to use these new features as well.

First of all, the new voice input feature is similar to a voice assistant on a cell phone. Users simply press a button and speak their question, and ChatGPT converts it to text, generates the answer and then plays it back to the user in voice form.OpenAI indicated that this interaction is more natural and convenient. The quality of the answers will also be higher due to the technical advantages of the Large Language Model (LLM). At the same time, OpenAI A new text-to-speech model has also been developed that generates a human voice similar to the sample voice based on a few seconds of sample speech. Users can choose from five options for ChatGPT's voice, and there are more potential uses for the model. For example, OpenAI is working with Spotify to translate podcasts into other languages while preserving the voice of the podcast host. However, there are some risks associated with this model, such as the possibility of it being used maliciously to impersonate public figures or commit fraud. As a result, OpenAI emphasizes that the model will not be made widely available, but will be subject to strict controls and limitations.

Second, the new image input function is similar to the Google Internet company Lens. users can take pictures of things they are interested in and upload them to ChatGPT, which tries to recognize what they want to ask and give them an answer. Users can also use the app's drawing tools to help express their questions, or communicate with voice or text input.The advantage of ChatGPT is that it allows for multiple rounds of conversations, rather than a one-time search. If the user is not satisfied with the answer or wants more information, they can continue to ask ChatGPT questions to get a more accurate and comprehensive answer. However, there are some potential problems with image search. For example, when processing images of people, OpenAI limits ChatGPT's ability to analyze and directly evaluate people, both for accuracy and privacy. This means that it is currently not possible to know who a person is by uploading his/her photo.

Since launching ChatGPT in early 2022, OpenAI has been working to add more features and capabilities to its bots. This update is an attempt by the company to find a balance between keeping things safe and within reasonable boundaries. But as more and more people use voice control and image search, and as ChatGPT evolves into a truly multimodal and useful virtual assistant, maintaining that balance will become increasingly difficult.

This article comes from users or anonymous contributions, does not represent the position of Mass Intelligence; all content (including images, videos, etc.) in this article are copyrighted by the original author. Please refer to this site for the relevant issues involvedstatement denying or limiting responsibilityPlease contact the operator of this website for any infringement of rights (Contact Us) We will handle this as stated. Link to this article: https://dzzn.com/en/2023/1498.html