ChatGPT Expands with Voice and Image Abilities

The artificial intelligence (AI) tool ChatGPT has added new abilities that include voice and image recognition.

The changes permit some users to ask ChatGPT questions aloud and receive spoken answers. In addition, the tool can recognize images and provide information about what is in them.

ChatGPT is a chatbot, a computer-powered tool designed to interact smoothly with humans and perform high-level writing. The technology is also known as “generative AI.”

The creator of ChatGPT, OpenAI, announced the tool’s latest additions, or upgrades, this week. Currently, the new voice and image upgrades are only available to users of ChatGPT’s Plus and Enterprise services.

ChatGPT’s basic service, which runs on GPT-3.5, is free for all users. ChatGPT Plus costs $20 per month. The Enterprise service is designed for businesses, with costs tied to the services each business uses.

OpenAI explained that ChatGPT Plus and Enterprise users would gain access to the voice and image additions over the next two weeks. The upgraded tools would be made available to other groups of users, including developers, “soon after.” The voice and image upgrades will also be added to devices using the iOS and Android systems in the near future.

The company said ChatGPT’s new voice control is designed to let users communicate with the AI tool naturally, much as they would speak with a human. But it noted the chatbot can do more than answer questions. It can also tell a story to children or provide detailed instructions for making or building something.

Users can choose different voices they want the chatbot to use. The company said it worked closely with professional voice actors to make the interactions more realistic and personal.

Voice interaction of the kind added in the ChatGPT upgrade already exists in many voice assistant systems. These include Amazon’s Alexa, Alphabet’s Google Assistant, Apple’s Siri, and others. American software maker Microsoft added voice controls to its new ChatGPT-powered Bing search engine earlier this year.

Another notable change to the ChatGPT tool is image recognition. This permits users to upload a photo to the system and then get information about what is contained in the picture.

For example, the company says a user could take a picture of what is currently available in their refrigerator. After entering the photo into ChatGPT, the tool could suggest dinner possibilities based on what the person has. The system could also provide step-by-step instructions for preparing the meal.

Another example given is a parent who takes a picture of a child’s math problem and then asks for advice on explaining the solution to the child. Users can even mark an area of the image, for example with a circle, to get more specific information or help with that element.

Along with its announcement, OpenAI issued another warning about how its ChatGPT tool can easily get things wrong. It noted that because the system is trained using massive amounts of publicly available information, it can return results that are false, outdated or discriminatory.

The company urged all its users to watch out for misinformation and to attempt to verify the information provided by chatbots.

OpenAI announced its AI technology is also being used by the digital music service Spotify. ChatGPT is being used to power a system designed to permit Spotify podcasters to translate their shows into different languages. The translations are completed in the podcasters’ own voice in an effort to make them sound more natural, OpenAI said.

Spotify says the first languages to be added in the coming weeks will be Spanish, French and German.
