OpenAI Technology Live Episode 6: ChatGPT "Open Your Eyes and See the World" AI Companion/AI Education New Benchmark?
Katlyn30590
发表于 6 일전
1136
0
0
On the sixth day of the technology sharing day, OpenAI provided something closer to the "heart" - ChatGPT opens advanced voice mode: real-time video calls, screen sharing, and image uploading.
Why is it said to be closer to the 'heart'?
OpenAI CEO Altman previously revealed in an interview with Salesforce that his favorite AI movie is "Her" (the story of a man falling in love with his AI virtual assistant), and "the idea of a conversational language interface has incredible foresight." The Information reported that Altman hopes to eventually develop a virtual assistant that can respond quickly like the AI assistant in the movie.
The robot girlfriend in Her represents the ultimate form of embodied intelligence, which can interact with humans without barriers.
Previously, ChatGPT's DAN mode (short for Do anything now) allowed AI to converse with users in a more casual way, and its emphasis on "human touch" has been stunning. It not only enables low latency communication, but also imitates human tone and provides emotional value. This time, ChatGPT not only enables listening and speaking, but also unlocks visual abilities, allowing users to "open their eyes and see the world" through the camera.
In this live sharing session, CEO Sam Altman did not appear. Instead, four employees including Kevin Weil, OpenAI's Chief Product Officer, Jackie Shannon, OpenAI's Product Manager, Michelle Qin, and Rowan Zellers, members of OpenAI's multimodal technology team, introduced the updated features.
The real-time video call function in advanced voice mode is the most outstanding. After the OpenAI team members greeted ChatGPT video and got to know each other, someone asked: What is the name of the colleague with reindeer antlers? ChatGPT provided accurate answers using Santa Claus's limited voice, demonstrating their "memory" ability.
Next, the team demonstrated how ChatGPT can teach people how to operate a hand brewed coffee device. Just make a "video call" to ChatGPT, and it can teach you step by step based on the equipment in front of you. Throughout the entire demonstration, ChatGPT's voice was natural and friendly, adjusting its tone and even laughing like a human.
The screen sharing function allows ChatGPT to "see" your screen through screen sharing, which is also a real-time video understanding ability. Users only need to click on the advanced voice mode icon in the bottom right corner and select Share Screen from the drop-down menu to receive targeted assistance.
After successfully sharing with OpenAI team members, ChatGPT browsed their messages and requested guidance to reply. ChatGPT showed a "high emotional intelligence" side and suggested praising the other party's Christmas decorations.
It is reported that the advanced voice mode supports over 50 languages, 9 realistic output voice options, and each voice has its own unique tone and features. And the GPT-4o behind it can not only convert speech into text, but also understand and label other functions of audio, such as breathing and emotion.
ChatGPT, which supports over 50 languages, is able to understand real-world scenarios in real-time. This not only greatly enhances the experience of ChatGPT as an AI companion tool, but also demonstrates a more efficient and powerful AI education tool.
The above features will be launched in the ChatGPT mobile app from today onwards, and will be open to all team users as well as most Plus and Pro users in the next week.
CandyLake.com is an information publishing platform and only provides information storage space services.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
You may like
- AI Agents: A New Track for Technology Companies to Compete in
- 45 billion education technology giant invites Shen Teng to sell equipment
- What do you think of Trump's return to power for American technology companies?
- Most of the "Seven sisters of Science and Technology" rose, and Nvidia's market value increased by 1.2 trillion yuan overnight! Trump Media Technology Falls Over 8%! Microsoft releases ultra convenient cloud PC
- Apple Pro Display XDR 2 may adopt the same quantum dot display technology as MacBook Pro
- Microchip Technology suspends application for chip bill related subsidies
- NIO Technologies increases capital to 18 billion yuan, with a growth rate of 200%
- NIO Technologies increases capital to 18 billion yuan, with a growth rate of 200%
- Hesai Technology's Q3 revenue increased by 21.1% year-on-year
- The tech industry is shaking! OpenAI whistleblower killed himself by explosion!
-
12월 18일, 중국국가약품감독관리국은 EliLillyandCompany (이하"예래"로 략칭함.) 의 알츠하이머병요법을 비준하고 기능달 & amp;reg;(도네 단항 주사액, 4주마다 정맥 주입) 성인이 알츠하이머병으로 인한 경도 ...
- 我是来围观的逊
- 어제 20:25
- Up
- Down
- Reply
- Favorite
-
"브로드컴의 시가총액이 조 달러를 돌파한 맞춤형 AI 칩이 엔비디아를'호칭'할 수 있을까?" 엔비디아, 인텔이 주로 컴퓨팅 칩을 생산하는 것과 달리 브로드컴은 주로 네트워크 연결에 사용되는 칩 제품을 생산한다. ...
- 我是来围观的逊
- 그저께 17:58
- Up
- Down
- Reply
- Favorite
-
신경보소식: 12월 18일, 례래 공식위챗공중번호는 그 알츠하이머병요법 도네단항주사액 (상품명기 능달 & amp; reg;,4주마다 정맥주입) 국가약품감독국의 비준을 받아 성인이 알츠하이머병으로 인한 경도인지기능장 ...
- oralpapapa
- 어제 17:07
- Up
- Down
- Reply
- Favorite
-
[소식에 따르면 머스크는 TSMC 회장 위철가와 만나 로봇이 테슬라의 미래의 중심이라고 밝혔다.] 보도에 따르면 테슬라 CEO 엘론 머스크는 지난주 미국에서 TSMC 회장 위철가를 만났다.위철가는 "전 세계에서 가장 ...
- 明绍宗朱聿键鼻
- 그저께 15:19
- Up
- Down
- Reply
- Favorite