Is OpenAI's first AI assistant product for the 'next major breakthrough' or will it be released in January next year to revolutionize human-computer interaction?
芊芊551
发表于 어제 11:06
119
0
0
According to media reports, OpenAI is preparing to launch a new AI assistant product codenamed "Operator" that can automatically perform various complex operations, including coding, booking travel, and automated e-commerce shopping. According to internal employee reports, OpenAI's leadership is expected to release the product in January 2025, initially as a research preview and development tool, with an open API interface for developers.
According to reports, OpenAI has been conducting several research projects related to intelligent agents. One of the sources stated that the closest thing to completion will be a universal tool for executing tasks in a web browser.
An AI agent is an intelligent entity that can perceive the environment, make decisions, and perform actions. It has the ability to gradually achieve given goals through independent thinking and calling tools. It can provide personalized applications for the C-end and cost reduction and efficiency improvement solutions for the B-end. For ordinary users, the core function of an AI assistant is to autonomously operate the phone and assist in completing complex reasoning tasks.
OpenAI CEO Altman has already revealed his intention to leave. A few weeks ago, he stated on the "Ask Me Anything" forum on Reddit, "We will have better and better models, but I believe the next major breakthrough will be AI assistants." At OpenAI's press conference before the company's annual development day last month, Kevin Weil, the company's Chief Product Officer, said, "I think 2025 will be the year when Agent systems finally enter the mainstream
From OpenAI's perspective, it is facing increasing pressure in the commercialization process, and the gradual improvement of ChatGPT may not be able to attract users to pay higher prices. Executives urgently need a breakthrough product to prove that the huge investment in AI development is worthwhile.
At present, OpenAI has open-source a multifunctional collaborative AI agent called Swarm, which can create multiple agents to work together more efficiently to complete tasks. Its GPT o1 model enhances its reasoning ability, making significant progress in solving complex problems and natural user interaction, and making it more suitable for AI agent scenarios.
AI assistants are regarded as the core foundation leading to AGI, and in the era where hardware manufacturers always refer to AI, AI assistants may become a breakthrough point for terminal intelligence. Yongxing Securities stated that AI agents may grasp the new entry point of mobile internet, and the traffic distribution pattern is expected to reshape the AI agent intelligent agent. Due to its strong interactivity and convenience, it may be able to break down the natural barriers between different apps on the same terminal.
According to incomplete analysis by the Science and Technology Innovation Board Daily, top domestic and foreign manufacturers are competing to launch AI assistant products——
Microsoft recently quietly opened sourced the AI tool OmniParser, which can help users create personalized agents to operate personal computers; On October 22nd, Microsoft announced the integration of 10 autonomous AI agents in Dynamics 365, supporting OpenAI's latest model o1, with self-learning capabilities and the ability to automatically execute complex cross platform business; In September, Microsoft launched a benchmark framework called Windows Agent Arena, which also falls under the category of AI assistant development.
According to The Information, Google plans to preview its large-scale action model "Project Jarvis" in December, which will help users perform tasks such as "collecting research, purchasing products, or booking flights".
On October 22nd, Anthropic iterated a new feature for the large model Claude - Computer Use, allowing AI to manipulate computers like humans. Claude 3.5 Sonnet is the first model to support computer control, capable of simulating human computer operations, including moving the cursor, clicking buttons, and inputting text.
Apple has chosen to integrate Siri with ChatGPT to achieve smarter human-computer interaction. Some netizens have also discovered that Apple has quietly released two implementation versions of Ferret UI (based on Gemma 2B and Llama 8B respectively), which is a technology released by Apple in May this year that allows AI to understand mobile phone screens.
Huawei has released a new research result that allows AI to operate mobile phones like humans. The relevant team has proposed a mobile phone control architecture: Lightweight Multi modal App Control (LiMAC).
Chinese unicorn enterprise Zhipu AI has launched the AI assistant tool AutoGLM, which does not require manual operation. Users can speak into their phones (give commands) and automatically open various apps on their phones to shop online, order takeout, book high-speed rail tickets, even send WeChat messages, grab red envelopes, comment on friend circles, organize notes and generate strategies and summarize papers.
CITIC Securities stated that terminal AI assistant technologies such as AutoGLM will bring a shorter path of interaction, and the ability to accept voice commands and automatically complete complex operations will bring great convenience to consumers. It is expected to become a highlight feature of AI terminals and attract consumers to upgrade and replace them.
Huatai Securities also stated that the implementation of AI assistants will bring multiple levels of industry opportunities, among which Agent+terminals are expected to drive the transformation of human-computer interaction. In addition to changes in terminal sales volume and price, it may have a more profound impact on the business model of terminal applications.
CandyLake.com is an information publishing platform and only provides information storage space services.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
You may like
- Xiaopeng Motors: The first product in the MONA series named Xiaopeng M03
- Before the United States! What is the safety and efficacy of the world's first weekly insulin injection approved in China
- The world's first! French regulatory authorities sue NVIDIA
- Alibaba launches the first AI picture book tool in China to care for children with autism
- JD Group and Hailan Group have reached a strategic cooperation and will open the first JD Aolai offline store
- NIO's first fourth substitution power station in Beijing officially goes online
- Baidu's Liang Zhixiang: From Human Computer Interaction to Everyone Dialogue, AI Agents are Driving Change
- New breakthrough in AI chips! SK Hynix, the world's first to achieve mass production of 12 layer HBM3E products, saw its stock price surge by nearly 9%
- The first new force in car manufacturing! Ideal car's one millionth complete vehicle rolled off the production line
- Mingchuang Youpin Ye Guofu: Retail industry should not compromise on prices. Going abroad should first go to Southeast Asia and then to Europe and America
-
"영비릉: 2024회계연도 영업수입 동기대비 8% 감소"영비릉은 2024회계연도 재무제보를 발표했다.2024 회계연도 매출은 149억5500만 유로로 전년 동기 대비 8% 감소했습니다.이익은 31억 500만 유로입니다.이익률은 ...
- 勇敢的树袋熊1
- 3 일전
- Up
- Down
- Reply
- Favorite
-
계면신문기자 장우발 4분기의 영업수입이 하락한후 텐센트음악은 다시 성장으로 돌아왔다. 11월 12일, 텐센트음악은 최신 재보를 발표했다.2024년 9월 30일까지 이 회사의 3분기 총수입은 70억 2천만 위안으로 전년 ...
- 勇敢的树袋熊1
- 그저께 15:27
- Up
- Down
- Reply
- Favorite
-
본사소식 (기자 원전새): 11월 14일, 다다그룹 (나스닥코드: DADA) 은 2024년 3분기 실적보고를 발표했다. 수치가 보여준데 따르면 고품질발전전략에 지속적으로 전념하고 사용자체험을 끊임없이 최적화하며 공급을 ...
- 家养宠物繁殖
- 어제 15:21
- Up
- Down
- Reply
- Favorite
-
11월 12일 소식에 따르면 소식통에 따르면 아마존은 무료스트리밍서비스 Freevee를 페쇄하고 일부 종업원과 프로를 구독서비스 Prime Video로 이전할 계획이다. 올해 초 아마존이 내놓은 몇 편의 대형 드라마의 효 ...
- 度素告
- 그저께 13:58
- Up
- Down
- Reply
- Favorite