Google's new move to challenge OpenAI: a major generative AI update launches the Veo 2 video model and the latest version of Imagen 3
wdx5566
Posted the day before yesterday at 09:32
Google DeepMind, the flagship AI research laboratory of Google (GOOGL, stock price $196.66, market value $2,407.3 billion), significantly upgraded its AI-driven content generation tools on Monday, launching the Veo 2 video generation model and an enhanced version of the Imagen 3 image model in a challenge to OpenAI's leading position in AI image and video generation. Google stated that these updates are expected to transform creative workflows, giving video and image creators greater realism and more customizable experiences.
According to Google, Veo 2 is a video generation tool that can produce high-quality videos across a wide range of subjects and styles. Google stated in its blog that the model excels in realism, capturing details such as human expressions and cinematic effects. Its improved understanding of physics and cinematography lets users generate striking content, including tracking shots and wide-angle compositions.
For example, Veo 2 understands the language of filmmaking: users can request a particular style, specify a lens, and suggest cinematic effects. Veo 2 can produce videos at up to 4K resolution and up to several minutes in length. Notably, that is 4 times the resolution of OpenAI's Sora model and more than 6 times its video duration.
However, these advantages are still theoretical for now. In VideoFX, Google's experimental video creation tool, videos generated by Veo 2 are limited to 720p resolution and 8 seconds in length. (In contrast, Sora can output clips at up to 1080p and up to 20 seconds long.)
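The resolution and duration figures above can be checked with simple arithmetic. The sketch below assumes the conventional pixel dimensions for each label (3840x2160 for 4K, 1920x1080 for 1080p, 1280x720 for 720p), which the article does not state, and takes the 20-second and 8-second limits from the text.

```python
# Pixel dimensions are the conventional values for each label (an assumption;
# the article only names the labels, not the exact dimensions).
RESOLUTIONS = {
    "4K": (3840, 2160),
    "1080p": (1920, 1080),
    "720p": (1280, 720),
}

SORA_MAX_SECONDS = 20     # Sora's maximum clip length, from the article
VIDEOFX_MAX_SECONDS = 8   # Veo 2 limit inside VideoFX, from the article


def pixel_count(label: str) -> int:
    """Total number of pixels in one frame at the given resolution label."""
    width, height = RESOLUTIONS[label]
    return width * height


def pixel_ratio(a: str, b: str) -> float:
    """How many times more pixels resolution `a` has than resolution `b`."""
    return pixel_count(a) / pixel_count(b)


if __name__ == "__main__":
    # Claimed capability: 4K versus Sora's 1080p.
    print("4K vs 1080p:", pixel_ratio("4K", "1080p"))        # 4.0
    # Current VideoFX limit: Sora's 1080p versus Veo 2's 720p.
    print("1080p vs 720p:", pixel_ratio("1080p", "720p"))    # 2.25
    # "More than 6 times" Sora's duration means longer than this many seconds.
    print("6 x Sora duration:", 6 * SORA_MAX_SECONDS, "s")   # 120 s
```

Under these assumptions the "4 times" claim corresponds to pixel count rather than frame width, and "more than 6 times" the duration means clips longer than two minutes, while the current VideoFX output is smaller and shorter than Sora's on both measures.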
Google stated that although video generation models often "hallucinate" unwanted details such as extra fingers or unexpected objects, Veo 2 does so less often and therefore produces more realistic results. In addition, videos generated by Veo 2 carry an invisible SynthID watermark that marks them as AI-generated content, reducing the risk of misuse or misattribution.
DeepMind's Vice President of Product, Eli Collins, told the media that as the model gradually becomes ready for large-scale use, Google will provide Veo 2 through its Vertex AI developer platform.
Developers and creators can currently access the tool through Google Labs, and it is expected to be widely integrated into platforms such as YouTube Shorts by 2025. Meanwhile, the Imagen 3 model has been enhanced in terms of image composition and detail accuracy, supporting various styles from realistic to abstract, generating richer textures, and responding more faithfully to user prompts.
Currently, Imagen 3 has been launched in over 100 countries through Google Labs' ImageFX tool, allowing global users to experiment with its cutting-edge features.
In addition, Google has also launched Whisk, a creative tool that combines the visual analysis capabilities of Imagen 3 and Gemini. Users can input images, generate detailed text descriptions, remix styles, or design personalized works such as digital dolls or enamel badges.
Google explained that Whisk combines the Imagen 3 model with Gemini's visual understanding and descriptive capabilities. The Gemini model automatically generates a detailed text description of each image the user supplies and passes these descriptions to Imagen 3. This process lets users remix themes, scenes, and styles in interesting new ways.
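The describe-then-generate flow can be sketched in a few lines. This is only an illustration of the pipeline shape as described above, not Google's implementation: `Captioner`, `Generator`, `RemixInputs`, `build_prompt`, and `remix` are hypothetical names, and the two model calls are passed in as plain functions rather than calls to any real Gemini or Imagen API.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stand-ins for the two models: a captioner maps image bytes to
# a text description, and a generator maps a text prompt to image bytes.
# Neither type corresponds to a real Google API.
Captioner = Callable[[bytes], str]
Generator = Callable[[str], bytes]


@dataclass
class RemixInputs:
    """The three images a user supplies (field names are illustrative)."""
    theme: bytes
    scene: bytes
    style: bytes


def build_prompt(inputs: RemixInputs, describe: Captioner) -> str:
    """Describe each input image and merge the descriptions into one prompt."""
    parts = {
        "Theme": describe(inputs.theme),
        "Scene": describe(inputs.scene),
        "Style": describe(inputs.style),
    }
    return "\n".join(f"{role}: {text}" for role, text in parts.items())


def remix(inputs: RemixInputs, describe: Captioner, generate: Generator) -> bytes:
    """Run the two-stage pipeline: images -> text descriptions -> new image."""
    return generate(build_prompt(inputs, describe))


if __name__ == "__main__":
    # Toy stubs so the sketch runs without any model access.
    fake_describe = lambda image: image.decode("utf-8")
    fake_generate = lambda prompt: prompt.encode("utf-8")
    inputs = RemixInputs(b"a plush doll", b"a beach at sunset", b"enamel badge")
    print(remix(inputs, fake_describe, fake_generate).decode("utf-8"))
```

Passing the models in as functions keeps the sketch independent of any particular service; the point is only that the images themselves never reach the generator, which sees text descriptions alone.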
On December 10th Beijing time, Google announced its new quantum chip, Willow. The chip achieves a key breakthrough that the quantum computing field has pursued for nearly 30 years, and it completed in just 5 minutes a task that would take today's fastest computers 10²⁵ years. The research results were published in the journal Nature on December 9th.
After the news broke, the quantum information industry cheered and the AI community was stunned as well.
Willow's major breakthroughs fall into two areas. The first is a dramatic increase in performance, that is, computing power: 5 minutes of computation is equivalent to a task that would take the fastest current computer 10²⁵ years to complete. 10²⁵ years is far longer than the age of the universe (about 13 billion years). The contrast between 5 minutes and 10²⁵ years shows how staggering the leap in computing speed is.
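To make the two time scales concrete, the comparison can be worked through numerically. The sketch below assumes Google's reported estimate of 10^25 years for the classical computer and the article's round figure of about 13 billion years for the age of the universe; the year length of 365.25 days is also an assumption, and the resulting ratio applies only to this one benchmark task.

```python
CLASSICAL_YEARS = 1e25        # Google's reported estimate for a classical supercomputer
UNIVERSE_AGE_YEARS = 13e9     # the article's round figure
WILLOW_MINUTES = 5            # Willow's run time, from the article
MINUTES_PER_YEAR = 365.25 * 24 * 60   # assumed year length: 525,960 minutes


def universe_ages(years: float) -> float:
    """Express a duration in multiples of the age of the universe."""
    return years / UNIVERSE_AGE_YEARS


def speed_ratio(classical_years: float, quantum_minutes: float) -> float:
    """Ratio of classical run time to quantum run time, both in minutes."""
    return classical_years * MINUTES_PER_YEAR / quantum_minutes


if __name__ == "__main__":
    print(f"{universe_ages(CLASSICAL_YEARS):.2e} universe ages")          # 7.69e+14
    print(f"{speed_ratio(CLASSICAL_YEARS, WILLOW_MINUTES):.2e}x faster")  # 1.05e+30
```

Under these assumptions the classical estimate is several hundred trillion times the age of the universe, and the ratio between the two run times is on the order of 10^30.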
The second is powerful quantum error correction. Willow's key advance here is that, on a scalable square grid, the error rate falls rapidly as more qubits are used (the chip currently has 105 qubits). Scaling the encoded grid from 3x3 to 5x5 and then to 7x7 halves the error rate at each step. Moreover, Willow can perform error correction in real time, which makes it plausible to scale to larger qubit counts (such as 1,050) in a short period of time.
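The halving pattern described above can be illustrated with a small calculation. This is a sketch under the assumption that the error rate drops by a constant factor of 2 at each step from 3x3 to 5x5 to 7x7, as the article states; the function name `scaled_error_rate` and the starting error rate are made-up placeholders, not measured values from Willow.

```python
def scaled_error_rate(grid_size: int, base_rate: float, factor: float = 2.0) -> float:
    """Error rate for an odd grid_size x grid_size grid.

    base_rate is the (hypothetical) error rate at 3x3; each step up in grid
    size (3 -> 5 -> 7 ...) divides the rate by `factor`.
    """
    if grid_size < 3 or grid_size % 2 == 0:
        raise ValueError("grid_size must be an odd integer >= 3")
    steps = (grid_size - 3) // 2   # 3x3 -> 0 steps, 5x5 -> 1, 7x7 -> 2
    return base_rate / factor ** steps


if __name__ == "__main__":
    base = 0.01  # hypothetical placeholder, chosen only for readability
    for size in (3, 5, 7):
        print(f"{size}x{size}: {scaled_error_rate(size, base):.4f}")
    # 3x3: 0.0100
    # 5x5: 0.0050
    # 7x7: 0.0025
```

The point of the pattern is that adding qubits makes the encoded result more reliable instead of less: if the same factor held at larger grids, each further step would halve the error again.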
Of these two breakthroughs, the error correction capability has drawn more attention from scientists than the performance gain.
Quantum chips are the core of quantum computers. Willow was developed by the Google Quantum AI laboratory, led by Hartmut Neven. Neven stated that Willow is a big step towards large-scale, error-corrected quantum computers, and that its error correction capability and beyond-classical computing power bring us closer to a system that can support commercial applications, from helping discover new drugs, to designing more efficient electric vehicle batteries, to accelerating progress in nuclear fusion and new energy alternatives.
Source: Daily Economic News, compiled from Google and public information
Disclaimer: The content and data in this article are for reference only and do not constitute investment advice. Please verify before use. Any action taken on this basis is at your own risk.
CandyLake.com is an information publishing platform and only provides information storage space services.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.