Google releases the strongest AI model Gemini, with top securities firms quickly commenting: Continuously optimistic about the prospects of the AI industry
因醉鞭名马幌
发表于 2023-12-7 13:03:38
254
0
0
On December 7th, Caixin News Agency reported that US technology giant Google recently announced the launch of its largest and most powerful AI intelligent model, the Gemini.
The Gemini model released by Google this time can achieve multimodality and significantly improve performance. Gemini is a multimodal model built on Transformer decoder, which can process information in different forms of content such as video, audio, and text. The latest Gemini model is able to perform more complex reasoning and understand finer information compared to previous technologies. It can extract key points from hundreds of thousands of documents by reading, filtering, and understanding information, which will help achieve new breakthroughs in many fields from science to finance.
The Gemini model can be divided into three versions based on its size: Gemini Ultra, Gemini Pro, and Gemini Nano, all of which support contextual 32K understanding. Among them:
1) The Ultra version is the most powerful version and can demonstrate the highest efficiency in the corresponding TPU infrastructure. In multiple tests, the performance of the Ultra version exceeds GPT4V;
2) The Pro version is a cost-effective optimized version with strong capabilities in reasoning, multimodality, and other aspects. It has good scalability and can complete pre training within a few weeks. In multiple tests, it is second only to GPT4V and stronger than mainstream large models such as PaLM2, Claude2, LLaMA2, and GPT3.5;
3) Nano: It is a 4-bit model distilled from other models, with two versions: 1.8B and 3.25B, targeting low memory and high memory devices respectively, and supporting local deployment
The Gemini model, as the first multimodal model released by Google and globally, supports cloud and edge testing. According to relevant test data, Gemini Ultra outperforms human expert models in MMLU (Massive Multi tasking Language Understanding), with performance surpassing GPT-4 in multiple tasks when compared horizontally.
Minsheng Securities stated that by evaluating the Gemini model family in over 50 benchmark tests, as the model size increases, the Gemini model family continues to improve its quality in reasoning, mathematics/science, and long texts. Among all six abilities, Gemini Ultra is the best model. As the second largest model in the Gemini model family, Gemini Pro is also highly competitive in performance and more efficient in providing services.
Minsheng Securities pointed out that the Gemini training process can also innovate infrastructure, algorithms, and datasets;
In terms of infrastructure: Gemini is trained by Google TPUV5e and TPUV4, and has demonstrated engineering innovation during the training process. For example, by connecting 4096 TPUV4 chips to a dedicated optical switch, the 4x4x4 chip cube can be dynamically reconfigured as a super node of any 3D ring topology structure in about 10 seconds, and targeted deployment of Gemini Ultra and thermal maintenance functions. In response to the high inter chip interconnection speed required for the Ultra version, Google has applied multiple patented technologies such as OCS optical switching, but the final speed is not yet provided in the article.
In terms of algorithms, techniques such as single control algorithms and XLA compilers are used to optimize the training process, and stable training is achieved by preventing SDC and other issues.
In terms of dataset, Gemini training and inference speed are improved through word segmentation technology, and a series of filtering methods are used to ensure the high quality of the data used for training
The latest version of Google's computing chip TPU v5p has been released simultaneously. TPU v5p is an improvement of the previous TPU v4 version. Compared with TPU v4, TPU v5p has twice the floating-point performance and trains large language models 2.8 times faster. CITIC Securities believes that the official release of the multimodal Gemini model can expand the application scenarios and bring about continuous upgrades in computing power demand. Minsheng Securities continues to be optimistic about the future prospects of the AI industry and believes that the release of models such as GPT-5 will also bring more catalysis.
CITIC Securities stated that in the current search scenario, Gemini can reduce latency by approximately 40%. For the entire industry, the promotion of Google's productization and commercialization will also bring about overall changes. At the same time, with the launch of models such as GPT-5, it is expected to see: 1) the increase in computing power demand brought by multimodal models; 2) More and more AI scenarios and products are emerging.
The release of Gemini will further bring more expectations for multimodal models, which will drive an increase in computing power demand for the industry; In the medium to long term, it is expected that the upgrade of multimodal models will enrich the usage scenarios of related products, coupled with cost optimization brought about by hardware upgrades and algorithm optimization. The progress of 2C products is worth looking forward to.
CITIC Securities stated that it remains optimistic about the long-term impact and changes of this round of generative AI on the technology industry, and continues to focus on leading manufacturers in areas such as computing power, algorithms, data, and applications.
CandyLake.com is an information publishing platform and only provides information storage space services.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
You may like
- Google's strongest AI model Gemini officially released: multimodal, three major versions
- Google's Strongest AI Model Gemini Releases 100 ETFs (588120) on the Science and Technology Innovation Board, with a Transaction Volume of Over 300 million yuan and Net Inflow of Over 300 million yuan in the Past 10 Days
- Who is the strongest in advanced intelligent driving? Baidu, Huawei, and Xiaopeng have started arguing
- Increase holdings in concept stocks! The latest disclosure from two top private equity firms
- Four top private equity firms exposed their "US stock performance report": Pinduoduo is still at Hillhouse and Gao Yi, but Jinglin quietly reduces its holdings
- Meta releases strongest open-source model to catch up with GPT-4, Xiaozha: overtake next year
- Global stock market crash! Urgent notice from securities firms: Suspend night trading!
- Hema's own brand products are listed on Lazada, a leading e-commerce platform in Singapore
- Top 20 US Stock Transactions: Securities firm Jefferies downgraded Apple's rating, citing high expectations for iPhone
-
11월 14일, 세계예선 아시아지역 제3단계 C조 제5라운드, 중국남자축구는 바레인남자축구와 원정경기를 가졌다.축구 국가대표팀은 바레인을 1-0으로 꺾고 예선 2연승을 거두었다. 특히 이번 경기 국내 유일한 중계 ...
- 我是来围观的逊
- 어제 15:05
- Up
- Down
- Reply
- Favorite
-
계면신문기자 장우발 4분기의 영업수입이 하락한후 텐센트음악은 다시 성장으로 돌아왔다. 11월 12일, 텐센트음악은 최신 재보를 발표했다.2024년 9월 30일까지 이 회사의 3분기 총수입은 70억 2천만 위안으로 전년 ...
- 勇敢的树袋熊1
- 3 일전
- Up
- Down
- Reply
- Favorite
-
본사소식 (기자 원전새): 11월 14일, 다다그룹 (나스닥코드: DADA) 은 2024년 3분기 실적보고를 발표했다. 수치가 보여준데 따르면 고품질발전전략에 지속적으로 전념하고 사용자체험을 끊임없이 최적화하며 공급을 ...
- 家养宠物繁殖
- 그저께 15:21
- Up
- Down
- Reply
- Favorite
-
11월 12일 소식에 따르면 소식통에 따르면 아마존은 무료스트리밍서비스 Freevee를 페쇄하고 일부 종업원과 프로를 구독서비스 Prime Video로 이전할 계획이다. 올해 초 아마존이 내놓은 몇 편의 대형 드라마의 효 ...
- 度素告
- 3 일전
- Up
- Down
- Reply
- Favorite