Tongyi Qianwen's open-source 32 billion parameter model has achieved full open-source for 7 major language models
芊芊551
发表于 2024-4-7 17:04:49
202
0
0
Alibaba Cloud's Tongyi Qianwen open-source 32 billion parameter model Qwen1.5-32B can balance performance, efficiency, and memory usage to the greatest extent possible, providing enterprises and developers with a more cost-effective model choice. At present, Tongyi Qianwen has opened up 7 major language models, with a cumulative download volume exceeding 3 million in open source communities at home and abroad.
Tongyi Qianwen has previously opened up six large language models with parameters of 500 million, 1.8 billion, 4 billion, 7 billion, 14 billion, and 72 billion, all of which have been upgraded to version 1.5. Among them, several small-sized models can be easily deployed on the end side, while the 72 billion parameter model has industry-leading performance and has repeatedly appeared on models lists such as HuggingFace. This open-source 32 billion parameter model will achieve a more ideal balance between performance, efficiency, and memory usage. For example, compared to the 14B model, the 32B model has stronger capabilities in intelligent agent scenarios; Compared to 72B, 32B has lower inference costs. The Tongyi Qianwen team hopes that the 32B open source model can provide better solutions for downstream applications.
In terms of basic capabilities, the 32 billion parameter model of Tongyi Qianwen has performed well in multiple evaluations such as MMLU, GSM8K, HumanEval, BBH, etc. Its performance is close to that of Tongyi Qianwen's 72 billion parameter model, far exceeding other 30 billion parameter models.
In terms of the Chat model, the Qwen1.5-32B-Chat model scored over 8 points in the MT Bench evaluation, and the gap between it and Qwen1.5-72B-Chat is relatively small.
In terms of multilingual abilities, the Tongyi Qianwen team selected 12 languages including Arabic, Spanish, French, Japanese, Korean, etc., and conducted assessments in multiple fields such as exams, comprehension, mathematics, and translation. The multilingual ability of Qwen1.5-32B is only slightly inferior to the 72 billion parameter model of Tongyi Qianwen.
CandyLake.com is an information publishing platform and only provides information storage space services.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
You may like
- Boeing announces 10% layoffs, first delivery of 777X model postponed to 2026
- Faraday Future plans to launch the first model of its second brand by the end of next year
- Will a third brand launch hybrid models overseas? NIO responds: Continuing the pure electric technology route
- He Xiaopeng: Xiaopeng's car end large model aims to achieve a 100 kilometer takeover once next year
- Faraday Future: Second brand FX plans to launch two models with a price not exceeding $50000
- Robin Lee: The average daily adjustment amount of Wenxin Model exceeded 1.5 billion, 30 times more than that of a year ago
- Will DeepMind's open-source biomolecule prediction model win the Nobel Prize and ignite a wave of AI pharmaceuticals?
- "AI new generation" big model manufacturer Qi "roll" agent, Robin Lee said that it will usher in an era of "making money by thinking"
- Robin Lee said that the illusion of the big model has basically eliminated the actual measurement of ERNIE Bot?
-
11월 14일, 세계예선 아시아지역 제3단계 C조 제5라운드, 중국남자축구는 바레인남자축구와 원정경기를 가졌다.축구 국가대표팀은 바레인을 1-0으로 꺾고 예선 2연승을 거두었다. 특히 이번 경기 국내 유일한 중계 ...
- 我是来围观的逊
- 1 분전
- Up
- Down
- Reply
- Favorite
-
"영비릉: 2024회계연도 영업수입 동기대비 8% 감소"영비릉은 2024회계연도 재무제보를 발표했다.2024 회계연도 매출은 149억5500만 유로로 전년 동기 대비 8% 감소했습니다.이익은 31억 500만 유로입니다.이익률은 ...
- 勇敢的树袋熊1
- 3 일전
- Up
- Down
- Reply
- Favorite
-
계면신문기자 장우발 4분기의 영업수입이 하락한후 텐센트음악은 다시 성장으로 돌아왔다. 11월 12일, 텐센트음악은 최신 재보를 발표했다.2024년 9월 30일까지 이 회사의 3분기 총수입은 70억 2천만 위안으로 전년 ...
- 勇敢的树袋熊1
- 그저께 15:27
- Up
- Down
- Reply
- Favorite
-
본사소식 (기자 원전새): 11월 14일, 다다그룹 (나스닥코드: DADA) 은 2024년 3분기 실적보고를 발표했다. 수치가 보여준데 따르면 고품질발전전략에 지속적으로 전념하고 사용자체험을 끊임없이 최적화하며 공급을 ...
- 家养宠物繁殖
- 어제 15:21
- Up
- Down
- Reply
- Favorite