JD technical leader: Large models will become smaller and even finer down to the scene
四夜父脚群
发表于 2024-7-31 19:01:30
1235
0
0
General big models rely on computing power to build, while enterprise big models rely on business to run out
On July 30th, at the JD Cloud Summit held in Shanghai, Cao Peng, Chairman of the Technical Committee of JD Group and President of JD Cloud Business Unit, expressed the above views. According to his understanding, for large models, data is nourishment and scenarios are training grounds.
Over the past year, there has been a sustained craze for big models, and the industry has experienced a 'thousand model war'. According to statistics from the China Academy of Information and Communications Technology, there are currently over 1000 basic large-scale models worldwide, with China accounting for 35% of the global total.
Although the performance of basic models is constantly improving, in the personal user end, large models have not yet achieved true super applications. Instead, in many enterprise scenarios, they have gradually been deployed based on applications.
At the summit, JD Cloud showcased the latest practices of JD Yanxi's big model landing industry and released eight products including JD Cloud Enterprise Big Model Service, Yanxi Intelligent Agent Platform, Intelligent Programming Assistant JoyCoder, and Yanxi Digital Person 3.0.
According to data provided by JD.com, as of now, JD's big model has been implemented in over a hundred scenarios, covering different industries such as healthcare, e-commerce live streaming, logistics, and finance. Many of JD's own delivery personnel, merchants, doctors, procurement and sales operations, and R&D personnel have received support from the big model application.
For example, the "Jingyi Qianxun" service that serves medical scenarios, according to the head of JD Health Intelligent Algorithm Department, currently has four different sized models internally. One is a small model of about 2b, which provides a single service in a narrow domain. The team envisions that it can even be used on mobile phones in the future; The second is a medium-sized model with 14b and 22B as the core, which completes some medical consulting and service support work; Finally, there is a large model centered around 80s that specializes in serving complex medical decision-making and reasoning abilities.
The above model supports private deployment, even integrated deployment, which is related to industry characteristics. "It is difficult for the medical industry to accept a completely cloud based model, and few hospitals can accept this breakthrough," said the person in charge.
According to its introduction, in actual hospital implementation scenarios, Beijing Medical Qianxun will pay more attention to independently completing patient services in compliance, including triage, pre consultation, registration, appointment, accompanying consultations during consultations, and post consultation health management.
On the first day of GPT's release, everyone thought about the natural conversational ability and so-called anthropomorphic ability of this generation. From this perspective, whether it can better become a doctor's assistant is more valuable than becoming a diagnostic tool for doctors, "the person in charge emphasized.
In the beauty scene, unlike pure live streaming in the past, JD.com is currently attempting to combine digital person makeup testing with digital person anchors internally; In terms of footwear and clothing scenes, there will be a scene where digital people live stream in the front and hosts change their outfits in the back. The live streaming style based on specific category attributes will be transferred to digital people.
When it comes to the development trend of large models, several technical leaders from JD.com have stated that large models will become smaller and smaller. Vertical large models are a relatively certain direction, and can even be further refined to scene large models. The inherent logic is that large models need to adapt to scenarios and industries, so they cannot be too large.
He Xiaodong, Dean of JD Exploration Research Institute and Head of JD Technology's Artificial Intelligence Business, believes that due to limitations in data and computing power, simply increasing the scale of the model may quickly reach the development ceiling, resulting in the economic benefits generated by the large model being insufficient to support its own costs, making it difficult to sustain.
The large-scale models are growing at a rate of 10 times per year, with parameters ranging from billions to trillions. However, commercialization is currently lagging behind and will eventually become a problem in the medium to long term. He also pointed out that the illusion rate of many models is still high, which cannot provide solid guarantees for future industrial applications.
According to He Xiaodong, JD.com starts from the initial strategy model in terms of model self evolution. Firstly, it constructs an initial preference dataset, and then uses a pre trained reward model to score each answer. Based on the high or low score, it constructs new preference data, which will greatly promote model iteration and updates.
In terms of model inference, the cost of big language model inference is currently skyrocketing. Therefore, JD.com has improved model construction efficiency through end-to-end, low bit, high-precision quantization technology, reducing model size and enhancing inference performance without affecting model output accuracy and parameter quantity. He Xiaodong said that his current technical solution has saved 70% of the model's video memory.
When it comes to the large-scale model of enterprise implementation, Cao Peng believes that there are three key points. Firstly, simplicity is crucial. The diversity and fragmentation of scenarios cannot sustain high development costs, and it is necessary to minimize the threshold for using large models in order to cover more applications. Next is openness, based on an open Agent ecosystem, large model ecosystem, and cloud native ecosystem, giving customers the right to choose. The third is security, providing data security and privacy protection, AIGC content compliance, corpus data security management, making enterprise big model services trustworthy and reliable.
CandyLake.com is an information publishing platform and only provides information storage space services.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
Disclaimer: The views expressed in this article are those of the author only, this article does not represent the position of CandyLake.com, and does not constitute advice, please treat with caution.
You may like
- JD Seven Fresh responds to price war rumors: no one targeted, just offering low prices
- JD Seven Fresh reduces prices, Meituan Xiaoxiang follows the trend of instant retail and the smoke of gunpowder rises again
- JD Seven Fresh initiates an instant retail price war
- The Nasdaq Golden Dragon Index fell over 4%, Pinduoduo fell over 6%, and JD.com fell over 6%
- JD.com announces Double 11 results, with a year-on-year increase of over 20% in the number of shopping users
- The number of shopping users on JD.com 'Double 11' has increased by over 20% year-on-year
- JD 11.11: The number of shopping users increased by over 20% year-on-year
- Double Eleven data revealed: cumulative sales exceeded 1.4 trillion yuan, with JD 3C Digital accounting for 42.8%
- JD's revenue growth accelerates in the third quarter, with executives revealing plans to increase investment in clothing and beauty
- A sudden fire broke out in the logistics park! JD releases statement
-
11월 14일, 세계예선 아시아지역 제3단계 C조 제5라운드, 중국남자축구는 바레인남자축구와 원정경기를 가졌다.축구 국가대표팀은 바레인을 1-0으로 꺾고 예선 2연승을 거두었다. 특히 이번 경기 국내 유일한 중계 ...
- 我是来围观的逊
- 6 시간전
- Up
- Down
- Reply
- Favorite
-
계면신문기자 장우발 4분기의 영업수입이 하락한후 텐센트음악은 다시 성장으로 돌아왔다. 11월 12일, 텐센트음악은 최신 재보를 발표했다.2024년 9월 30일까지 이 회사의 3분기 총수입은 70억 2천만 위안으로 전년 ...
- 勇敢的树袋熊1
- 그저께 15:27
- Up
- Down
- Reply
- Favorite
-
본사소식 (기자 원전새): 11월 14일, 다다그룹 (나스닥코드: DADA) 은 2024년 3분기 실적보고를 발표했다. 수치가 보여준데 따르면 고품질발전전략에 지속적으로 전념하고 사용자체험을 끊임없이 최적화하며 공급을 ...
- 家养宠物繁殖
- 어제 15:21
- Up
- Down
- Reply
- Favorite
-
11월 12일 소식에 따르면 소식통에 따르면 아마존은 무료스트리밍서비스 Freevee를 페쇄하고 일부 종업원과 프로를 구독서비스 Prime Video로 이전할 계획이다. 올해 초 아마존이 내놓은 몇 편의 대형 드라마의 효 ...
- 度素告
- 그저께 13:58
- Up
- Down
- Reply
- Favorite