Home GADGETS AI disruptor DeepSeek’s next-gen model delayed by Nvidia GPU export restrictions to...

AI disruptor DeepSeek’s next-gen model delayed by Nvidia GPU export restrictions to China — short supply of AI GPUs hinders development

AI disruptor DeepSeek’s next-gen model delayed by Nvidia GPU export restrictions to China — short supply of AI GPUs hinders development


AI disruptor DeepSeek’s next-gen model delayed by Nvidia GPU export restrictions to China — short supply of AI GPUs hinders development

DeepSeek attracted a lot of attention with its R1 AI model earlier this year, but it looks like development of the next-generation R2 model has stalled due to shortage of Nvidia’s H20 processors in China, reports The Information. DeepSeek itself has not commented on when its R2 model is set to be available.

DeepSeek used a cluster consisting of 50,000 Hopper GPUs — including 30,000 H20s, 10,000 H800s, and 10,000 H100s — obtained by its investor High-Flyer Capital Management — to train its R1 model. It is unclear whether R2 has already been fully pre-trained. The Information reports citing two individuals familiar with the project that DeepSeek team has been working intensively on the model, but CEO Liang Wenfeng is not yet satisfied with its capabilities. Work continues internally to improve performance before the model is cleared for deployment.

Source link