Nvidia calls China’s DeepSeek R1 model ‘an excellent AI advancement’
Nvidia called DeepSeek’s R1 model “an excellent AI advancement,” even though the Chinese startup’s emergence caused the chip maker’s stock price to plunge 17% on Monday.
“DeepSeek is an excellent AI advancement and a perfect example of Test Time Scaling,” an Nvidia spokesperson told CNBC on Monday.
“DeepSeek’s work illustrates how new models can be created using that technique, leveraging widely-available models and compute that is fully export control compliant.”
The comments come after DeepSeek last week released R1, an open-source reasoning model that reportedly outperformed the best models from U.S. companies such as OpenAI. R1’s self-reported training cost was less than $6 million, a fraction of the billions that Silicon Valley companies are spending to build their artificial-intelligence models.
Nvidia’s statement indicates that it sees DeepSeek’s breakthrough as creating more work for the American chip maker’s graphics processing units, or GPUs.
“Inference requires significant numbers of NVIDIA GPUs and high-performance networking,” the spokesperson added. “We now have three scaling laws: pre-training and post-training, which continue, and new test-time scaling.”
Nvidia also said that the GPUs DeepSeek used were fully export compliant. That counters Scale AI CEO Alexandr Wang’s comments on CNBC last week that he believed DeepSeek used Nvidia GPU models that are banned in mainland China. DeepSeek says it used special versions of Nvidia’s GPUs intended for the Chinese market.
Analysts are now asking whether multibillion-dollar capital investments in Nvidia-based AI infrastructure by companies like Microsoft, Google and Meta are being wasted when the same results can be achieved more cheaply.