If you're active online, particularly in tech spaces, you might have heard about a company called DeepSeek. If not, here’s a quick intro: DeepSeek is a Chinese artificial intelligence (AI) company founded in 2023 as a spinout from Zhejiang University.
DeepSeek's models have caught the attention of Western companies because they are claimed to have been developed at a fraction of the typical cost and use significantly less computing power. To give you a quick comparison with a pioneer like ChatGPT, take a look at this table:
Aspect | DeepSeek V3 | OpenAI GPT-4 |
---|---|---|
Training Cost | Approximately $5.57 million | Estimated around $100 million |
Computational Resources | Utilized about 2,000 NVIDIA H800 GPUs | Employed approximately 25,000 NVIDIA A100 GPUs |
Training Duration | Completed in less than two months | Took between 90 to 100 days |
You get the idea. Western markets are freaking out, especially since DeepSeek has risen to the top of the App Store charts in recent days. With so many people trying to access it, there are now confirmed reports of outages and performance problems, reminiscent of ChatGPT's early days. Remember when ChatGPT would regularly log people out, glitch out, and users had to report bugs to OpenAI on Discord?
This comes after Biden's last-minute rule (before leaving office) that will place further restrictions on AI chip exports to countries like China, a rule that Nvidia fiercely opposed, condemning Biden and praising Trump.
Speaking of Nvidia, Bloomberg reports that the company's stock market value has taken a historic plunge, with over $400 billion erased from its market cap. According to Bloomberg, this 13% drop is the biggest in US stock market history, surpassing Nvidia's own record from September, when the company lost almost $280 billion in value.
Investors are concerned that the rise of competitors like DeepSeek could change the way modern LLMs are trained and used. So, you might be wondering, how are OpenAI and its employees responding to all this? Sam Altman, in an X post last month, said:
While DeepSeek wasn't mentioned explicitly, the comment is generally interpreted as being aimed at models like DeepSeek, which are replicating ChatGPT at a cheaper cost.
Altman tweeted this a day after DeepSeek V3 launched (December 27, 2024), as the initial hype took off on X.
Image via Depositphotos.com
Hope you enjoyed this news post.
Thank you for appreciating my time and effort posting news every day for many years.
News posts... 2023: 5,800+ | 2024: 5,700+
RIP Matrix | Farewell my friend
Recommended Comments
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.