I found this to be the most useful video on DeepSeek that I've seen so far. I wanted to understand what the real innovation of DeepSeek was.
Some takeaways:
- DeepSeek claims to have trained a Llama-sized model for only $5 million. <-- To me, this is still a suspect claim.
- DeepSeek uses a Mixture of Experts and Distillation approach. These are essentially strategies for making the models smaller.
- I'm not sure how much of an innovation this really is. both methods were already known
- DeepSeek has made some mathematical innovations that speed up some computations
- DeepSeek R1 is training Chain-of-Thought models using reinforcement learning.
- Advantage here is that the thought-chains are not needed as part of the training data, only questions and answers. DeepSeek optimizes its chains-of-thought simply to maximize the rewards / minimize the punishments from the reinforcement learning. The model is rewarded for getting the right answer, with a small reward for explaining its chain of thought. This makes training more accessible for people without huge data centers & huge training datasets.
- This seems to be one of the main innovations, because previously only OpenAI's o1 was working on chain-of-thought models. The claim is that DeepSeek makes training and inference of CoT models cheaper and more accessible.
I feel like I understand more why people are hyped about DeepSeek, but I'm not fully convinced I should believe in the hype. There are dozens of videos out there showing DeepSeek's supposed capabilities, but the incentives of YouTubers is to overhype stuff. They show all the cool things, but how many failed DeepSeek responses did they not show? How much of the training cost is not being properly reported?
In any case, the fact that DeepSeek released their code as open source is a good sign. It means that even if they're overhyped, people can quickly learn where the pros and cons are.
Is this bad for Nvidia? I don't think so. See Jevons Paradox.
Is this bad for OpenAI? Let's just say Sam Altman is probably wetting his pants
Footnotes