‘A virtual DPU within a GPU’: Could smart hardware hack be behind the innovative efficiency of Deepseek?



  • A new approach called Dualpipe seems to be the key to Deekseek’s success
  • An expert describes it as a virtual DPU in the GPU that maximizes bandwidth efficiency
  • While Deepseek has used only the Nvidia GPUs, one wonders how I would go to AMD instinct

Deepseek AI Chatbot of China has surprised the technology industry, representing a credible alternative to the OpenAi chatpt to a fraction of the cost.

A recent article Deepseek V3 was trained in a group of 2,048 GPU NVIDIA H800: paralyzed versions of H100 (we can only imagine how much more powerful it would be executed in AMD Instinct accelerators!). As reported, it required 2.79 million hours of GPU for the previous, adjusted, adjusted in 14.8 billion tokens and cost, according to the calculations made by The next platform – Only $ 5.58 million.

Leave a Comment

Your email address will not be published. Required fields are marked *