Sambanova arrives at 198 tokens per second in the deep depth and not completely annoying with only 16 RDU SN40L chips



  • Sambanova runs Deepseek-R1 to 198 tokens/sec using 16 custom chips
  • According to reports, the SN40L RDU chip is 3 times faster, 5 times more efficient than GPUs
  • Soon 5x speed impulse is promised, with 100x capacity at the end of the year in the cloud

The Chinese Deepseek monkey , while more profitable.

Sambanova Systems, a startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what states that it is the fastest deployment in the world of the Deepseek-R1 671b LLM to date.

Leave a Comment

Your email address will not be published. Required fields are marked *