Intel and SambaNova just built a three-chip AI machine that splits work between GPU, RDU and Xeon


  • GPUs handle prefetch operations by converting messages into key-value caches.
  • SambaNova RDUs generate tokens with high throughput and low latency
  • Intel Xeon 6 processors manage workload distribution and execute compiled code

Intel and SambaNova Systems have introduced a joint hardware model that combines GPUs, SambaNova RDUs, and Intel Xeon 6 processors for large-scale inference workloads.

The system allocates GPUs for prefetch operations, RDUs for decoding, and Xeon CPUs for execution and orchestration tasks in agent-controlled environments.



Leave a Comment

Your email address will not be published. Required fields are marked *