Yet another tech startup wants to topple Nvidia with ‘orders of magnitude’ better energy efficiency; Sagence AI bets on analog computing in memory to deliver 666,000 tokens/s in Llama2-70B


  • Sagence brings in-memory analog computing to redefine AI inference
  • Ten times less power and 20 times less costs
  • It also offers integration with PyTorch and TensorFlow.

Sagence AI has introduced an advanced in-memory analog computing architecture designed to address power, cost and scalability issues in AI inference.

Using an analog approach, the architecture offers improvements in energy efficiency and cost-effectiveness, while delivering performance comparable to existing high-end CPU and GPU systems.

Leave a Comment

Your email address will not be published. Required fields are marked *