Storia in 1 fonti

Running AI on mixed hardware for speed and affordability

Researchers show that serving AI models with llm-d can boost inference speeds by up to 5 times and double throughput — all while using heterogeneous GPUs.

Raccontata da

research.ibm.com

Timeline cronologica

martedì 23 giugno 2026·research.ibm.com
Running AI on mixed hardware for speed and affordability
Researchers show that serving AI models with llm-d can boost inference speeds by up to 5 times and double throughput — all while using heterogeneous GPUs.