According to a report by The Information, OpenAI has cut inference costs for its AI models by more than half. The company applied the optimizations to ChatGPT, where the number of Nvidia GPUs needed dropped to just a few hundred at times.

We closely track efforts by Anthropic, Google and OpenAI to get access to more server chips to run their models. But we don’t talk enough about the work these companies are doing…

OpenAI has found a way to cut inference costs by half, easing the financial burden of running large language models as compute spending threatens billions

According to a report by The Information, OpenAI has cut inference costs for its AI models by more than half. The company applied the optimizations to ChatGPT, where the number of…

Laut einem Medienbericht haben OpenAI-Ingenieure Optimierungen gefunden, die die Kosten für den Betrieb bestehender KI-Modelle um mehr als 50 Prozent senken.