Li, who holds a PhD in Computer and Information Sciences, has taken on a pivotal role leading speech AI research at the e-commerce giant

Li Xiangang, a leading Chinese scientist in speech recognition, has joined Alibaba Group Holding to spearhead its artificial intelligence (AI) voice team, boosting the tech giant’s capabilities in the burgeoning field.

Sources familiar with the matter said Li, who holds a PhD in Computer and Information Sciences from Peking University, had taken on a role leading speech AI research at the Hangzhou-based e-commerce giant. He fills a position previously held by Yan Zhijie, who left the company.

Alibaba owns the South China Morning Post.

The speech team, part of Alibaba’s Tongyi Lab, focuses on multimodal speech and language models. In July 2024, the lab open-sourced two foundational speech models, SenseVoice and CosyVoice. According to Alibaba, SenseVoice’s multilingual speech recognition outperformed OpenAI’s Whisper by 50 per cent in Chinese and Cantonese.