Alibaba is designing AI chips around agents, and that changes what the race is actually about

Alibaba has unveiled a new AI processor built specifically for AI agents, pairing the chip announcement with a multi-year silicon roadmap and a new large language model, signalling that the company is building an integrated AI stack, not just filling a gap left by US export controls.

The Zhenwu M890, developed by Alibaba’s semiconductor subsidiary T-Head, delivers three times the performance of its predecessor, the Zhenwu 810E, according to the company, as per Reuters report. But the performance jump is less notable than the architectural intent behind the chip: the M890 is purpose-built for AI agents, where software systems must retain long stretches of context, coordinate with other models in real time, and execute complex multi-step tasks with limited human intervention. 

Those demands, heavy on memory bandwidth and inter-model communication, are meaningfully different from what standard inference chips are optimised for. The difference matters because it tells you something about where Alibaba thinks AI compute is heading. The company isn’t designing around today’s dominant use case; it’s building for the workload profile it expects to define enterprise AI over the next several years.

Built for AI agents, not just inference

More significant than the chip itself is the roadmap Alibaba put alongside it. The M890 will be followed by the V900 in the third quarter of 2027, expected to deliver another roughly threefold performance gain, followed by the J900 in the third quarter of 2028. That’s a deliberate, sustained cadence of in-house silicon upgrades that mirrors the kind of tick-tock product cycles Nvidia has used to maintain its lead in AI accelerators.

The parallel to Huawei is worth noting. Huawei laid out a similar chip roadmap for its Ascend line last year, and both announcements reflect the same underlying reality: Chinese technology companies have concluded that depending on foreign silicon, even in scenarios where export restrictions might ease, is a structural risk they cannot accept. The response has been to treat semiconductor development as a long-term capability-building exercise rather than a procurement problem.

Alibaba’s commitment to that exercise is not shallow. The company pledged more than 380 billion yuan, roughly US$53 billion, on cloud and AI infrastructure over three years last year, its largest-ever investment commitment to the sector. The M890 and its successors are downstream of that spending.

Traction that predates the announcement

T-Head said it has shipped more than 560,000 Zhenwu units to date, with over 400 external customers across 20 industries deploying the chips, including automakers and financial services firms. That is a material production footprint, not lab hardware, and it provides Alibaba with real-world deployment data at scale ahead of the M890’s rollout.

The new chip will be available to Chinese enterprise customers through Alibaba Cloud’s domestic model platform, Bailian, packaged inside the Panjiu AL128, a server system that stacks 128 M890 accelerators into a single rack.

The software side of the stack

Alongside the hardware, Alibaba announced Qwen 3.7-Max, the latest version of its flagship large language model, described as engineered for advanced coding and long-running agent tasks. The company said the model can operate continuously for up to 35 hours without performance degradation, a capability specification that only makes sense if you are designing for extended autonomous operation.

The timing is deliberate. Releasing a chip and a model optimised for the same workload class on the same day is a platform play. Alibaba is building a closed loop: its own silicon in T-Head, its own model in Qwen, its own cloud delivery in Bailian. Each component reinforces the others, and the combined stack is designed to reduce enterprise customers’ dependence on any external vendor.

Half a million chips shipped. A successor arriving in 2027, another in 2028. T-Head is not hedging. At some point, building around US export controls stops being a workaround and starts being a strategy. Alibaba appears to have crossed that line.

(Image source: The White House)

See Also: Alibaba Qwen is challenging proprietary AI model economics

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and co-located with other leading technology events. Click here for more information.

AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

The post Alibaba is designing AI chips around agents, and that changes what the race is actually about appeared first on AI News.

📰 Original Source

This article was originally published on AI News. Click below to read the complete article.

Read Full Article on AI News →