On the Huawei Join 2025 occasion in Shanghai final week, Chinese language expertise Huawei defined its plans for the subsequent generations of its Ascend chip line, and launched particulars on its deployment in pods, and world-leading compute clusters.
Talking on the Huawei Join convention, deputy chair of Huawei board, Eric Xu, stated 2025 had been a memorable 12 months, and claimed the debut of DeepSeek-R1 in January was a turning level for Huawei. He acknowledged what business commentators have beforehand confirmed; China might be prone to lag behind in semiconductor manufacturing processing, “for a comparatively very long time.”
Within the face of tariffs, politically-motivated rhetoric, and commerce embargoes, Huawei will stay forward by advancing infrastructure design and related applied sciences, producing highly effective clusters related by an open-sourced protocol. It’s additionally determined to open-source a number of giant items of beforehand proprietary software program, together with the openPangu basis AI fashions and its Thoughts SDKs.
The brand new Ascend chip vary
The corporate plans to produces three new collection of the Ascend chip, which it’s calling the 950, 960, and 970.
The Ascend 950PR and 950TO will present extra help for low-precision information codecs, together with FP8: the 950 mannequin will ship a PFLOP of efficiency, and at MXFP8, ship two PFLOPs. (A PFLOP is one thousand trillion floating level calculations per second.)
The corporate says the 950 will supply higher vector processing, and be able to granular reminiscence entry, processing 128 byte chunks, down from 512 bytes.
The Ascend 950 chips will supply 2 TB/s interconnect bandwidth, two-and-a-half greater than the present Ascend 910C, which went into manufacturing for Huawei prospects in Might this 12 months. The 950PR might be obtainable within the first quarter of 2026, with the Ascend 950DT following This fall.
A 12 months later in This fall 2027, the Ascend 960 will supply twice the computing energy, reminiscence entry bandwidth, reminiscence capability, and variety of interconnect ports as its stablemate, the 950. It would help Huawei’s proprietary HiF4 information format, which brings higher precision than different FP4 applied sciences, the corporate says.
The top of 2028 will see the emergence of Huawei’s most succesful chip, the Ascend 970. Xu informed the convention, “We’re nonetheless engaged on a few of its specs, however our common aim is to push all of its specs a lot increased.” He stated the Ascend 970 collection will have the ability to talk through interconnect bandwidth of 4TB/s, burn by 8 PFLOPs of FP4, and can include an excellent bigger reminiscence capability than earlier iterations of the Ascend vary.

(Eric Xu, Huawei
SuperPods of NPUs
Fairly than concentrating on producing chips which might be extra highly effective than its primary competitor, Nvidia, Huawei’s technique is to supply hyperscalers clusters of ready-made compute within the type of SuperPoDs. These will start to look This fall 2026 within the type of the Atlas 950 SuperPoD loaded with Ascend 950DT chips.
Competitor NVIDIA’s NVL144 system (the closest SuperPod competitor) is about to launch mid- to late-2026. Huawei claims that its first SuperPoD will ship almost seven instances its processing energy, and have 56.8 instances extra NPUs than the variety of GPUs within the NVL144. Nvidia’s next-generation NVL576, set to be launched in 2027, will nonetheless be out-performed by the Atlas 950 SuperPoD, Huawei says.
Common computing (non-AI centered) chip vary
For common computing, Huawei plans to launch two fashions of its Kunpeng 950 processors at the start of 2026. They are going to comprise of 96 cores & 192 threads, or 192 cores & 384 threads within the sooner mannequin. On the identical time, Huawei plans to launch what Xu referred to as, “the wold’s first general-purpose computing SuperPoD,” the Kunpeng 950-based TaiShan 950.
Open-source connectivity protocol, UnifiedBus 2.0
Each the NPU and ‘common computing’ SuperPoDs will use Huawei’s UnifiedBus 2.0, which is an replace on the prevailing UnifiedBus 1.0. Model 1.0 of UnifiedBus is the interconnection expertise utilized by the Atlas 900 A3 SuperPoD, which appeared in March this 12 months, and already has greater than 300 installations in DCs and establishments around the globe.
UnifiedBus 2.0 might be licensed as open-source, and the technical specs are to be launched instantly to the developer group. UnifiedBus 2.0 will minimize its enamel within the new generations of SuperPods, and be the connection protocol for clusters of SuperPods, inevitably named SuperClusters.
The primary SuperCluster would be the Atlas 950 SuperCluster, which is able to supply 2.5 instances extra NPUs and 1.3 instances extra computing energy than xAI’s Colossus cluster, at present the world’s strongest.
In the direction of the tip of 2027, Huawei will launch the Atlas 960 SuperCluster, which is able to include over 1,000,000 NPUs and ship 4 ZFLOPS in FP4 (with a ZFLOP representing 10^21 floating level operations per second) through a single virtualised compute occasion. “SuperPoDs and SuperClusters powered by UnifiedBus are our reply to surging demand for computing, each right now and tomorrow,” Xu stated.
(Picture supply: “Coronal Mass Ejection on the Solar” by Blatant Views is licensed underneath CC BY 2.0.)

Need to study extra about Knowledge Centres from business leaders? Try Knowledge Centre Congress happening in Amsterdam, California, and London. The great occasion is a part of TechEx and co-located with different main expertise occasions. Click on right here for extra info.
CloudTech Information is powered by TechForge Media. Discover different upcoming enterprise expertise occasions and webinars right here.