PulseAugur
EN
LIVE 21:04:56
中文(ZH) 不打价格战,把 Token 工厂建在国产硬件上

Huawei Cloud launches AI infrastructure with sub-10ms token generation

Huawei Cloud has launched a new infrastructure designed for agentic AI, featuring a "Token Factory" capable of generating tokens in under 10 milliseconds. This system integrates general and intelligent computing resources, boosting utilization by over 30%, and offers a petabyte-scale memory space for AI agents. The platform also includes ModelArts Next, a new training and inference platform that can automatically route requests to the most suitable of over 15 SOTA models, reducing costs by an average of 20%. This initiative leverages Huawei's Ascend ecosystem, demonstrating that domestic computing power can achieve competitive performance for mainstream large models, as seen when DeepSeek models were deployed on their hardware. AI

IMPACT This infrastructure aims to accelerate AI agent development and deployment, potentially lowering costs and improving efficiency for enterprise AI applications.

RANK_REASON Launch of a new AI infrastructure platform by a major tech company. [lever_c_demoted from significant: ic=1 ai=0.7]

Read on 36氪 (36Kr) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. 36氪 (36Kr) TIER_1 中文(ZH)

    Don't engage in price wars, build the Token factory on domestic hardware

    要达到 Agentic Infra的建设目标,华为云端出了一整套底层设施。 而要具体拆解华为云所定义的Agentic Infra,则包括四个方面:高效的Token 工厂、可持续学习、通智一体化调度、安全自治,华为云也对应发布了四款新品。 最核心的是 AICS 灵衢智算集群。它基于灵衢网络,支持10万卡级集群规模,总算力200EFLOPS,把 Token 生成时延压到10毫秒以内,千卡每秒吞吐达到500万 Token,在线服务可用性99.95%——华为云称之为“Token 工厂”。 而配套的 CCE Volcano Next 调度引擎,则通过“训推共池+碎