题图:人工智能代理消耗的词元数量远超早期的聊天机器人,因此中国低成本生产词元的能力使其获得了新的竞争优势。

China is gaining ground in the global AI industry’s hottest commodity: tokens.
中国正在全球人工智能产业最热门的商品——词元领域取得进展。

Since February, Chinese AI models made by groups such as DeepSeek and MiniMax have overtaken US rivals in token consumption, according to OpenRouter data, which tracks these units of text, code or data processed by large language models. 
据追踪大语言模型处理文本、代码或数据单元的OpenRouter平台数据显示,自二月起,由深度求索、MiniMax等企业研发的中国人工智能模型在词元处理量上已超越美国同类产品。

The shift points to a deeper change in the AI race, with Nvidia’s Jensen Huang saying this month that the production and use of the digital units will drive the AI economy. Because developers are charged per token, it doubles as both a proxy for adoption of models and a pricing battleground between AI companies.
这一转变标志着人工智能竞赛正发生更深层次的变化。英伟达首席执行官黄仁勋本月表示,词元的生产和使用将推动人工智能经济发展。由于开发者按词元数量付费,因此这既可作为模型采用率的衡量指标,也成为人工智能企业之间的定价战场。

As AI agents, such as those built on the open-source platform OpenClaw, consume vastly more tokens than earlier chatbots, the ability to cheaply produce tokens is reshaping global competition — and giving China a new edge.
随着基于开源平台OpenClaw等AI智能体消耗的词元数量远超早期聊天机器人,低成本生成词元的能力正在重塑全球竞争格局——这也为中国带来了新的竞争优势。

“If your agent is burning through millions of tokens a day, even a small per-token price difference becomes a significant line item,” said Will Liang, chief executive of Amplify AI Group, a Sydney-based technology consulting firm. “That’s a structural tailwind for Chinese labs, and it only grows as agentic adoption scales.”
“如果你的智能体每天消耗数百万个词元,即使每个词元的价格差异很小,也会成为一项重要的开支项目,”总部位于悉尼的技术咨询公司Amplify AI集团的首席执行官梁威尔表示。“这对中国实验室来说是一个结构性利好,而且随着智能体应用的规模化,这种优势只会增长。”

Chinese AI groups’ cost advantage stems from cheaper energy and more efficient models, allowing companies such as MiniMax and Moonshot to charge $2 to $3 per million output tokens, compared with about $15 for Anthropic’s Claude Sonnet 4.5 — a near sixfold gap.
中国AI企业的成本优势源于更便宜的能源和更高效的模型,这使得像MiniMax和月之暗面这样的公司能够对每百万个输出词元收取2至3美元的费用,而Anthropic的Claude Sonnet 4.5的收费约为15美元——差距接近六倍。



The difference becomes pronounced with AI agents, which consume far more tokens than chatbots. Summarising Shakespeare’s Hamlet might take about 30,000 tokens for a chatbot, but an AI agent can require up to 20mn on a minor coding task.
这种差异在使用AI智能体时变得尤为明显,因为智能体消耗的词元数量远超聊天机器人。总结莎士比亚的《哈姆雷特》对于聊天机器人可能只需要大约3万个词元,但一个AI智能体完成一个简单的编码任务就可能需要多达2000万个词元。

That is changing how AI developers choose to spend their money. Terry Zhang, a Hong Kong-based developer, said he now spends about $50 a day using Moonshot’s Kimi model for roughly 80 per cent of his work, reserving Anthropic’s Claude for more complex tasks.
这正在改变AI开发者的资金分配方式。常驻香港(特区)的开发者张特瑞表示,他现在每天花费约50美元使用月之暗面的Kimi模型来完成大约80%的工作,而将Anthropic的Claude留作处理更复杂任务。

“I used to call only Claude but now with an increasing amount of workload, using just Claude would cost me about $900 a day,” he said. “It’s too much and the mixed use of Kimi and Claude works well for me.”
“我以前只用Claude,但现在工作量越来越大,如果只用Claude,我每天要花费大约900美元,”他说。“这太多了,混合使用Kimi和Claude对我来说效果很好。”

The trend is feeding through to revenues. MiniMax, whose M2.5 model is now ranked among the most used globally by token consumption, has seen token usage rise 476 per cent from a month ago as of March 20, according to OpenRouter. 
这一趋势已开始体现在营收上。根据OpenRouter的数据,截至3月20日,其M2.5模型在全球词元使用量排名中位居前列的MiniMax,其词元使用量较一个月前增长了476%。

While OpenRouter accounts for only a fraction of the global model consumption, it is widely used as an industry indicator, as such data is scarce elsewhere.
尽管OpenRouter仅占全球模型消费量的一小部分,但由于此类数据在其他渠道十分稀缺,该平台被广泛用作行业风向标。



US groups are still growing rapidly as the overall market expands, with OpenAI, Anthropic and Google all reporting strong revenue growth and adoption. But lower-cost Chinese models have obtained an opening to gain ground among users around the world.
随着整体市场扩张,美国企业集团仍在快速增长,OpenAI、Anthropic和谷歌均报告了强劲的收入增长和用户采用率。但成本更低的中国模型已获得突破口,在全球用户中取得进展。

China’s token pricing advantage stems partly from the country’s vast investment in renewable energy. The Chinese government this month designated “computing-electricity synergy” a national priority in its 2026 work report, explicitly lixing energy policy with AI competitiveness.
中国在词元定价上的优势部分源于该国对可再生能源的巨大投资。中国政府在本月的2026年工作报告中将“算电协同”列为国家优先事项,明确将能源政策与人工智能竞争力挂钩。

On the software side, Chinese groups have embraced efficient AI architectures, such as a “mixture-of-experts” designs that reduce computational demand, sometimes at the expense of accuracy. This push for computing efficiency has been driven by a shortage of advanced chips in China due to US export controls.
在软件方面,中国团队采用了高效的人工智能架构,例如“混合专家”设计,这种设计降低了计算需求,有时以牺牲准确性为代价。这种对计算效率的追求,是由美国出口管制导致中国先进芯片短缺所驱动的。

There are technical constraints. Zhipu AI’s GLM-5 model briefly topped OpenRouter charts in February before usage surged beyond its compute capacity, causing delays and service degradation.
技术限制依然存在。智谱AI的GLM-5模型曾在2月短暂登顶OpenRouter排行榜,随后因使用量激增超出其计算承载能力,导致服务延迟和性能下降。

The company, which had to apologise and raise prices, saw its shares drop 22 per cent on the day, erasing more than $10bn in market value.
该公司不得不公开致歉并上调价格,其股价在事发当日暴跌22%,市值蒸发逾100亿美元。

“The model’s capability matters, but stable compute and service are equally indispensable,” said one veteran developer at Google. Google’s Gemini 3 Flash is ranked second among the top five most-used models this month, trailing behind Minimax.
“模型能力固然重要,但稳定的计算资源和服务同样不可或缺,”谷歌一位资深开发者表示。谷歌的Gemini 3 Flash模型在本月使用量前五的模型中排名第二,仅次于MiniMax。

China’s tech giants have moved quickly to press their advantage. Earlier this month Alibaba announced the creation of Alibaba Token Hub, a new business group that will be led by chief executive Eddie Wu. The unit signals Alibaba’s view that token economics will define the next phase of AI competition.
中国科技巨头正迅速扩大这一优势。本月初,阿里巴巴宣布成立ATH事业群,该新业务集团将由首席执行官吴泳铭直接领导。这一组织架构调整彰显了阿里巴巴的判断:词元经济将定义人工智能竞争的下一个阶段。



“We are standing at the threshold of an AGI inflection point,” Wu wrote in an internal memo last week. “Billions of AI agents are poised to take on an ever-greater share of digital work, each powered by tokens generated by models, and these agents will increasingly become the primary interface between people and the digital world.”
“我们正站在通用人工智能的拐点门槛上,”吴泳铭在上周的内部备忘录中写道。“数以亿计的AI智能体即将承担越来越多的数字化工作,每个智能体都由模型生成的词元驱动,这些智能体将日益成为人与数字世界之间的主要交互入口。”

Whether China’s token advantage can persist remains unclear, especially as some companies remain wary of relying on models run on Chinese data centres.
中国的词元优势能否持续尚不明朗,尤其是一些公司仍对依赖运行于中国数据中心上的模型持谨慎态度。

“The geopolitical headwinds are significant, particularly for governments and regulated industries,” said Amplify’s Liang. “Regulators are asking harder questions about where data is processed and under whose jurisdiction it falls.”
“地缘政治逆风相当显著,对政府和受监管行业而言尤其如此,”Amplify的梁先生表示。“监管机构正对数据处理地点及其所属司法管辖权限提出更严苛的质询。”