广告
加载中

从不限量到自费上班 互联网公司们付不起Token账单了

王琳 2026-06-18 22:41
王琳 2026/06/18 22:41

邦小白快读

EN
全文速览

总:本文介绍了当前互联网行业AI Token使用政策从不限量转为配额管控的最新变化,核心干货如下:

1. 行业整体消耗情况远超预期,国内外大厂都出现严重的成本超支问题:Uber4个月花完2026年AI预算,Meta30天消耗的Token成本超1亿美元,国内有AI项目一晚上烧掉价值200万元的Token,腾讯如果维持全员不限量,一年Token成本要达到252亿元,占其2025年研发费用近三成。

2. 目前各企业的调整方案:腾讯改为按工作任务动态调配额度,优先免费开放内部混元大模型,不同岗位额度差异大,不够可向上级申请追加;其他大厂多采用岗位差异化额度,不够可审批额外采购;中小厂大多设置月度固定额度,推广性价比更高的国产大模型。

3. 行业趋势:此前企业盲目鼓励Token消耗,存在大量浪费,很多企业投入后并没有提升运营效率,当前行业正在破除对AI的Token盲目崇拜,走向理性落地。

总:本文反映了当前AI落地企业端的新趋势,给品牌布局AI的成本管控和落地策略提供了实用参考,干货如下:

1. AI布局的成本提醒:AI应用的实际落地成本远高于企业初期预期,哪怕是财大气粗的头部互联网大厂都难以承担无限制的Token消耗,目前全行业都开始管控成本,品牌布局AI时要提前做好成本预算,预留调整空间,避免出现成本超支拖垮整体预算的问题。

2. 落地策略参考:目前行业通用的成熟做法是按业务实际价值动态分配Token额度,优先使用高性价比的国产大模型,只给能产生实际价值的项目保障足额资源,避免无意义的浪费。

3. 考核方向参考:此前很多品牌将Token消耗量作为AI落地的考核指标,实际证明这种方式会催生浪费,也无法提升效率,百度李彦宏提出的DAA(日活智能体数)指标,更能体现AI落地的实际价值,值得品牌参考。

总:当前企业AI应用领域出现成本管控的新变化,给布局AI的卖家提示了风险和机会,干货整理如下:

1. 风险提示:AI应用的实际成本远高于初期预期,无限制使用会带来远超预算的成本支出,目前不少中小公司已经从不限额转为月度配额制,甚至出现额度不够需要员工自行自费购买的情况,卖家布局AI降本提效时,一定要提前做好成本管控,避免盲目投入导致成本失控。

2. 可借鉴的落地方法:目前行业主流的管控方式是按任务动态分配Token额度,优先使用性价比更高的国产大模型,对能产生实际价值的项目保障额度,这种模式既可以控制成本,也不影响核心业务的AI落地,非常适合中小卖家参考。

3. 机会提示:当前行业已经从盲目AI热转向理性落地,国产大模型凭借性价比优势获得越来越多企业采用,卖家可以抓住国产大模型的红利,低成本布局适配自身业务的AI提效工具,降低运营成本。

总:本文反映了企业AI转型过程中的真实问题,给工厂推进数字化和AI转型带来不少启示,干货整理如下:

1. AI转型要理性管控成本:很多互联网企业前期盲目鼓励全员使用AI,放开Token不限量使用,最终成本远超预算,不少中小厂直接扛不住收紧额度,工厂做AI转型(比如用AI做产品设计、生产排期、客户服务)时,一定要提前算好成本账,不要盲目追求全栈AI化和高端海外模型,避免投入失控。

2. 落地路径启示:目前行业通用的理性落地模式是:按实际业务需求分配额度,优先使用高性价比国产大模型,只给能产生实际价值的项目保障资源,这种模式同样适配工厂的AI转型,工厂可以先从简单易落地的场景切入,验证价值后再逐步扩大投入,避免一次性投入过多。

3. 商业机会提示:当前大量企业转向国产大模型,对适配不同行业的定制化AI应用需求快速增长,有条件的工厂可以探索和国产AI平台合作,开发适配制造业生产、设计需求的专属AI工具,挖掘新的业务增长点。

总:本文暴露了当前AI企业客户的核心痛点,指明了行业发展的新趋势,给AI服务商提供了不少参考,干货整理如下:

1. 核心客户痛点:当前企业对AI的需求已经从初期尝鲜转向实际落地,最突出的痛点就是AI使用成本过高,不限量使用模式下企业成本很容易超出预算,同时市场存在明显的能力断层:高端海外模型成本太高,低价国产大模型无法满足复杂任务需求,很多企业额度不够只能自费购买,这是服务商可以切入的机会点。

2. 行业发展趋势:企业不再单纯将Token消耗量作为AI落地的考核指标,转而关注AI的实际产出价值,李彦宏提出的DAA(日活智能体数)将会成为衡量AI价值的新指标,整个行业正在从AI狂热走向理性落地。

3. 解决方案方向:服务商可以针对不同客户推出分级定价的Token套餐,适配不同岗位不同项目的需求;同时加快提升国产大模型的复杂任务处理能力,满足企业降本需求;还可以探索按实际产出计费的新模式,适配企业当前的考核方向。

总:本文反映了当前企业客户对AI大模型的最新需求变化,给平台商的运营管理和风险规避提供了参考,干货整理如下:

1. 客户最新需求:当前企业客户对AI大模型的需求已经从不限量使用转向成本可控的定向供给,大量企业开始优先选择高性价比的国产模型,对差异化额度管理、按项目申请追加额度的功能需求明显提升,平台需要尽快适配客户的成本管控需求,开发对应的功能模块。

2. 运营招商方向:现在头部企业都要求员工优先使用内部模型,屏蔽外部竞对模型,中小客户更倾向选择成本更低的模型,平台可以针对不同客户群体推出差异化产品:对大B客户提供定制化额度调配服务,对中小客户推出高性价比的打包套餐,吸引更多客户入驻。

3. 风险规避提示:此前行业盲目崇拜Token消耗,催生了大量无意义浪费,现在行业已经开始祛魅,平台不要过度营销Token用量概念,要引导客户关注AI实际产出,还可以参考DAA指标衡量平台生态繁荣度,规避行业退潮带来的需求下滑风险。

总:本文记录了AI产业落地过程中的最新动向和核心问题,给产业研究提供了一手素材,干货整理如下:

1. 产业最新动向:全球AI产业正在从初期的盲目扩张转向理性落地,国内外从头部大厂到中小互联网公司,都已经调整了AI Token供给策略:从原先鼓励全员最大化使用Token,甚至将Token消耗和升职加薪挂钩,转为按任务动态调配额度,推广国产大模型替代高端海外模型,全行业都在管控AI成本,AI Token崇拜正在经历祛魅。

2. 产业核心新问题:当前AI大模型的成本结构不合理,企业落地AI的实际成本远高于预期,行业内Token浪费现象严重,很多企业投入大量成本后并没有实现运营效率提升,反而拖慢了核心产品的开发进度;同时市场存在明显的供需错配,低成本国产模型能力不足,高端模型成本过高,无法满足企业复杂需求,这是当前产业需要解决的核心矛盾。

3. 研究新方向:百度李彦宏提出用DAA(日活智能体数)替代Token消耗量,作为衡量AI平台生态繁荣度的指标,为产业研究和商业模式评估提供了新的思路,值得深入研究。

返回默认

声明:快读内容全程由AI生成,请注意甄别信息。如您发现问题,请发送邮件至 run@ebrun.com 。

我是 品牌商 卖家 工厂 服务商 平台商 研究者 帮我再读一遍。

Quick Summary

This article covers the latest industry shift in the internet sector, where unlimited AI token usage policies are being replaced by quota-based management. Key takeaways are as follows:

1. Industry-wide token consumption has far outpaced initial projections, leaving major global and domestic tech firms facing severe cost overruns: Uber exhausted its entire 2026 AI budget in just four months; Meta spent over $100 million on token costs in 30 days; one domestic Chinese AI project burned through ¥2 million worth of tokens in a single night; if Tencent maintained unlimited access for all employees, annual token costs would hit ¥25.2 billion, accounting for nearly 30% of its projected 2025 R&D budget.

2. Adjustment measures adopted by firms: Tencent now dynamically allocates quotas based on work tasks, prioritizes free internal access to its Hunyuan large language model, sets large quota gaps across different roles, and allows employees to apply for additional quotas from supervisors. Most other major Chinese tech firms implement role-based differentiated quotas with approval processes for extra token purchases. Most small and medium-sized enterprises (SMEs) set fixed monthly quotas and promote more cost-effective domestic large models.

3. Industry trend: After a period of blindly encouraging token consumption that resulted in massive waste, with many firms failing to improve operational efficiency despite heavy investment, the industry is now moving beyond blind token worship and toward pragmatic, rational AI implementation.

This article outlines a new trend in enterprise AI adoption and offers actionable insights for brands looking to manage AI costs and implement AI effectively. Key takeaways are as follows:

1. A critical cost reminder: The actual cost of rolling out AI is far higher than most firms initially expect. Even deep-pocketed top internet giants cannot absorb the cost of unlimited token usage, and the entire industry is now tightening AI cost controls. Brands should draft detailed cost budgets in advance and reserve buffer space for adjustments to avoid cost overruns derailing overall budgets.

2. A reference for implementation strategy: The established industry best practice is dynamically allocating token quotas based on actual business value, prioritizing cost-effective domestic large models, and guaranteeing sufficient resources only for projects that deliver tangible value, to eliminate unnecessary waste.

3. A reference for performance measurement: Many brands previously used token consumption as a metric to evaluate AI adoption, but this approach has proven to encourage waste and fails to improve efficiency. Baidu CEO Robin Li's proposed DAA (Daily Active Agents) metric, which better reflects the actual value of AI implementation, is a useful framework for brands.

The shift to cost controls for enterprise AI use brings both risks and opportunities for sellers leveraging AI. Key insights are summarized below:

1. Risk warning: The actual cost of AI adoption is far higher than initial projections, and unlimited usage can quickly lead to major budget overruns. Many SMEs have already shifted from unlimited access to monthly quota systems, and some even require employees to pay for extra tokens out of pocket when they exceed their quotas. Sellers adopting AI to cut costs and boost efficiency must implement cost controls upfront to avoid runaway spending from blind investment.

2. Actionable implementation approaches: The mainstream industry cost control model, which dynamically allocates token quotas based on tasks, prioritizes cost-effective domestic large models, and reserves sufficient quotas for value-generating projects, controls costs without disrupting AI integration for core business. It is particularly well-suited for small and medium-sized sellers.

3. Opportunity outlook: The industry has shifted from irrational AI hype to pragmatic implementation. Domestic large models are gaining adoption across more firms thanks to their cost advantages. Sellers can capitalize on this trend to build custom AI efficiency tools aligned with their business at low cost, and reduce overall operating expenses.

This article outlines real challenges in enterprise AI transformation and offers key insights for factories advancing digital and AI transformation. Key takeaways are as follows:

1. Implement AI transformation with rational cost controls: Many internet firms initially encouraged unlimited employee AI access to tokens, only to end up with far higher costs than budgeted, leaving many SMEs unable to absorb overspending and forced to tighten quotas. Factories adopting AI for use cases including product design, production scheduling and customer service should calculate costs carefully upfront, avoid blindly pursuing full-stack AI adoption or premium overseas models, and prevent runaway investment.

2. Guidance on implementation paths: The industry's standard pragmatic implementation model, which allocates quotas based on actual business needs, prioritizes cost-effective domestic large models, and reserves resources only for value-generating projects, is well-suited for factory AI transformation. Factories can start with simple, easy-to-implement use cases to verify value before scaling up investment gradually, avoiding excessive one-time spending.

3. New business opportunities: As more firms shift to domestic large models, demand for customized AI applications tailored to specific industries is growing rapidly. Eligible factories can explore partnerships with domestic AI platforms to develop exclusive AI tools customized for manufacturing production and design needs, and unlock new business growth opportunities.

This article identifies the core pain points of enterprise AI clients and outlines emerging industry trends, offering valuable insights for AI service providers. Key takeaways are as follows:

1. Core client pain points: Enterprise demand for AI has shifted from early-stage experimentation to practical implementation, and the most pressing pain point is prohibitively high AI usage costs: unlimited access models easily lead to budget overruns. The market also faces a clear capability gap: premium overseas models are too costly, while low-cost domestic models cannot handle complex tasks. Many firms even require employees to pay for extra tokens out of pocket when quotas run out, creating a clear opportunity for service providers to enter the market.

2. Industry development trends: Enterprises are no longer using token consumption as the core metric for AI adoption, and are instead focusing on actual output value. Robin Li's proposed DAA (Daily Active Agents) metric will likely become the new standard for measuring AI value, as the entire industry shifts from AI hype to rational, practical implementation.

3. Directions for solution development: Service providers can launch tiered token pricing packages tailored to the needs of different roles and projects. They can also accelerate improvements to the complex task processing capabilities of domestic large models to meet enterprises' cost reduction goals. Additionally, they can explore new pay-for-output billing models aligned with enterprises' current performance measurement frameworks.

This article outlines the latest shifts in enterprise demand for large AI models and offers guidance for platform operators on management and risk mitigation. Key takeaways are as follows:

1. Updated client demand: Enterprise demand for large AI models has shifted from unlimited usage to controlled-cost targeted access. A growing number of enterprises now prioritize cost-effective domestic models, and demand for features including differentiated quota management and project-based additional quota applications has risen sharply. Platforms need to quickly adapt to clients' cost control needs and develop corresponding functional modules.

2. Guidance on operations and business development: Large enterprises now require employees to prioritize internal models and block competing external models, while small and medium clients prefer lower-cost models. Platforms can develop differentiated products for different client segments: offer customized quota allocation services for large enterprise clients, and launch cost-effective bundled packages for small and medium clients to attract more users to the platform.

3. Risk mitigation guidance: The previous industry focus on token consumption led to massive unnecessary waste, and the sector is now moving past this hype. Platforms should avoid overmarketing the concept of token volume, and instead guide clients to focus on actual AI output. They can also adopt the DAA metric to measure platform ecosystem health, to mitigate the risk of falling demand as the industry matures.

This article documents the latest developments and core challenges in AI industry implementation, providing first-hand data for industrial research. Key takeaways are as follows:

1. Latest industry developments: The global AI industry is shifting from early-stage blind expansion to rational implementation. From global and domestic top giants to small and medium-sized internet firms, all have adjusted their AI token supply strategies: from previously encouraging maximum employee token usage (even linking consumption to promotions and salary increases) to dynamic quota allocation based on tasks, and promoting domestic large models as replacements for premium overseas models. The entire industry is now tightening AI cost controls, and blind "token worship" is being abandoned.

2. Core new industry challenges: The current cost structure of large AI models is unreasonable, and the actual cost of enterprise AI adoption is far higher than expected. Widespread token waste has left many firms failing to improve operational efficiency despite heavy investment, even slowing down core product development. The market also faces clear supply-demand mismatches: low-cost domestic models lack sufficient capability, while premium models are too expensive to meet enterprises' complex needs, which is the core contradiction the industry must resolve today.

3. New research directions: Robin Li has proposed replacing token consumption with DAA (Daily Active Agents) as the metric for measuring the prosperity of AI platform ecosystems, offering a new framework for industrial research and business model evaluation that merits further in-depth study.

Disclaimer: The "Quick Summary" content is entirely generated by AI. Please exercise discretion when interpreting the information. For issues or corrections, please email run@ebrun.com .

I am a Brand Seller Factory Service Provider Marketplace Seller Researcher Read it again.

|王琳

全员Token-maxxing(把Token用量拉到极限)还没俩月,互联网公司急速调转船头。

6月5日,腾讯已在内部宣布调整AI Token额度。核心的变化是,全员统一额度改为按工作任务动态调配。通知明确表示,总投入只增不减,对能用AI带来显著提效和价值产出的同学,保障 Token额度,不搞 Token消耗量排名,不贩卖焦虑。

大公司们对Token额度的快速消耗甚至超过了他们自己的预期。今年4月,Uber首席技术官普拉文·纳加(Praveen Naga)表示,公司在4个月内就将2026年的AI预算花光了,Uber 2025年的研发支出达到34亿美元。Meta员工在30天内消耗了60.2万亿个AI token,成本超过了1亿美元。

国内亦是如此。5月20日,《崩坏》系列AI NPC&Gameplay技术团队负责人郑银河透露,有员工为了实现项目,建了几十个Agent共同协作,结果一晚上烧了价值200万元的Token。

曾经,为了践行AI策略,不少公司希望员工最大程度去调用AI工具,甚至搞Token用量排名,以此作为升职、加薪的标准之一,但当看到天价Token账单后,互联网公司们懵了。

3天用掉90%额度,大厂狂砍Token用量

腾讯此次Token额度动态调整并没有提前通知,这让一些员工感到措手不及。一位腾讯研发人员称,自己根本不够用,发布通知到当天,就发现自己有10%的额度了,用claude就是挺不住得烧。

Tech星球了解到,此次调整涉及到包括实习生、外包、正式员工在内的所有人员。目前,只有混元大模型对所有人免费。这样的调整在一些人看来在情理之中。“用脚指头想也知道,怎么可能一直超多超量供应”,一位腾讯员工评论道。

一位腾讯大数据方向的外包员工告诉Tech星球,以前他们使用大模型是积分制,有100000积分,就没有关注具体Token量,但是够一个月使用。现在外包只能申请混元大模型,混元是没有Token限制的。

但混元在所有基础大模型中表现并不优异。凭借在“强推理+256K超长上下文”的能力,Hy3 preview曾连续登顶OpenRouter全球周榜,但整体能力上,尤其复杂任务时,比如编程等,Hy3和DeepSeek V4 Flash、Claude Sonnet 4.6等模型依然存在差距。

但Token的调整对每个事业部每个人的体感不一样,有人只剩100美元,有人则有1万多元人民币。

一位腾讯实习生告Tech星球,调整前,自己只有100美元,调整后算起来有200美元了,市面上的先进模型都能用,但200美元确实不够用,写代码的话一天多的时候就能用50美元。一位腾讯AI预研游戏员工称,自己目前还有12600元,而同事有2.1万元。还有一些人则表示,Token直接砍半了。

一位腾讯后端研发称,虽然目前Token额度有所缩减,但自己所在的组并不受影响,不够就可以向上级申请。

此前,腾讯传出为每位员工发放价值约22万元的Token套餐。按照腾讯集团2026年Q1财报中提到的114848名员工计算,腾讯每年需要支付252亿元费用。作为对比,其2025年的研发费用为857.5亿元。

但现在,即便是财大气粗的腾讯也要开始算细账了,而这只是行业的缩影。Tech星球了解到,国内主流大厂都会要求员工优先使用内部大模型,内部大模型对员工基本免费,甚至一些公司还屏蔽了竞对的模型。但是最终内部模型产出的效果可能依然比不上海外模型。

一位字节跳动员工向Tech星球介绍,公司内部并不强制用AI,“Token额度对大厂是很大的负担,不少互联网公司不同岗位不同部门额度存在差异,并且在字节如果AI相关的技术研发岗位如果额度不够,还可以内部审批去外面单独采购”,他补充道。

一位美团员工表示,并没有听到内部会有额度限制,但自己的额度是完全够用的。一位百度员工则称,内部根据部门不同额度限制的情况也会有差异。

破除Token盲目崇拜

大厂或者还在犹豫是否削减Token额度,但更多中小型互联网公司已经撑不住了。

广州一家做跨境支付的企业决定削减员工的Token用量:从上不封顶到人均每月500美元。而此前一个月,他们消耗掉了40万美元的Token。

“这完全不够用了”,上述公司的一位程序员告诉Tech星球。他所在的公司竟然出现互相借Token想象,比如一位后端开发者2天就消耗掉370美元的Token,额度报警,该开发者开始向他借Token用。

以前大大小小的互联网公司们践行Token-maxxing,生怕错过AI浪潮。于是,员工们拼命研究如何消耗Token。一位上述员工分享道,尤其后端程序员,研发了各种封装包、skill,每个业务有一堆提效工具,一些程序员一开就是好几个agent,一小时就能烧掉上亿Token。这导致新规发布前,一些程序员就已经超出预算1000多美元。消耗1亿Token,如果使用目前编程主流选择Claude Sonnet 4.6,需要至少花费2000元,最高甚至达到1万元。

事实上,Token浪费的情况确实存在。一位新能源汽车员工称,公司每个月给他1000美元的Token额度,他根本用不完,为了消耗,只能用AI写原创小说,比如续写《红楼梦》。

上海某老牌互联网公司员工告诉Tech星球,公司以前是不限的,但现在公司开始统一管理大家的Token额度,每个人都需要走钉钉审批申请Token,每个人的额度是几百元到1000元不等。

这样的情况开始变得普遍。北京某腰部互联网公司员工称,原来大家可以不限额使用Claude Code,公司报销。现在是开放了Anthropic的API接口,每个人每月是1000元额度,并且让大家优先使用更便宜的国产大模型。

但现实是,便宜的大模型只能胜任一些简单的代码补全类任务,一旦遇到复杂任务,需要多轮次反复交互,甚至不如自己手搓。“我现在已经开始自己买额度了,1000元的额度可能根本用不了一周。”

一些公司则要求全栈AI化,这导致Token用量大幅度上升。广州某游戏公司员工称,之前Token全员免费,自己一个月用了小3万Token,部门人人超标,之后就只能用DeepSeek的模型了。

上海一位程序员在社交平台分享道,自己的部门只有4个人,但一个月就消耗掉了6万元Token。现在技术老大直接采买了DeepSeek的Token让技术切换。

Token-maxxing的另外一个结果是,在复盘的时候,不少程序员发现看不懂自己写的代码,甚至都找不到代码在哪里,为什么要这么写。公司的管理者们发现,即便使用了AI,整体的运营效率并没有提升,甚至当大模型需要排队时,反而影响重要产品的节奏。

百度创始人李彦宏在今年的AI开发者大会上首次提出日活智能体数(Daily Active Agents,简称DAA),DAA大致对应移动互联网时代通用的日活用户数(DAU),它看起来,比单纯看Token消耗量更能体现平台和生态真实繁荣程度的度量。

从不设上限的肆意挥霍,到如今精打细算的“配额制”与“国产替代”,互联网公司对AI的盲目崇拜正在经历一场必经的祛魅。

注:文/王琳,文章来源:Tech星球(公众号ID:tech618),本文为作者独立观点,不代表亿邦动力立场。

文章来源:Tech星球

广告
微信
朋友圈

这么好看,分享一下?

朋友圈 分享

APP内打开

+1
+1
微信好友 朋友圈 新浪微博 QQ空间
关闭
收藏成功
发送
/140 0