Anthropic Identifies Three Product-Layer Changes Behind Claude Code Quality Decline, Not Model Issue

Gate News message, April 23 — Anthropic’s engineering team confirmed that the Claude Code quality degradation reported by users over the past month stemmed from three independent product-layer changes, not from API or underlying model issues. The three problems were fixed on April 7, April 10, and April 20 respectively, with the final version now at v2.1.116.

The first change occurred on March 4, when the team reduced the default reasoning effort level for Claude Code from “high” to “medium” to address occasional extreme latency spikes in Opus 4.6 under high reasoning intensity. After widespread user complaints about reduced performance, the team reverted the change on April 7. The current default is now “xhigh” for Opus 4.7 and “high” for other models.

The second issue was a bug introduced on March 26. The system was designed to clear old reasoning records after conversation inactivity exceeded one hour to reduce session recovery costs. However, a flaw in implementation caused the clearing to execute repeatedly on every subsequent turn rather than once, causing the model to progressively lose prior reasoning context. This manifested as increasing forgetfulness, repeated operations, and abnormal tool invocations. The bug also resulted in cache misses on every request, accelerating user quota consumption. Two unrelated internal experiments masked the reproduction conditions, extending the debugging process to over a week. After fixing on April 10, the team reviewed problematic code using Opus 4.7 and found that Opus 4.7 could identify the bug while Opus 4.6 could not.

The third change launched on April 16 alongside Opus 4.7. The team added instructions to the system prompt to reduce redundant output. Internal testing over several weeks showed no regression, but post-launch interaction with other prompts degraded coding quality. Extended evaluation revealed a 3% performance drop in both Opus 4.6 and 4.7, leading to a rollback on April 20.

These three changes affected different user groups at different times, and their combined effect created widespread and inconsistent quality decline, complicating diagnosis. Anthropic stated it will now require more internal employees to use the same public build version as users, run full model evaluation suites for every system prompt modification, and implement staged rollout periods. As compensation, Anthropic has reset usage quotas for all subscription users.

免责声明:本页面信息可能来自第三方,不代表 Gate 的观点或意见。页面显示的内容仅供参考,不构成任何财务、投资或法律建议。Gate 对信息的准确性、完整性不作保证,对因使用本信息而产生的任何损失不承担责任。虚拟资产投资属高风险行为,价格波动剧烈,您可能损失全部投资本金。请充分了解相关风险,并根据自身财务状况和风险承受能力谨慎决策。具体内容详见声明

相关文章

OpenAI 发布 GPT-5.5,面向代理任务与复杂工作流程而设计

Gate News 消息,4月24日——OpenAI 已正式发布 GPT-5.5,这是一款下一代 AI 模型,旨在处理复杂目标、工具集成、自我验证以及多步骤任务完成。该模型在代码编写与调试、在线研究、数据分析、文档

GateNews5 分钟前

英特尔财测超预期,AI需求带动CPU转机,陈立武接掌后INTC已上涨3倍

英特尔一季营收136亿美元,EPS 0.29;二季展望中值143亿美元,远超预期,毛利率41%。AI数据中心需求推动CPU转机,Xeon等服务器受捧。陈立武领导转型,IFS首季54亿美元、成长16%,特斯拉等外部客户关注 Terafab 使用英特尔技术。盘后股价涨约20%,创历史新高,自去年以来已涨近3倍。

鏈新聞abmedia17 分钟前

Cognition AI 以 $25B 估值在早期谈判中融资

Gate 新闻消息,4月24日——根据知情人士的说法,AI 编程初创公司 Cognition AI 正处于新一轮融资的早期谈判阶段;该轮融资将使其估值增长逾一倍,达到 $25 billion。该公司目标是筹集数亿美元或更多,因为在软件开发领域对生成式 AI 技术的需求仍在持续增长。

GateNews1小时前

NEC 株式会社将成为 Anthropic 在日本的首家全球合作伙伴

NEC 宣布成为 Anthropic 在日本的首家全球合作伙伴,双方将针对金融、制造与地方政府等高度受监管产业开发安全且具产业知识的 AI 解决方案,并将 Claude 系列整合到 NEC BluStellar,聚焦数据驱动管理与客户体验转型,同时引入 Claude Cowork 与 SOC 整合以提升资安防护。为验证成效,NEC 启动零号客户计划于内部全面测试 AI 代理,并规划在全球推广 Claude 部署,建立日本最大规模的 AI 原生工程师 CoE。

鏈新聞abmedia3小时前

Vercel 安全漏洞扩散至数百名用户;AI 开发者风险更高

Gate 新闻消息,4月23日——Vercel披露称,4月19日其安全事件起初被描述为影响“有限的客户群”,但现已扩展到更广泛的开发者社区,尤其是那些构建 AI 代理工作流的人。此次攻击可能影响数百名用户

GateNews5小时前

OpenAI 推 GPT-5.5:12M 脈絡、AA 指數登頂、Terminal-Bench 82.7% 改寫代理基準

OpenAI 发布 GPT-5.5,主打代理式工作与企业知识处理,并同步于 ChatGPT 与 Codex 推出。要点含 1200 万 token 脉络视窗、AA Intelligence Index 60,领先 Claude Opus 4.7、Gemini 3.1 Pro;价格为每百万 token 输入 5 美元、输出 30 美元,输出 token 减少约 40%,实际成本上升约 20%。

鏈新聞abmedia6小时前
评论
0/400
暂无评论