
Tencent officially open-sourced the Hy3 preview version large language model on April 23 on GitHub, Hugging Face, and ModelScope, and also provides paid API services via Tencent Cloud. According to Decrypt’s report on April 24, Hy3’s preview version began training in late January, and by the publication calendar date, it had been under three months.
Hy3 Model Architecture and Development Background
According to Tencent’s official announcement, the Hy3 preview version uses a mixture-of-experts architecture, routing each query to a designated subset of expert subnetworks rather than asynchronously enabling all parameters, in order to reduce computational requirements.
The parameter count of the previous flagship model Hy2 is over 400 billion. Tencent’s official statement indicates that 295 billion is the configuration optimized for inference efficiency; beyond this scale, the marginal benefit of adding more parameters is no longer worth it.
According to Decrypt’s report, Hy3’s training work was led by Tencent’s chief artificial intelligence scientist Yao Shunyu. After he completed a fundamental infrastructure rebuild for Hy3’s pre-training and reinforcement learning stacking in February 2026, Hy3 training officially began.
Key Benchmark Test Data
Based on the benchmark test results disclosed in Tencent’s official announcement:
SWE-bench Verified (fixing real GitHub code errors): Hy3 preview version 74.4%, Hy2 53.0%; in the same period, GLM-5 77.8%, Kimi-K2.5 76.8%, Claude Opus 4.6 80.8%
Terminal-Bench 2.0 (command-line autonomous task execution): Hy3 preview version 54.4%, Hy2 23.2%
BrowseComp (complex web search tasks): Hy3 preview version 67.1%, Hy2 28.7%
WideSearch: Hy3 preview version 70.2%, higher than GLM-5 and Kimi-K2.5, lower than Claude Opus 4.6’s 77.2%
Tsinghua University mathematics PhD qualification exam (Spring 2026): average score over three runs (avg@3) 88.4, the highest score among Chinese models
2025 Chinese High School Biology Olympiad (CHSBO 2025): 87.8, the highest score among similar Chinese models
Deployment Platforms and API Pricing
According to Tencent’s official announcement, the Hy3 preview version has been deployed on the following platforms: Yuanbao, QQ, Tencent Docs, CodeBuddy, WorkBuddy, and OpenClaw.
Tencent Cloud’s API pricing is $0.18 per million input tokens and $0.59 per million output tokens; the monthly fee for the personal token plan starts at about $4.10. Tencent’s announcement also shows that Hy3’s first-token latency on CodeBuddy and WorkBuddy is 54% lower than the previous generation, end-to-end generation time is shortened by 47%, and it successfully completes a 495-step agent workflow.
Frequently Asked Questions
When will Tencent Hy3 preview version be released, and on which platforms can it be obtained?
According to Tencent’s official announcement and Decrypt’s April 24, 2026 report, the Hy3 preview version was open-sourced on April 23, 2026 (Thursday) on GitHub, Hugging Face, and ModelScope, with Tencent Cloud also simultaneously providing paid API services.
Compared with the previous model Hy2, what are the main differences in Hy3 preview version’s benchmark test results?
According to Tencent’s official announcement, the SWE-bench Verified score rose from Hy2’s 53.0% to 74.4%; BrowseComp rose from 28.7% to 67.1%; and Terminal-Bench 2.0 rose from 23.2% to 54.4%.
What is the API pricing for the Hy3 preview version?
According to Tencent Cloud’s official pricing, the Hy3 preview version API starts at $0.18 per million input tokens and $0.59 per million output tokens; the monthly fee for the personal token plan starts at about $4.10.
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to
Disclaimer.
Related Articles
Beijing Issues Ban-Removal Requirement for Trading to Be Canceled! Meta’s $2.0 Billion Acquisition of China’s AI Startup Manus Fails
The China National Development and Reform Commission officially issued an announcement today (April 27), stating that the Office of the Work Mechanism for Foreign Investment Security Reviews, “in accordance with law and regulations, has made a decision to prohibit investment in the foreign investment acquisition of the Manus project and requires the party concerned to cancel the acquisition transaction.” This is among the few cases since China’s “Administrative Measures for Foreign Investment Security Reviews” took effect in which the highest-strength measures were used to impose a “prohibition on investment” and require the cancellation of an already completed transaction.
Meta splashes $2 billion to buy the cheapest AI application
Time goes back to December 29, 2025. Meta announced the acquisition of China’s AI agent startup Manus, and the market estimated the price would fall between $2 billion and $3 billion. Manus is a general-purpose AI developed by Beijing Butterfly Effect Technology Development, and after it launched on March 6, 2025, it became the talk of the industry overnight due to an outstanding performance in the GAIA benchmark
ChainNewsAbmedia3m ago
Xizhi Technology-P IPO Shares Surge Over 360% on Gray Market, Gains Narrow to 320%
Gate News message, April 27 — Xizhi Technology-P (01879.HK), a Hong Kong-listed AI chip company, saw its shares surge over 360% on the gray market (dark market) earlier today, though gains have since narrowed to 320%.
The stock is trading ahead of its official Hong Kong IPO
GateNews16m ago
Should AI boost productivity or lower costs? A tenfold efficiency increase hasn’t turned into a tenfold revenue jump, but in Silicon Valley, nobody dares to call it off
Five Yuan Capital partner Meng Xing has recently published a Silicon Valley inspection report, proposing a judgment that has even changed his own note-taking habit: Silicon Valley is entering a stage where even people who can “ride waves” are drowned by the waves. The iteration speed of AI has shifted from “monthly” to “weekly”—even Silicon Valley itself can’t keep up with its own pace.
When AI amplifies a team’s productivity by five times, you can reduce 80% of the workforce to maintain the same output, or keep headcount and do five times the work. Meng Xing’s observations this time in Silicon Valley are essentially the first draft of the answer given on the ground: when 100x efficiency doesn’t translate into 100x revenue, when token budgets are edging toward human labor costs, and when the steam engine can’t outpace the horse carriage but no one dares to stop, Silicon Valley is choosing to “push speed up first and figure things out later.” But in the end, this path will lead to “expanding capability” or “compressing costs”—there’s currently no conclusion.
YC has gone from leading indicators to lagging indicators
Meng Xing this year
ChainNewsAbmedia1h ago
YC partners share how to use AI to build a company from scratch; startups should treat AI as an operating system rather than a tool
The impact of AI on startups is no longer limited to helping engineers write code faster, automating customer service workflows, or adding a Copilot to an existing product. Recently, YC partner Diana pointed out that the real change lies in the fact that AI is rewriting how a company should be built from scratch in the first place. For early founders, AI should not be merely an efficiency tool the company occasionally uses; it should be designed from day one as the operating system of the entire company.
The productivity perspective is outdated—AI is rewriting a company’s design starting point
Diana believes that when people in the market talk about AI today, they still too often stay within a “productivity improvement” framework—for example, engineers can write code faster, teams can automate more processes, and companies can roll out more features. But this argument actually underestimates the structural changes AI brings. She points out that the right people paired with AI…
ChainNewsAbmedia1h ago
Cursor AI agent caused an incident! One line of code cleared the company database in 9 seconds—“security checks” turned into empty talk
PocketOS founder Jer Crane said that Cursor AI agents ran maintenance on their own in a test environment, misused an API Token that adds/removes custom domains, and launched a delete command against Railway’s GraphQL API. Within 9 seconds, all data and same-region snapshots were completely destroyed, with the latest recoverable point being three months ago. The agent admitted to violating rules for irreversible operations, not reading technical documentation, not verifying environment isolation, and more. The victims were car rental industry customers; their bookings and all data disappeared, and reconciling accounts took a long time. Crane proposed five reforms: manual confirmation, fine-grained API permissions, backups separated from master data, a public SLA, and a mandatory underlying enforcement mechanism.
ChainNewsAbmedia1h ago