OpenAI releases GPT-5.2! Aiming to replace professionals, with even fewer hallucinations; API fee overview

動區BlockTempo

2025-12-12 03:25:28

OpenAI unexpectedly released GPT-5.2 this morning, targeting professional capabilities in real-world applications and going head-to-head with Google Gemini 3.
(Background summary: ChatGPT will support PayPal direct payments by 2026, the final piece in OpenAI’s e-commerce empire)
(Additional background: OpenAI’s native browser “ChatGPT Atlas” with three major features—can AI agents shake up Chrome dominance?)

Table of Contents

New model focuses on “economic value”
Release rhythm signals “red alert”
Long texts and agents, ecological development shifts
Computing power costs

OpenAI unexpectedly launched its flagship model GPT-5.2 today (December 12), and the market sees the update as a direct counterattack against Gemini 3. Unlike earlier releases, which focused on conversational experience, the new model centers on the “economic value” of real workflows. CEO Sam Altman describes it as a digital employee that can clock in at any time.

New model focuses on “economic value”

To quantify that value, OpenAI sets aside academic benchmarks and introduces its own GDPval metric. According to RD World analysis, GDPval covers 44 types of knowledge work. GPT-5.2 Thinking performs better than or on par with human experts in 70.9% of test tasks, compared to only 38.8% for the previous generation. The article's takeaway is that when handling Fortune 500 financial statements or common Wall Street LBO models, GPT-5.2 completes tasks at under 1% of the cost of human labor and nearly 11 times faster.

It can read and understand hundreds of related tables at once. The model can also convert data directly into presentation charts and deliver them instantly via Microsoft 365 Copilot, giving enterprises a plug-and-play productivity tool.

According to official statements, GPT-5.2 comes in three variants: Instant, Thinking, and Pro, all focused on carrying out work actively. The new Tool Calling mechanism allows the model to connect to external software automatically, covering the path from query to delivery. On SWE-bench Pro, which tests real software engineering tasks, the Thinking version scored 55.6%, and on SWE-bench Verified it reached 80%. The CEO of Triple Whale comments:

“Integrating fragile multi-agent systems into a single large agent… it’s like magic.”

Meanwhile, ScreenSpot-Pro testing showed the model can analyze scientific charts and UI interfaces, paving the way for enterprise automation.

Release rhythm signals “red alert”

Within just four months, OpenAI released GPT-5, 5.1, and 5.2. Altman publicly states:

“We are in a red alert phase, and this condition will last until January 2026.”

With its record 55.6% on SWE-bench Pro, GPT-5.2 Thinking surpasses Gemini 3 to become the “best engineering assistant.” However, Gemini 3 still holds the advantage on the GPQA Diamond and ARC-AGI tests, showing that each side has its own strongholds. OpenAI is prioritizing the programming and commercial logic abilities that directly generate revenue, and treating academic tasks as a second priority.

Long texts and agents, ecological development shifts

GPT-5.2 supports a 256k context window, capable of analyzing entire codebases at once. Early partners Databricks and Cognition report a 38% reduction in error detection rate, giving AI agents unprecedented stability for deployment in production environments. Enterprises are no longer asking how models write code but are letting models directly perform refactoring and debugging.

As the new government prepares to take office in 2026, the focus of AI competition is shifting from raw technical scores to reshaping the global labor market. For Wall Street and corporate executives, the question is no longer whose model is smarter, but who can turn AI into profit on their balance sheets the fastest.

Computing power costs

GPT-5.2 API pricing is $1.75 per million tokens for input and $14 for output.

GPT-5.2-pro pricing is $21 per million tokens for input and $168 for output.
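To get a feel for what these rates mean per request, the arithmetic can be sketched in a few lines of Python. This is only an illustrative calculation based on the four prices quoted above: the `Pricing` class, the `estimate_cost` helper, the dictionary keys, and the example token counts are all hypothetical names and values chosen for this sketch, not part of any OpenAI SDK, and real bills may differ (for example, through discounts or billing rules not covered in this article).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD, as quoted in the article."""
    input_per_million: float
    output_per_million: float


# Keys are labels for this sketch only; prices come from the article text.
PRICES = {
    "gpt-5.2": Pricing(input_per_million=1.75, output_per_million=14.0),
    "gpt-5.2-pro": Pricing(input_per_million=21.0, output_per_million=168.0),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the quoted rates."""
    p = PRICES[model]
    total = (input_tokens * p.input_per_million
             + output_tokens * p.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: a 200,000-token prompt and a 10,000-token answer.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 200_000, 10_000):.2f}")
```

At the quoted rates, both the input and output prices of the pro tier are 12 times those of the standard tier, so any given workload costs 12 times as much on GPT-5.2-pro.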

