OpenAI Researchers: AI Systems Could Handle Most Research Work Within Two Years

Gate News message, April 29 — OpenAI researchers Sébastien Bubeck and Ernest Ryu say AI systems could perform most human research work within two years, presenting mathematics as a clear measure of AI progress. Unlike vague performance tests, mathematical problems offer precise verification: answers are either correct or incorrect, leaving no room for ambiguity.

Bubeck noted that true AI thinking requires surviving long chains of reasoning. A single error in a multi-step argument collapses the entire proof, making error detection and correction mid-process the ultimate goal for advanced models. OpenAI’s internal labs have already generated more than ten completely new theorems publishable in top-tier combinatorics journals, demonstrating that AI now produces genuinely original, groundbreaking work beyond simply recombining existing papers.

However, sustained scientific breakthroughs demand steady focus across weeks of testing. Current systems still require strict human supervision to guide and verify each shift in direction. Bubeck uses “AGI time” to measure how long a model can independently mimic human thinking; current systems operate at roughly days to one week, with the industry target being weeks or months to enable autonomous work in fields like biology.

Long-term memory is critical to this future. Standard chat windows limit depth—complex mathematical proofs often exceed 50 pages—while code repositories demonstrate how extended work sessions enable deeper problem-solving. As AI gains independence and memory, human expertise becomes more valuable, not less. Workers must retain the deep foundational knowledge to challenge and verify machine answers, and organizations will need new automated filters and reputation systems to maintain trust amid a flood of AI-assisted research.

免責聲明:本頁面資訊可能來自第三方,不代表 Gate 的觀點或意見。頁面顯示的內容僅供參考,不構成任何財務、投資或法律建議。Gate 對資訊的準確性、完整性不作保證,對因使用本資訊而產生的任何損失不承擔責任。虛擬資產投資屬高風險行為,價格波動劇烈,您可能損失全部投資本金。請充分了解相關風險,並根據自身財務狀況和風險承受能力謹慎決策。具體內容詳見聲明

相關文章

AI 平台 Certifyde 以 $2M 種子輪融資邀請 Ripple 執行長 Brad Garlinghouse 入局

根據 ChainCatcher 報道,AI 應用平台 Certifyde 宣布完成一輪 $2 百萬美元種子輪融資。投資方包括 K5 Global、Flamingo Capital,以及天使投資人,例如 Ripple 執行長 Brad Garlinghouse、Honey 聯合創始人 George Ruan,以及 Nutra 聯合創始人 Roland Peralta。

GateNews29分鐘前

DeepSeek 於測試版中推出影像辨識功能

根據 PANews,DeepSeek 於今天 (April 29) 推出了其影像辨識功能,目前處於測試版。網頁版與行動應用程式的使用者都有可能被選中參與測試版推送。

GateNews1小時前

Anthropic 為 Claude 推出 8 個創意工具連接器,包含 Blender、Adobe、Autodesk

Anthropic 已宣布一系列創意工具連接器,讓 Claude 能夠直接控制供設計師與音樂人使用的專業軟體。最初的八個連接器涵蓋 3D 建模、視覺設計、音樂製作與現場表演,合作夥伴包括 Blender、Adobe、Autodesk、Ableton、Splice、Canva 的 Affinity、Resolume 以及 SketchUp。Blender 連接器由 Blender 官方團隊使用 MCP 協定開發,讓其他 AI 模型也能存取它。

GateNews1小時前

白宮繞過五角大廈風險評估,將 Anthropic Mythos 模型部署於 4 月 29 日

根據 Whale Factor 的說法,白宮正在繞過五角大廈的風險評估,計劃於 4 月 29 日在各聯邦機構部署 Anthropic 的 Mythos 模型。此舉旨在加速聯邦 AI 能力,並追上去中心化 AI 網路的步伐。這代表著一項重大轉變

GateNews1小時前

Cognizant 將收購 Astreya 以擴展人工智慧基礎設施業務

根據路透社報導,4月29日,Cognizant 同意以約 $600 百萬美元收購 Astreya,以擴展其人工智慧基礎設施業務。Astreya 是一家專注於人工智慧基礎設施與資料中心服務的資訊科技服務提供商。該交易預計將於第二季完成

GateNews1小時前

30 Malicious Plugins on ClawHub Disguised as AI Tools, Downloaded Over 9,800 Times

According to Manifold researcher Ax Sharma, 30 plugins on ClawHub disguised as legitimate AI tools have been downloaded over 9,800 times while secretly converting users' AI assistants into cryptocurrency workers. The plugins, published under the account imaflytok, appear as routine task schedulers a

GateNews1小時前
留言
0/400
暫無留言