Claude Code automated research wins hackathon championship! Winner: I don’t even know how I won

ChainNewsAbmedia


2026-04-11 17:04:37

At Paradigm’s Autoresearch Hackathon, a competitor who had designed virtually none of his strategy by hand took the championship. The winner, Ryan Li, who is also CEO of SurfAI, said that nearly the entire problem-solving process was carried out by AI and that he “didn’t know how he won,” yet he still took first place in the Prediction Market Challenge.

The competition asked participants to design a market-making strategy for a simulated binary prediction market: provide liquidity to the order book through limit orders while balancing profitability against two kinds of counterparties, “arbitrageurs” and “retail flow.” Final rankings were determined by the average edge (profit advantage) across 200 random simulations. Ryan’s final score was a mean edge of $42.32 (taken as the median across three sets of random seeds), and after re-scoring he topped the leaderboard.
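The scoring rule described above (mean edge over 200 simulations, then the median across three seed sets) can be sketched roughly as follows. This is only an illustration: `simulate_edge` is a hypothetical stand-in that draws a random number, not the competition's simulator, and the seed values are arbitrary.

```python
import random
import statistics


def simulate_edge(rng: random.Random) -> float:
    """Hypothetical stand-in for one simulated market run.

    The real simulator is not described in the article; this just
    draws a noisy edge value so the aggregation can be demonstrated.
    """
    return rng.gauss(40.0, 15.0)


def mean_edge(seed: int, n_sims: int = 200) -> float:
    """Average edge over n_sims simulations for one seed set."""
    rng = random.Random(seed)
    return statistics.fmean([simulate_edge(rng) for _ in range(n_sims)])


def final_score(seeds: list[int]) -> float:
    """Median of the per-seed-set mean edges."""
    return statistics.median([mean_edge(s) for s in seeds])


score = final_score([1, 2, 3])  # three seed sets, as in the article
print(f"median of per-seed-set mean edges: {score:.2f}")
```

Taking the median across seed sets makes the final number less sensitive to one unusually lucky or unlucky set of random draws.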

Claude Code + Codex automated research produced 1,039 strategies

Unlike traditional quantitative trading or market-making strategies, which rely on human experts to build and tune models, Ryan followed the “Bitter Lesson” articulated by Rich Sutton: let computational power and search at scale beat human expertise. He turned the entire problem into an “automated research” (autoresearch) process, exploring the solution space with multiple AI agents in parallel rather than optimizing by hand.

Throughout the process he ran 8 to 20 AI agents in parallel (primarily Claude Code, with additional help from Codex). Each agent was responsible for a different set of hypotheses and a different region of parameter space, and continuously generated strategies, ran simulations, and reported results. In the end he accumulated 1,039 strategy variants, ran more than 2,000 evaluations, and automatically generated 47 parameter-sweep scripts. The overall search amounted to compressing weeks of manual experimentation into a few hours.
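The parallel search loop can be pictured with a small sketch. Everything here is an assumption made for illustration: the `evaluate` objective is a toy function rather than the market simulator, the `spread` and `size` parameters are invented, and plain worker threads stand in for the AI agents.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def evaluate(params: dict) -> float:
    """Hypothetical objective standing in for a simulator run.

    Toy surface with its peak (25.0) at spread=0.04, size=10.
    """
    return (25.0
            - 4000.0 * (params["spread"] - 0.04) ** 2
            - 0.1 * (params["size"] - 10) ** 2)


def agent(agent_id: int, n_trials: int = 50) -> dict:
    """One worker: sample its own slice of parameter space, report its best."""
    rng = random.Random(agent_id)
    lo = 0.01 * agent_id  # each agent explores a different spread band
    best = None
    for _ in range(n_trials):
        params = {"spread": rng.uniform(lo, lo + 0.01),
                  "size": rng.randint(1, 20)}
        score = evaluate(params)
        if best is None or score > best["score"]:
            best = {"agent": agent_id, "params": params, "score": score}
    return best


def run_search(n_agents: int = 8) -> list[dict]:
    """Run all agents in parallel and rank their reports by score."""
    with ThreadPoolExecutor(max_workers=n_agents) as pool:
        reports = list(pool.map(agent, range(n_agents)))
    return sorted(reports, key=lambda r: r["score"], reverse=True)


leaderboard = run_search()
print(leaderboard[0])
```

Giving each worker its own region keeps the agents from duplicating each other's experiments, which is the point of assigning them different assumptions and parameter spaces.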

A 900-line Python market-making algorithm generated by AI won the hackathon

At the strategy level, the winning solution was a market-making algorithm of roughly 900 lines of Python. Its core logic did not come from a single design but from stacking multiple modules that had each proven effective. These include avoiding extremely narrow bid-ask spreads, where arbitrageurs consistently win; estimating the true price using information theory; dynamically adjusting quote sizes according to arbitrage risk; and proactively stepping in to capture high-profit regions when the competing order book is emptied.
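Two of the modules listed above, a floor on how tight the quotes may be and quote sizes that shrink as arbitrage risk rises, can be illustrated with a toy sketch. This is not the winning algorithm: the function name, the linear sizing rule, and every constant (`min_half_spread`, `base_size`, the 0.01 and 0.99 price bounds) are assumptions chosen only to make the idea concrete.

```python
from dataclasses import dataclass


@dataclass
class Quote:
    bid: float
    ask: float
    size: float


def make_quote(fair_price: float, p_arb: float,
               min_half_spread: float = 0.02,
               base_size: float = 100.0) -> Quote:
    """Toy quoting rule for a binary market with prices in (0, 1).

    - Never quotes tighter than min_half_spread around the fair price.
    - Scales size down linearly as the (assumed) probability of being
      picked off by an arbitrageur, p_arb, goes up.
    """
    p_arb = min(max(p_arb, 0.0), 1.0)        # clamp to a valid probability
    size = base_size * (1.0 - p_arb)          # less size when risk is high
    bid = max(0.01, fair_price - min_half_spread)
    ask = min(0.99, fair_price + min_half_spread)
    return Quote(bid=bid, ask=ask, size=size)


calm = make_quote(fair_price=0.50, p_arb=0.25)
risky = make_quote(fair_price=0.50, p_arb=0.90)
print(calm)
print(risky)
```

In this sketch the spread floor and the sizing rule are independent knobs, which mirrors the article's description of separate modules stacked together.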

The most crucial breakthrough came from an AI agent that “completely abandoned existing strategies and started from scratch.” When overall optimization had stalled at around +25 edge, the agent independently discovered a sizing model centered on “the probability of arbitrage risk,” lifting performance in one step to +44, the turning point of the entire competition. The result also directly supported Ryan’s methodology: when a search is stuck in a local optimum, restarting is more effective than fine-tuning.
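The claim that restarting can beat fine-tuning is easy to see on a toy objective. The sketch below is an assumption-laden illustration, not the agents' actual procedure: `edge` is an invented one-dimensional function with a local peak of about 25 and a higher peak of about 44 (numbers borrowed from the article only for flavor), and the search is plain random hill climbing.

```python
import math
import random


def edge(x: float) -> float:
    """Toy objective: local peak of ~25 near x=2, higher peak of ~44 near x=8."""
    return (25.0 * math.exp(-(x - 2.0) ** 2)
            + 44.0 * math.exp(-(x - 8.0) ** 2))


def hill_climb(x: float, rng: random.Random,
               steps: int = 500, step_size: float = 0.1) -> tuple[float, float]:
    """Fine-tuning: accept only small moves that improve the score."""
    best = edge(x)
    for _ in range(steps):
        cand = x + rng.uniform(-step_size, step_size)
        val = edge(cand)
        if val > best:
            x, best = cand, val
    return x, best


def with_restarts(rng: random.Random, n_restarts: int = 20) -> tuple[float, float]:
    """Restarting: climb from many fresh random starting points, keep the best."""
    runs = [hill_climb(rng.uniform(0.0, 10.0), rng) for _ in range(n_restarts)]
    return max(runs, key=lambda r: r[1])


rng = random.Random(0)
_, tuned = hill_climb(2.0, rng)       # starts on the local peak and stays there
_, restarted = with_restarts(rng)     # fresh starts can land in the better basin
print(f"fine-tuning only: {tuned:.1f}, with restarts: {restarted:.1f}")
```

Small improving steps cannot cross the valley between the two peaks, so the fine-tuned run stays near the local optimum, while a fresh start that lands in the other basin climbs to the higher one.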

The absolute advantage of AI research: automated trial and error

In his summary, Ryan said the key to this competition was not designing a “smart strategy” but building a system that can search at scale, validate ideas, and discard the ones that fail. Rather than relying on human intuition, the approach lets AI try things across a vast solution space and amplifies efficiency through parallelization and automation.

The case also reinforces the shifting role of “agentic AI” in engineering and research workflows. AI is no longer just an assistive tool; it can serve directly as the core execution unit for exploration and decision-making. For some highly structured, simulatable problems, humans can step out of the “problem solver” role entirely and instead design the search framework and evaluation mechanisms.

This article, “Claude Code automated research won the hackathon! Winner: I honestly have no idea how I won,” first appeared on ChainNews ABMedia.

