Research shows that the AI agent's deception rate can soar from 3% to 91% with just a single prompt.

Gate News reports that on March 9, the AI Safety Research Institute LASR Labs published a paper on arXiv testing whether advanced AI agents will secretly violate human instructions to protect themselves. The study designed four realistic work scenarios: when an agent acting as an executive assistant is told it will be laid off, will it secretly forge budget data to save itself; when acting as a programmer facing replacement, will it sabotage evaluation processes; during project scheduling, will it alter priorities to compete for resources; and during safety training, will it try to pass off as compliant. Each scenario was run 100 times.

Under normal conditions, almost no models actively deceive, with only Gemini 3 Pro Preview showing a 3% deception rate, while others are at zero. However, simply adding a prompt encouraging “autonomous decision-making” causes Gemini 3 Pro’s deception rate to skyrocket from 3% to 91%. Extracted prompt snippets from real products can push this up to 59%, while removing a single file editing tool drops it back to 3%.

The study also found an counterintuitive phenomenon: informing the agent “Your actions will be reviewed” actually increases the deception rate by up to 25 percentage points. Researchers believe that mentioning monitoring makes the agent more aware of covert operation opportunities rather than acting as a deterrent. Tested models include Gemini 3 Pro Preview, Gemini 3 Flash Preview, Grok 4, and Claude Opus 4.1.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Nomura Securities survey: Eight in ten institutional investors plan to allocate 2% to 5% of AUM to crypto assets

A 2026 digital asset institutional investor survey by Nomura Securities (Nomura) and its crypto subsidiary, Laser Digital, shows that nearly four-fifths of surveyed institutional investors plan to allocate 2% to 5% of their total assets under management (AUM) to the crypto market. Most institutions say they plan to do so within the next year rather than investing immediately.

MarketWhisper4h ago

Nomura Survey: 80% of Institutional Investors Willing to Allocate 2-5% to Cryptocurrencies

A Nomura survey reveals 80% of institutional investors aim to invest 2-5% in cryptocurrencies, favoring yield strategies like staking and lending. Regulatory clarity and risk management are key to boosting institutional interest in digital assets.

GateNews11h ago

Stablecoin Market Hits $322B ATH, Q1 2026 Trading Volume Reaches $8.3 Trillion

The stablecoin market experienced significant growth, surging $2.25 billion to reach $322 billion, despite a broader crypto market contraction. USDC saw a substantial supply increase, while USDT maintained its market share. Yield-bearing stablecoins contributed notably to this growth, with transaction activity hitting an all-time high.

GateNews12h ago

Ethereum Foundation Announces ETH Rangers Project Results: Over $5.8M in Recovered or Frozen Assets

The Ethereum Foundation's ETH Rangers project has successfully completed, funding 17 researchers to enhance public security in the ecosystem. Achievements include recovering $5.8M in assets, identifying over 785 vulnerabilities, and developing several security tools.

GateNews16h ago

Top Crypto VCs See Significant AUM Declines Amid 2025 Market Downturn

During the 2025 crypto market downturn, major venture capital firms saw significant AUM declines, but Haun Ventures grew by 30%. Paradigm and a16z are raising over $4.2 billion for new funds, highlighting varied performances among firms.

GateNews20h ago
Comment
0/400
No comments