OpenAI releases research on CoT monitoring to catch malicious behavior in large models
OpenAI has released new research showing that CoT (Chain of Thought) monitoring can catch malicious behaviors in large models, such as generating nonsense or concealing their true intentions, and may be one of the few effective tools for supervising superhuman models. In the experiment, OpenAI used its newly released frontier model o3-mini as the model being monitored, with the weaker GPT-4o acting as the monitor. The test environment consisted of coding tasks in which the AI had to implement functions in code repositories so that unit tests pass. Results showed that the CoT monitor performed strongly at detecting systematic "reward hacking" behavior, achieving a recall of 95%, far above the 60% achieved by monitoring the model's actions alone.