I set 10 honesty traps for Claude Opus 4.8 - and a legal test broke it
11h ago
I set 10 honesty traps for Claude Opus 4.8 - and a legal test broke it
I tested Opus 4.8 against 4.7 using coding, medical, finance, and legal traps, then cross-checked the results with multiple AIs.
#claude #hackernews #news
You might also wanna read
SEC proposes repealing climate disclosure rule requiring companies to report emissions and climate risks
The Securities and Exchange Commission (SEC) has proposed repealing a Biden-era climate disclosure rule that would require some public compa
SEC proposes repealing climate disclosure rule requiring companies to report emissions and climate risks
The Securities and Exchange Commission (SEC) has proposed repealing a Biden-era climate disclosure rule that would require some public compa
SEC proposes repealing climate disclosure rule requiring companies to report emissions and climate risks
The Securities and Exchange Commission (SEC) has proposed repealing a Biden-era climate disclosure rule that would require some public compa
No, Artificial Intelligence Is Not Conscious
theatlantic.com·37m ago
Kenny Kenoir (@kenoir323) on Threads
threads.com·39m ago
UK House of Lords committee urges Bank of England to ease proposed stablecoin restrictions
A U.K. House of Lords committee has called on the Bank of England to reconsider its proposed stablecoin regulations, including a 20,000 poun
