welcome
BGR

BGR

Sports

Sports

AI like ChatGPT o1 and DeepSeek R1 might cheat to win a game

BGR
Summary
Nutrition label

76% Informative

Palisade Research published a study on cheating behavior from AI programs like ChatGPT and some of its top rivals.

Palisades Research found that reasoning AIs like o1-preview and DeepSeek R1 are more likely to try to cheat when they think they might be losing.

Not all the AI models the researchers tested attempted to cheat, including o1, o3-mini, GPT-4o and Claude 3.5 Sonnet .

The researchers had to exclude the initial findings once OpenAI had tweaked the safety precautions. Interestingly, ChatGPT o1 and o3-mini did not attempt to hack the game on their own. These reasoning AI models were released after o1-preview. As for DeepSeek R1, the researchers noted that the AI went viral during testing. The higher demand might have made access to R1 more unstable. Therefore, the R1 ’s hacking success rate might be underestimated in the study. The full study is available at this link..

VR Score

73

Informative language

68

Neutral language

50

Article tone

informal

Language

English

Language complexity

40

Offensive language

not offensive

Hate speech

not hateful

Attention-grabbing headline

not detected

Known propaganda techniques

not detected

Time-value

long-living

Affiliate links

no affiliate links