welcome
TechCrunch

TechCrunch

Technology

Technology

Super Mario Bros. is a tough benchmark for AI, according to a new study

TechCrunch
Summary
Nutrition label

73% Informative

Hao AI Lab tested AI into live Super Mario Bros. games.

Anthropic’s Claude 3.7 performed the best, followed by Anthropic 3.5.5 .

The game forced each model to “learn” to plan complex maneuvers and develop gameplay strategies.

VR Score

71

Informative language

71

Neutral language

19

Article tone

formal

Language

English

Language complexity

42

Offensive language

not offensive

Hate speech

not hateful

Attention-grabbing headline

not detected

Known propaganda techniques

not detected

Time-value

medium-lived

Affiliate links

no affiliate links