Tests that once challenged advanced AI models are now being solved with ease, making it harder for researchers to pinpoint what current systems are actually capable of.
A global team developed Humanity’s Last Exam, a rigorous new test built to expose gaps in today’s most advanced AI models.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results