Tests that once challenged advanced AI models are now being solved with ease, making it harder for researchers to pinpoint what current systems are actually capable of.
A global team developed Humanity’s Last Exam, a rigorous new test built to expose gaps in today’s most advanced AI models.