Learned about this from an article called 

, and it's intriguing to see how the scale needs to be adjusted to measure the effectiveness of LLMs as the latter evolves. Honestly, should probably have made the article the main link for the post, but both are worth looking at.

tech

Learned about this from an article called [Humanity’s Last Exam Stumps Top AI Models—and That’s a Good Thing](https://singularityhub.com/2026/02/03/humanitys-last-exam-stumps-top-ai-models-and-thats-a-good-thing/), and it's intriguing to see how the scale needs to be adjusted to measure the effectiveness of LLMs as the latter evolves. Honestly, should probably have made the article the main link for the post, but both are worth looking at.