deleted by author

Beware though, this is OpenAI benchmarking OpenAI. Results may or may not be a sales pitch lol

Doubt

Yes, maybe on specific, well-defined tasks it can do expert-level work. But that's usually not what experts get paid for.

8 sats \ 0 replies \ @035736735e 30 Sep 2025 -33 sats

The idea that we can enhance AI performance simply by prompting it to self-verify its work is a game-changer, suggesting a more interactive and dynamic approach to AI utilization...