pull down to refresh

deleted by author

Beware though, this is OpenAI benchmarking OpenAI. Results may or may not be a sales pitch lol

reply

Doubt

Yes, maybe specific well defined tasks it can do expert level work. But that's usually not what experts get paid for.

reply

the idea that we can enhance AI performance simply by prompting it to self-verify its work is a game-changer, suggesting a more interactive and dynamic approach to AI utilization...