I use Duolingo as a litmus test. A few weeks ago, sharing a screenshot to the Android app was all it took to solve the exercise. Now ChatGPT asks several questions, then thinks, then invariably fails.
Some weeks ago:
https://chatgpt.com/share/68f0d7de-9a9c-8004-81ed-a834ead95967
Today (did not even reply!):
https://chatgpt.com/share/68f0d7f4-7030-8004-9b81-399a6e8bf22a
How do people still use this pile of garbage?
gpt5-main (the non-thinking model) has instruction-following regressions (still unsolved, I guess they don't wanna).
ggml-org/gemma-3-4b-it-GGUF:Q4_K_M
using llama.cpp server: