pull down to refresh

You can simulate this with turn-based interaction in a simple python script. I've done some experiments with this and in general you want to have extremely low error rates and extremely high error recovery, just like with agentic solutions, which honestly, we do not have today, even though it came along a bit since I first tried (with llama2.)
I've for example experimented with making LLMs use the socratic method to drill down a problem to its core between different agentic roles, but it is hard to get consistency in the face of high error rates; with llama2 the success rate was under 50% in a 5-turn script, similar with qwen3 though the smaller distills for Qwen are much worse than for other models for some reason.
There are perhaps some optimizations to make, but I'm shy to introduce additional authoritarian roles, and that wouldn't help your IRC idea either.
I really like the idea of getting models to dial down on requests with the Socratic method. Did you publish your experiment anywhere?
reply
Don't think I did. I'll prepare a gist after the next round of playing with it.
reply