pull down to refresh
Hmm no but I suspect that this means that the coding frameworks are working around fundamental issues, rather than the actual ultra-expensive models improving much.
For example, in #1415961, the author is very enthusiastic about ralph-wiggum. This is a plugin that executes your prompt until Claude gets it right. A.k.a. if you bet 100x on a 1:100 odds outcome, you have a real chance at getting it right.
All this bull crap the liars-in-chief have been spilling at Davos is all about simulating until you get it right, rather than actually getting it right.
reply
Smoke and mirrors?