reply on: How Long Contexts Fail \ stacker news

pull down to refresh

109 sats \ 1 reply \ @carter OP 7 Jul 2025 \ parent \ on: How Long Contexts Fail AI

I saw a paper that said the models are cheating and learning the exact test questions because if you add extraneous information to a question it previously answered correctly it gets confused with the extraneous information and answers wrong

11 sats \ 0 replies \ @optimism 7 Jul 2025

That is what I was thinking the other day when seeing the performance against benchmarks. Model trainers are pulling a VW on the bench?