pull down to refresh

Even if I had access I wouldn't even know what to use this for...
I mean it just seems like it lets it make some more tokens at inference time so if it failed a first generation this might be enough for it to get the correct result
reply