Llama 3.1 70B memorizes nearly all of Harry Potter and the Sorcerer’s Stone (91% of the text). How about we make it forget all that and save us some GPU memory?