pull down to refresh
177 sats \ 3 replies \ @optimism 4h \ on: Deep dive into OpenAIs GPT-OSS outputs 🧵 AI
Perfectly aligns with the perceived villain arc of the CEO. I made a small comment yesterday about how it's apparently okay in AI to do what gave VW massive reputation problems: build inferior products that only perform well on the benchmarks and safety tests.
What's RL?
reply
Reinforcement Learning
There's an advanced free and open source course at HuggingFace: https://huggingface.co/learn/deep-rl-course/unit0/introduction
reply
Oh got it. Somehow the initials didn't click
reply