pull down to refresh

some bold claims being made early on...

122 sats \ 1 reply \ @gmd OP 10 Jul

Artificial Analysis Review comparing across foundation models:

https://pbs.twimg.com/media/Gvd9nWIakAULlB9?format=jpg&name=4096x4096

view on x.com
94 sats \ 1 reply \ @cy 10 Jul

still think it's a nothing burger, however their performance on ARC-AGI is impressive

reply

Yeah in the end we're still seeing incremental improvements. Their biggest issue will be grabbing users and mindshare from OpenAI and Google

Pretty amazing to achieve SOTA status after starting barely 2 years ago... Elon is a genius motivator (sounds exhausting really.. i would move to Meta and quiet quit).

reply

Rather sad that I would probably get a zero on Humanity's Last Exam unless it were multiple choice...

reply
10 sats \ 1 reply \ @gmd OP 10 Jul

https://pbs.twimg.com/media/GveF5LLXwAEo0qj?format=jpg&name=large

reply

https://pbs.twimg.com/media/GveEUjUW0AAVA0P?format=jpg&name=large

I'm assuming these results are more reliable than llama's benchmarks...

reply