87 sats \ 1 reply \ @zuspotirko OP 20 Dec \ on: OpenAIs new model "o3" achieves amazing scores in benchmarks tech
Particularly impressive to me:
it solves 1/4 of research-level math questions
Scary that just 1 month ago, after evaluating o1, the great Terence Tao anticipated that the benchmark would "resist AIs for several years at least," noting that the problems require substantial domain expertise and that we currently lack sufficient relevant training data.