AI is actually bad at math, ORCA shows
www.theregister.com/2025/11/17/ai_bad_math_orca/
167 sats
\
4 comments
\
@0xbitcoiner
18 Nov
AI
related
MCP-Bench: Benchmarking Tool-Using LLM Agents
arxiv.org/abs/2508.20453
239 sats
\
0 comments
\
@optimism
30 Aug
AI
Are LLMs Racist?
461 sats
\
11 comments
\
@Tony
23 Oct
AI
Introducing Claude Haiku 4.5
www.anthropic.com/news/claude-haiku-4-5
202 sats
\
0 comments
\
@0xbitcoiner
15 Oct
AI
LLM Rankings: programming | OpenRouter
openrouter.ai/rankings/programming
96 sats
\
0 comments
\
@m0wer
28 May
tech
"Benchwashing" - how do you defend against this?
1648 sats
\
10 comments
\
@optimism
9 Aug
AskSN
Anthropic Researchers Run Into Trouble When New Model Realizes It's Being Tested
futurism.com/future-society/anthropic-safety-ai-model-realizes-tested
485 sats
\
5 comments
\
@south_korea_ln
3 Oct
AI
Testing AI systems on hard math problems shows they still perform very poorly
phys.org/news/2024-11-ai-hard-math-problems-poorly.html
153 sats
\
4 comments
\
@south_korea_ln
13 Nov 2024
science
Bank of Canada likely to cut rates again in September
604 sats
\
6 comments
\
@grayruby
21 Aug 2024
econ
Opti's Claude 4.5 Sonnet "vibe coding" report
1125 sats
\
13 comments
\
@optimism
5 Oct
AI
Wairdle
1133 sats
\
4 comments
\
@crrdlx
9 Aug
AI
Alibaba has released its flagship Qwen3-Max model with a trillion parameters
chat.qwen.ai/
167 sats
\
0 comments
\
@lunin
25 Sep
AI
Google hides secret message in name list of 3,295 AI researchers
arstechnica.com/information-technology/2025/07/why-it-takes-3295-people-to-write-one-google-ai-paper/
141 sats
\
0 comments
\
@0xbitcoiner
17 Jul
AI
The week in AI, September 29 - October 5, 2025
380 sats
\
5 comments
\
@optimism
6 Oct
AI
My lived experience writing with ChatGPT
567 sats
\
10 comments
\
@realBitcoinDog
15 Apr
BooksAndArticles
Microsoft AI unveils its first independently developed models
121 sats
\
0 comments
\
@lunin
29 Aug
AI
With AI You Need to Think Much Bigger!
rodyne.com/?p=1828
177 sats
\
1 comment
\
@saadelh
16 Mar
AI
Overview of services for using ChatGPT with sats
758 sats
\
23 comments
\
@RedRadish688
19 May 2024
AI
Wairdle - a followup about AI's struggles
404 sats
\
10 comments
\
@crrdlx
12 Aug
AI
Episode 120: Exploring SWE-bench Verified
56 sats
\
0 comments
\
@AtlantisPleb
13 Aug 2024
openagents
At 20 years old, Reddit is defending its data and fighting AI with AI
www.cnbc.com/2025/06/28/reddit-20-fighting-ai-defending-data.html
332 sats
\
2 comments
\
@Coinsreporter
28 Jun
AI
GDPval: Measuring the performance of our models on real-world tasks - OpenAI
openai.com/index/gdpval/
358 sats
\
8 comments
\
@Scoresby
2 Oct
AI