@anon
sign up
@anon
sign up
pull down to refresh
"Benchwashing" - how do you defend against this?
1648 sats
\
10 comments
\
@optimism
9 Aug 2025
AskSN
related
Qwen3-235B-A22B-2507
xcancel.com/Alibaba_Qwen/status/1947344511988076547
218 sats
\
0 comments
\
@m0wer
24 Jul 2025
AI
Is DeepSeek a game changer for AI? - Computerphile
3297 sats
\
10 comments
\
@SimpleStacker
28 Jan 2025
tech
Alibaba has released its flagship Qwen3-Max model with a trillion parameters
chat.qwen.ai/
167 sats
\
0 comments
\
@lunin
25 Sep 2025
AI
Mistral has released two open-source models for coding
mistral.ai/news/devstral-2-vibe-cli
522 sats
\
0 comments
\
@lunin
10 Dec 2025
AI
Anarchy in Sudan has spawned the world’s worst famine in 40 years
www.economist.com/briefing/2024/08/29/anarchy-in-sudan-has-spawned-the-worlds-worst-famine-in-40-years
148 sats
\
3 comments
\
@hn
1 Sep 2024
tech
MCP-Bench: Benchmarking Tool-Using LLM Agents
arxiv.org/abs/2508.20453
239 sats
\
0 comments
\
@optimism
30 Aug 2025
AI
OpenAI’s Misalignment and Microsoft’s Gain
stratechery.com/2023/openais-misalignment-and-microsofts-gain/
7509 sats
\
11 comments
\
@elvismercury
20 Nov 2023
tech
OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model
www.searchenginejournal.com/openai-secretly-funded-frontiermath-benchmarking-dataset/537760/
341 sats
\
0 comments
\
@frostdragon
21 Jan 2025
tech
AI is actually bad at math, ORCA shows
www.theregister.com/2025/11/17/ai_bad_math_orca/
167 sats
\
4 comments
\
@0xbitcoiner
18 Nov 2025
AI
Surge in Russian Oil Tankers Mitigates Sanctions Impact
615 sats
\
10 comments
\
@GhostofTruth
14 Aug 2024
news
Voting on chat.lmsys.org might be super influential for humankinds future
511 sats
\
5 comments
\
@zuspotirko
22 Jun 2024
tech
Changing consensus - Bitcoin Optech Newsletter #383
bitcoinops.org/en/newsletters/2025/12/05/
1564 sats
\
2 comments
\
@schmidty
5 Dec 2025
bitcoin
₿ully-sh. Travelling in style in El Salvador.
102 sats
\
5 comments
\
@CarlBMenger
8 Apr 2023
bitcoin
Fed approves quarter-point interest rate cut and sees two more coming this year
www.cnbc.com/2025/09/17/fed-rate-decision-september-2025.html
528 sats
\
7 comments
\
@Coinsreporter
17 Sep 2025
econ
Why OpenAI’s solution to AI hallucinations would kill ChatGPT tomorrow
theconversation.com/why-openais-solution-to-ai-hallucinations-would-kill-chatgpt-tomorrow-265107
588 sats
\
25 comments
\
@south_korea_ln
17 Sep 2025
AI
Bank of Canada cut benchmark rate by 50 basis points
675 sats
\
13 comments
\
@grayruby
11 Dec 2024
econ
Search-capable AI agents may cheat on benchmark tests
www.theregister.com/2025/08/23/searchcapable_ai_agents_may_cheat
237 sats
\
2 comments
\
@Coinsreporter
23 Aug 2025
AI
Jake Paul defeats Mike Tyson🥊
676 sats
\
24 comments
\
@suraz
16 Nov 2024
Stacker_Sports
Argentina, The house of cards is beginning to sway.
8661 sats
\
36 comments
\
@fbv000
22 Apr 2023
bitcoin
The flagship model, Qwen3-Max-Preview, has been released
100 sats
\
0 comments
\
@lunin
5 Sep 2025
AI
Researcher uncovers one of the biggest password dumps in recent history
arstechnica.com/security/2024/01/71-million-passwords-for-facebook-coinbase-and-others-found-for-sale/
1875 sats
\
2 comments
\
@jakoyoh629
20 Jan 2024
security
more