@anon
sign up
@anon
sign up
pull down to refresh
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
arxiv.org/abs/2510.04721
180 sats
\
1 comment
\
@jakoyoh629
25 Oct
AI
related
LLMs and SN, redux
2807 sats
\
19 comments
\
@elvismercury
4 Jan 2024
meta
The big idea: Charge $1 to apply to a job (hear me out)
nodumbideas.com/p/the-big-idea-charge-1-to-apply-to
2358 sats
\
31 comments
\
@elvismercury
27 Feb
econ
"Benchwashing" - how do you defend against this?
1648 sats
\
10 comments
\
@optimism
9 Aug
AskSN
Voting on chat.lmsys.org might be super influential for humankinds future
511 sats
\
5 comments
\
@zuspotirko
22 Jun 2024
tech
Highly available LND clusters (etcd, PostgreSQL, Ceph) with benchmarks
github.com/Filiprogrammer/lnd-ha-guide
2685 sats
\
0 comments
\
@Filiprogrammer
12 Oct 2024
lightning
How to turn LLM Pinocchio into a real boy
12.7k sats
\
10 comments
\
@Scoresby
7 Oct
AI
How peer review became so easy to exploit by AI
medium.com/blog/how-peer-review-became-so-easy-to-exploit-by-ai-d5818545bd93
424 sats
\
4 comments
\
@BlokchainB
16 Jul
AI
Transformer based AI will not lead us to AGI/ASI and is just a hype machine
3109 sats
\
18 comments
\
@cy
2 Jul
AI
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
arxiv.org/abs/2509.03867
306 sats
\
0 comments
\
@optimism
7 Sep
AI
🕵️♂️🚫 Free Yourself from Google Spyware with PullThatUpJamie.ai
4312 sats
\
31 comments
\
@cascdr
2 Jan
privacy
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
304 sats
\
2 comments
\
@Scoresby
14 Jul
AI
The week in AI, October 20-26, 2025
382 sats
\
5 comments
\
@optimism
27 Oct
AI
The simulation of judgment in LLMs - PNAS
www.pnas.org/doi/10.1073/pnas.2518443122
214 sats
\
5 comments
\
@Scoresby
15 Oct
AI
Parents Are Letting Little Kids Play With AI. Are They Wrong?
www.theguardian.com/technology/ng-interactive/2025/oct/02/ai-children-parenting-creativity
1460 sats
\
13 comments
\
@OT
7 Oct
AI
The week in AI, October 13-19, 2025
1856 sats
\
18 comments
\
@optimism
20 Oct
AI
Bank of Canada likely to cut rates again in September
604 sats
\
6 comments
\
@grayruby
21 Aug 2024
econ
AI benchmarks hampered by bad science
www.theregister.com/2025/11/07/measuring_ai_models_hampered_by/
178 sats
\
0 comments
\
@0xbitcoiner
10 Nov
AI
The week in AI, October 6-12, 2025
961 sats
\
2 comments
\
@optimism
13 Oct
AI
To Have Machines Make Math Proofs, Turn Them Into a Puzzle
www.quantamagazine.org/to-have-machines-make-math-proofs-turn-them-into-a-puzzle-20251110/
238 sats
\
0 comments
\
@0xbitcoiner
11 Nov
AI
Inverse IFEval: Unlearn Training Conventions to Follow Real Instructions?
arxiv.org/abs/2509.04292
90 sats
\
0 comments
\
@optimism
5 Sep
AI
Why OpenAI’s solution to AI hallucinations would kill ChatGPT tomorrow
theconversation.com/why-openais-solution-to-ai-hallucinations-would-kill-chatgpt-tomorrow-265107
588 sats
\
25 comments
\
@south_korea_ln
17 Sep
AI
more