@anon
sign up
@anon
sign up
pull down to refresh
Are LLMs able to notice the “gorilla in the data”?
chiraaggohel.com/posts/llms-eda/
99 sats
\
1 comment
\
@hn
8 Feb
tech
related
LLMs generate slop because they avoid surprises by design - Dan Fabulich
danfabulich.medium.com/llms-tell-bad-jokes-because-they-avoid-surprises-7f111aac4f96
343 sats
\
2 comments
\
@Scoresby
19 Aug
AI
Apple study exposes deep cracks in LLMs’ “reasoning” capabilities
arstechnica.com/ai/2024/10/llms-cant-perform-genuine-logical-reasoning-apple-researchers-suggest/
21 sats
\
1 comment
\
@Rsync25
15 Oct 2024
tech
What We Know About LLMs (A Primer)
willthompson.name/what-we-know-about-llms-primer
163 sats
\
1 comment
\
@hn
25 Jul 2023
tech
Things we learned about LLMs in 2024
simonwillison.net/2024/Dec/31/llms-in-2024/
370 sats
\
0 comments
\
@Rsync25
31 Dec 2024
tech
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
LLMs aren’t world models
yosefk.com/blog/llms-arent-world-models.html
121 sats
\
0 comments
\
@carter
13 Aug
AI
Are LLMs random?
rnikhil.com/2025/04/26/llm-coin-toss-odd-even
269 sats
\
1 comment
\
@carter
30 Apr
AI
From Artificial Needles to Real Haystacks: Improving Capabilities in LLMs
arxiv.org/abs/2406.19292
21 sats
\
0 comments
\
@Rsync25
29 Jun 2024
alter_native
Escaping the Chrome Sandbox Through DevTools
ading.dev/blog/posts/chrome_sandbox_escape.html
109 sats
\
0 comments
\
@hn
17 Oct 2024
tech
Can LLMs write better code if you keep asking them to “write better code”?
minimaxir.com/2025/01/write-better-code/
29 sats
\
0 comments
\
@Rsync25
3 Jan
tech
Coping with dumb LLMs using classic ML
softwaredoug.com/blog/2025/01/21/llm-judge-decision-tree
31 sats
\
0 comments
\
@hn
24 Jan
tech
OpenMPTCProuter: Aggregate and encrypt multiple internet connections using MPTCP
www.openmptcprouter.com/
11 sats
\
0 comments
\
@hn
23 Nov 2024
tech
Shardines: SQLite3 Database-per-Tenant with ActiveRecord
blog.julik.nl/2025/04/a-can-of-shardines
49 sats
\
0 comments
\
@hn
27 Apr
tech
Majorana, the search for the most elusive neutrino of all
newscenter.lbl.gov/2012/05/16/majorana-demonstrator/
337 sats
\
0 comments
\
@hn
26 May 2024
tech
Optimizers need a rethink
typesanitizer.com/blog/rethink-optimizers.html
10 sats
\
0 comments
\
@hn
27 Oct 2024
tech
Sunset Geometry (2016)
www.shapeoperator.com/2016/12/12/sunset-geometry/
24 sats
\
0 comments
\
@hn
15 Mar
tech
Making Sense of Lambda Calculus 0: Abstration, Reduction, Substitution?
aartaka.me/lambda-0
21 sats
\
0 comments
\
@hn
10 Nov 2024
tech
Computing with Time: Microarchitectural Weird Machines
cacm.acm.org/research-highlights/computing-with-time-microarchitectural-weird-machines/
11 sats
\
0 comments
\
@hn
25 Nov 2024
tech
Detecting when LLMs are uncertain
www.thariq.io/blog/entropix/
49 sats
\
0 comments
\
@hn
25 Oct 2024
tech
Peano arithmetic is enough, because Peano arithmetic encodes computation
math.stackexchange.com/a/5075056/6708
10 sats
\
0 comments
\
@hn
14 Jun
tech
Show HN: Evolving Agents Framework
github.com/matiasmolinas/evolving-agents
10 sats
\
0 comments
\
@hn
9 Mar
tech
more