@anon
sign up
@anon
sign up
pull down to refresh
Don't Overthink It: A Survey of Efficient R1-style LRMs
arxiv.org/abs/2508.02120
132 sats
\
2 comments
\
@optimism
10 Aug 2025
AI
related
Stacker News Content Guidelines
1612 sats
\
50 comments
\
@sn
15 Oct 2022
bitcoin
Stacker News Changelog
4943 sats
\
10 comments
\
@sn
8 Oct 2022
bitcoin
Is Chain-of-Thought Reasoning of LLMs a Mirage?
arxiv.org/abs/2508.01191
397 sats
\
9 comments
\
@optimism
7 Aug 2025
AI
Giving models more compute time might make them worse at reasoning - Anthropic
arxiv.org/abs/2507.14417
313 sats
\
2 comments
\
@Scoresby
31 Jul 2025
AI
Do you believe in “Predictive Programming”?
6992 sats
\
19 comments
\
@Car
25 Feb 2024
conspiracy
Your Internet Shouldn't Be My Internet
samsharp.ca/your-internet-my-internet/
491 sats
\
7 comments
\
@nym
4 Dec 2024
culture
Road-To-Suave-Fabs / Part 1
1943 sats
\
14 comments
\
@Fabs
24 Aug 2024
ideasfromtheedge
"Benchwashing" - how do you defend against this?
1648 sats
\
10 comments
\
@optimism
9 Aug 2025
AskSN
Sustainable Profit Taking on Stacker News
3570 sats
\
25 comments
\
@Undisciplined
22 Apr 2024
meta
DBRX: A new open LLM
www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
10 sats
\
1 comment
\
@hn
31 Mar 2024
tech
1-Bit LLM: The Most Efficient LLM Possible?
www.youtube.com/watch?v=7hMoz9q4zv0
533 sats
\
1 comment
\
@carter
24 Jun 2025
AI
Curated list of useful Open Source applications for GrapheneOS
4142 sats
\
24 comments
\
@nullama
14 Aug 2023
tech
what's the best time of day/day of the week for you to achieve creative thought?
1264 sats
\
15 comments
\
@billytheked
24 Jul 2025
AskSN
Is DeepSeek a game changer for AI? - Computerphile
3297 sats
\
10 comments
\
@SimpleStacker
28 Jan 2025
tech
Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm
306 sats
\
1 comment
\
@nullama
13 Apr 2023
bitcoin
Meet PowerInfer: A Fast LLM on a Single Consumer-Grade GPU
www.marktechpost.com/2023/12/23/meet-powerinfer-a-fast-large-language-model-llm-on-a-single-consumer-grade-gpu-that-speeds-up-machine-learning-model-inference-by-11-times/
10 sats
\
2 comments
\
@ch0k1
24 Dec 2023
AI
AI does math: Multiplication using o1-mini vs GPT-4o
160 sats
\
5 comments
\
@zuspotirko
18 Sep 2024
tech
Optimal Zapping on Stacker News
11.4k sats
\
112 comments
\
@Undisciplined
18 Oct 2023
meta
LiveBench - A Challenging, Contamination-Free LLM Benchmark
livebench.ai
161 sats
\
0 comments
\
@supratic
17 Jul 2025
AI
LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
github.com/hiyouga/LLaMA-Factory
157 sats
\
0 comments
\
@carter
19 Sep 2025
AI
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
arxiv.org/abs/2510.01171
166 sats
\
0 comments
\
@Scoresby
17 Oct 2025
AI
more