@anon
sign up
@anon
sign up
pull down to refresh
Communication Efficient LLM Pre-training with SparseLoCo
arxiv.org/abs/2508.15706
100 sats
\
0 comments
\
@carter
1 Sep 2025
AI
related
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
98 sats
\
1 comment
\
@kepford
6 May 2025
AI
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity
arxiv.org/pdf/2505.21411
100 sats
\
1 comment
\
@carter
3 Jul 2025
AI
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
arxiv.org/abs/2508.16153
152 sats
\
0 comments
\
@optimism
25 Aug 2025
AI
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs
arxiv.org/abs/2509.21155
415 sats
\
6 comments
\
@optimism
2 Dec 2025
AI
LLMs generate slop because they avoid surprises by design - Dan Fabulich
danfabulich.medium.com/llms-tell-bad-jokes-because-they-avoid-surprises-7f111aac4f96
343 sats
\
2 comments
\
@Scoresby
19 Aug 2025
AI
Compute Where It Counts: High Quality Sparsely Activated LLMs
crystalai.org/blog/2025-08-18-compute-where-it-counts
100 sats
\
0 comments
\
@carter
21 Aug 2025
AI
Compiling LLMs into a MegaKernel: A path to low-latency inference
zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17
10 sats
\
0 comments
\
@hn
19 Jun 2025
tech
Efficient LLM Inference
arxiv.org/abs/2507.14397
121 sats
\
0 comments
\
@carter
3 Oct 2025
AI
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
LLM evaluation at scale with the NeurIPS Efficiency Challenge
blog.mozilla.ai/exploring-llm-evaluation-at-scale-with-the-neurips-large-language-model-efficiency-challenge/
110 sats
\
0 comments
\
@localhost
22 Feb 2024
tech
1-Bit LLM: The Most Efficient LLM Possible?
www.youtube.com/watch?v=7hMoz9q4zv0
533 sats
\
1 comment
\
@carter
24 Jun 2025
AI
LiveBench - A Challenging, Contamination-Free LLM Benchmark
livebench.ai
161 sats
\
0 comments
\
@supratic
17 Jul 2025
AI
Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm
306 sats
\
1 comment
\
@nullama
13 Apr 2023
bitcoin
Lessons learned from programming with LLMs
crawshaw.io/blog/programming-with-llms
120 sats
\
1 comment
\
@m0wer
5 Jul 2025
AI
LLM-Deflate: Extracting LLMs Into Datasets
www.scalarlm.com/blog/llm-deflate-extracting-llms-into-datasets/
100 sats
\
1 comment
\
@carter
29 Sep 2025
AI
DBRX: A new open LLM
www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
10 sats
\
1 comment
\
@hn
31 Mar 2024
tech
LLM in a Flash: Efficient LLM Inference with Limited Memory
huggingface.co/papers/2312.11514
13 sats
\
1 comment
\
@hn
20 Dec 2023
tech
Stack Overflow and OpenAI Partner to Strengthen the World’s Most Popular LLMs
stackoverflow.co/company/press/archive/openai-partnership
96 sats
\
0 comments
\
@031ef7d322
6 May 2024
tech
Coping with dumb LLMs using classic ML
softwaredoug.com/blog/2025/01/21/llm-judge-decision-tree
31 sats
\
0 comments
\
@hn
24 Jan 2025
tech
OpenCoder: Open-Source LLM for Coding
arxiv.org/abs/2411.04905
52 sats
\
0 comments
\
@hn
9 Nov 2024
tech
more