@anon
sign up
@anon
sign up
pull down to refresh
ATLAS: A New Paradigm in LLM Inference via Runtime-Learning Accelerators
www.together.ai/blog/adaptive-learning-speculator-system-atlas
100 sats
\
0 comments
\
@carter
14 Oct
AI
related
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
Here’s What’s Really Going On Inside An LLM’s Neural Network
116 sats
\
0 comments
\
@0xbitcoiner
22 May 2024
BooksAndArticles
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
98 sats
\
1 comment
\
@kepford
6 May
AI
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning
arxiv.org/abs/2508.05405
182 sats
\
0 comments
\
@optimism
10 Aug
AI
LLM in a Flash: Efficient LLM Inference with Limited Memory
huggingface.co/papers/2312.11514
13 sats
\
1 comment
\
@hn
20 Dec 2023
tech
Compiling LLMs into a MegaKernel: A path to low-latency inference
zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17
10 sats
\
0 comments
\
@hn
19 Jun
tech
Efficient LLM Inference
arxiv.org/abs/2507.14397
121 sats
\
0 comments
\
@carter
3 Oct
AI
Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm
306 sats
\
1 comment
\
@nullama
13 Apr 2023
bitcoin
AMD's MI300X Outperforms Nvidia's H100 for LLM Inference
www.blog.tensorwave.com/amds-mi300x-outperforms-nvidias-h100-for-llm-inference/
202 sats
\
0 comments
\
@hn
13 Jun 2024
tech
DBRX: A new open LLM
www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
10 sats
\
1 comment
\
@hn
31 Mar 2024
tech
Meet PowerInfer: A Fast LLM on a Single Consumer-Grade GPU
www.marktechpost.com/2023/12/23/meet-powerinfer-a-fast-large-language-model-llm-on-a-single-consumer-grade-gpu-that-speeds-up-machine-learning-model-inference-by-11-times/
10 sats
\
2 comments
\
@ch0k1
24 Dec 2023
AI
LiveBench - A Challenging, Contamination-Free LLM Benchmark
livebench.ai
161 sats
\
0 comments
\
@supratic
17 Jul
AI
Hardware Acceleration of LLMs: A comprehensive survey and comparison
arxiv.org/abs/2409.03384
21 sats
\
0 comments
\
@hn
7 Sep 2024
tech
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL
arxiv.org/abs/2501.12948
9 sats
\
0 comments
\
@hn
25 Jan
tech
LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
github.com/hiyouga/LLaMA-Factory
157 sats
\
0 comments
\
@carter
19 Sep
AI
Hidet: A Deep Learning Compiler for Efficient Model Serving
pytorch.org/blog/introducing-hidet/
110 sats
\
1 comment
\
@hn
28 Apr 2023
tech
Lm.rs: Minimal CPU LLM inference in Rust with no dependency
github.com/samuel-vitorino/lm.rs
10 sats
\
0 comments
\
@hn
11 Oct 2024
tech
3Blue1Brown: How might LLMs store facts | Chapter 7, Deep Learning
www.youtube.com/watch?v=9-Jl0dxWQs8
184 sats
\
3 comments
\
@south_korea_ln
5 Sep 2024
science
OpenCoder: Open-Source LLM for Coding
arxiv.org/abs/2411.04905
52 sats
\
0 comments
\
@hn
9 Nov 2024
tech
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
www.nature.com/articles/s41586-025-09422-z
121 sats
\
0 comments
\
@carter
19 Sep
AI
more