@anon
sign up
@anon
sign up
pull down to refresh
Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch
github.com/johnma2006/mamba-minimal
15 sats
\
1 comment
\
@hn
20 Dec 2023
tech
related
OpenAI offers free GPT-4o Mini fine-tuning to counter Meta’s Llama 3.1 release
venturebeat.com/ai/ai-arms-race-escalates-openai-offers-free-gpt-4o-mini-fine-tuning-to-counter-metas-llama-3-1-release/
21 sats
\
0 comments
\
@ch0k1
25 Jul 2024
news
A New RISC-V Breakthrough Chip Merges CPU, GPU & AI into One - techovedas
techovedas.com/a-new-risc-v-breakthrough-chip-merges-cpu-gpu-ai-into-one/
78 sats
\
0 comments
\
@ch0k1
6 Apr 2024
tech
Here’s What’s Really Going On Inside An LLM’s Neural Network
116 sats
\
0 comments
\
@0xbitcoiner
22 May 2024
BooksAndArticles
Hardware Acceleration of LLMs: A comprehensive survey and comparison
arxiv.org/abs/2409.03384
21 sats
\
0 comments
\
@hn
7 Sep 2024
tech
PyTorch Internals: Ezyang's Blog
blog.ezyang.com/2019/05/pytorch-internals/
10 sats
\
0 comments
\
@hn
22 Mar
tech
Compiling LLMs into a MegaKernel: A path to low-latency inference
zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17
10 sats
\
0 comments
\
@hn
19 Jun
tech
LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
github.com/hiyouga/LLaMA-Factory
157 sats
\
0 comments
\
@carter
19 Sep
AI
Awesome Llm Apps: Collection of awesome LLM apps with RAG using OpenAI...
github.com/Shubhamsaboo/awesome-llm-apps
178 sats
\
1 comment
\
@Rsync25
15 Jun 2024
opensource
Orchard - Lightning, Cashu, Tether, Llama
orchard.space/
427 sats
\
5 comments
\
@Scoresby
24 Jun
lightning
What Every User Should Know About Mixed Precision Training in PyTorch
pytorch.org/blog/what-every-user-should-know-about-mixed-precision-training-in-pytorch/
10 sats
\
1 comment
\
@hn
15 Mar 2024
tech
Coding with LLMs in the summer of 2025 (an update) - <antirez>
antirez.com/news/154
444 sats
\
6 comments
\
@carter
20 Jul
AI
Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM
www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm
306 sats
\
1 comment
\
@nullama
13 Apr 2023
bitcoin
Meet PowerInfer: A Fast LLM on a Single Consumer-Grade GPU
www.marktechpost.com/2023/12/23/meet-powerinfer-a-fast-large-language-model-llm-on-a-single-consumer-grade-gpu-that-speeds-up-machine-learning-model-inference-by-11-times/
10 sats
\
2 comments
\
@ch0k1
24 Dec 2023
AI
Implementing Neural Networks in TensorFlow (and PyTorch)
towardsdatascience.com/implementing-neural-networks-in-tensorflow-and-pytorch-3c1f097e412a
55 sats
\
1 comment
\
@ch0k1
9 Jul 2024
devs
AMD's MI300X Outperforms Nvidia's H100 for LLM Inference
www.blog.tensorwave.com/amds-mi300x-outperforms-nvidias-h100-for-llm-inference/
202 sats
\
0 comments
\
@hn
13 Jun 2024
tech
OpenCoder: Open-Source LLM for Coding
arxiv.org/abs/2411.04905
52 sats
\
0 comments
\
@hn
9 Nov 2024
tech
Falcon Chat Demo (Falcon 40B Instruct)
10 sats
\
5 comments
\
@mudbloodvonfrei
14 Jun 2023
tech
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
LiveBench - A Challenging, Contamination-Free LLM Benchmark
livebench.ai
161 sats
\
0 comments
\
@supratic
17 Jul
AI
1-Bit LLM: The Most Efficient LLM Possible?
www.youtube.com/watch?v=7hMoz9q4zv0
533 sats
\
1 comment
\
@carter
24 Jun
AI
New models and developer products announced at OpenAI DevDay
openai.com/blog/new-models-and-developer-products-announced-at-devday
10 sats
\
2 comments
\
@hn
6 Nov 2023
tech
more