items/359713/related \ stacker news

pull down to refresh

Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch github.com/johnma2006/mamba-minimal

15 sats \ 1 comment \ @hn 20 Dec 2023 tech

related

At NeurIPS, NVIDIA Advances Open Model Development for Digital and Physical AI blogs.nvidia.com/blog/neurips-open-source-digital-physical-ai/

157 sats \ 0 comments \ @0xbitcoiner 2 Dec 2025 AI

OpenAI offers free GPT-4o Mini fine-tuning to counter Meta’s Llama 3.1 release venturebeat.com/ai/ai-arms-race-escalates-openai-offers-free-gpt-4o-mini-fine-tuning-to-counter-metas-llama-3-1-release/

21 sats \ 0 comments \ @ch0k1 25 Jul 2024 news

A New RISC-V Breakthrough Chip Merges CPU, GPU & AI into One - techovedas techovedas.com/a-new-risc-v-breakthrough-chip-merges-cpu-gpu-ai-into-one/

78 sats \ 0 comments \ @ch0k1 6 Apr 2024 tech

Hardware Acceleration of LLMs: A comprehensive survey and comparison arxiv.org/abs/2409.03384

21 sats \ 0 comments \ @hn 7 Sep 2024 tech

PyTorch Internals: Ezyang's Blog blog.ezyang.com/2019/05/pytorch-internals/

10 sats \ 0 comments \ @hn 22 Mar 2025 tech

Compiling LLMs into a MegaKernel: A path to low-latency inference zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17

10 sats \ 0 comments \ @hn 19 Jun 2025 tech

LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)github.com/hiyouga/LLaMA-Factory

157 sats \ 0 comments \ @carter 19 Sep 2025 AI

Awesome Llm Apps: Collection of awesome LLM apps with RAG using OpenAI...github.com/Shubhamsaboo/awesome-llm-apps

178 sats \ 1 comment \ @Rsync25 15 Jun 2024 opensource

Orchard - Lightning, Cashu, Tether, Llama orchard.space/

427 sats \ 5 comments \ @Scoresby 24 Jun 2025 lightning

What Every User Should Know About Mixed Precision Training in PyTorch pytorch.org/blog/what-every-user-should-know-about-mixed-precision-training-in-pytorch/

10 sats \ 1 comment \ @hn 15 Mar 2024 tech

Coding with LLMs in the summer of 2025 (an update) - <antirez>antirez.com/news/154

444 sats \ 6 comments \ @carter 20 Jul 2025 AI

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm

306 sats \ 1 comment \ @nullama 13 Apr 2023 bitcoin

Meet PowerInfer: A Fast LLM on a Single Consumer-Grade GPU www.marktechpost.com/2023/12/23/meet-powerinfer-a-fast-large-language-model-llm-on-a-single-consumer-grade-gpu-that-speeds-up-machine-learning-model-inference-by-11-times/

10 sats \ 2 comments \ @ch0k1 24 Dec 2023 AI

Implementing Neural Networks in TensorFlow (and PyTorch)towardsdatascience.com/implementing-neural-networks-in-tensorflow-and-pytorch-3c1f097e412a

55 sats \ 1 comment \ @ch0k1 9 Jul 2024 devs

AMD's MI300X Outperforms Nvidia's H100 for LLM Inference www.blog.tensorwave.com/amds-mi300x-outperforms-nvidias-h100-for-llm-inference/

202 sats \ 0 comments \ @hn 13 Jun 2024 tech

OpenCoder: Open-Source LLM for Coding arxiv.org/abs/2411.04905

52 sats \ 0 comments \ @hn 9 Nov 2024 tech

ATLAS: A New Paradigm in LLM Inference via Runtime-Learning Accelerators www.together.ai/blog/adaptive-learning-speculator-system-atlas

100 sats \ 0 comments \ @carter 14 Oct 2025 AI

Falcon Chat Demo (Falcon 40B Instruct)

10 sats \ 5 comments \ @mudbloodvonfrei 14 Jun 2023 tech

NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/

20 sats \ 0 comments \ @ch0k1 7 May 2024 tech

Incentives and coordination to solve hard problems

2905 sats \ 21 comments \ @elvismercury 9 Mar 2024 mostly_harmless

LiveBench - A Challenging, Contamination-Free LLM Benchmark livebench.ai

161 sats \ 0 comments \ @supratic 17 Jul 2025 AI