items/536780/related \ stacker news

Automatically Detecting Under-Trained Tokens in Large Language Models arxiv.org/abs/2405.05417

31 sats \ 0 comments \ @hn 12 May 2024 tech

related

Better and Faster Large Language Models via Multi-Token Prediction arxiv.org/abs/2404.19737

21 sats \ 0 comments \ @hn 1 May 2024 tech

The AI Dilemma: When Large Language Model Training Reaches A Dead End medium.com/@jankammerath/the-ai-dilemma-when-large-language-model-training-reaches-a-dead-end-e2cf1de4a2ad

10 sats \ 0 comments \ @BitcoinIsTheFuture 10 Mar 2024 econ

“Imprecise” language models are smaller, speedier, and nearly as accurate spectrum.ieee.org/1-bit-llm

10 sats \ 0 comments \ @hn 31 May 2024 tech

OpenMPTCProuter: Aggregate and encrypt multiple internet connections using MPTCP www.openmptcprouter.com/

11 sats \ 0 comments \ @hn 23 Nov 2024 tech

CVE-2025-48384: Breaking Git with a carriage return and cloning RCE dgl.cx/2025/07/git-clone-submodule-cve-2025-48384

21 sats \ 0 comments \ @hn 8 Jul 2025 tech

How Meta trains large language models at scale engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/

227 sats \ 0 comments \ @hn 13 Jun 2024 tech

Matters Computational Ideas, Algorithms, Source Code – Jorg Arndt [pdf] www.jjj.de/fxt/fxtbook.pdf

85 sats \ 0 comments \ @hn 7 Mar 2025 tech

Ghosts in the ROM (2012) www.nycresistor.com/2012/08/21/ghosts-in-the-rom/

32 sats \ 0 comments \ @hn 26 Jun 2024 tech

How I got a Root Shell on a Credit Card Terminal stefan-gloor.ch/yomani-hack

59 sats \ 0 comments \ @hn 1 Jun 2025 tech

The Most Insidious Trick Of AI Language Models www.zerohedge.com/markets/most-insidious-trick-ai-language-models

633 sats \ 12 comments \ @Rothbardian_fanatic 15 Aug 2025 AI

Sleep regularity is a stronger predictor of mortality risk than sleep duration academic.oup.com/sleep/article/47/1/zsad253/7280269

247 sats \ 2 comments \ @hn 2 Nov 2024 tech

Palette lighting tricks on the Nintendo 64 30fps.net/pages/palette-lighting-tricks-n64/

10 sats \ 0 comments \ @hn 17 May 2025 tech

Visualizing World War II nathangoldwag.wordpress.com/2024/10/26/visualizing-the-past-world-war-ii/

215 sats \ 0 comments \ @hn 12 Nov 2024 tech

Monitor your security cameras with locally processed AI frigate.video/

120 sats \ 0 comments \ @hn 5 Aug 2025 tech

Understanding Memory Management, Part 2: C++ and RAII educatedguesswork.org/posts/memory-management-2/

10 sats \ 0 comments \ @hn 9 Mar 2025 tech

The Pile is an 825 GiB diverse, open-source language modelling data set pile.eleuther.ai/

20 sats \ 1 comment \ @hn 7 Mar 2024 tech

Show HN: Wordllama – Things you can do with the token embeddings of an LLM github.com/dleemiller/WordLlama

131 sats \ 0 comments \ @hn 15 Sep 2024 tech

Tinker a flexible API for fine-tuning language models thinkingmachines.ai/blog/announcing-tinker/

136 sats \ 0 comments \ @carter 7 Oct 2025 AI

Large Language Models Pass the Turing Test arxiv.org/pdf/2503.23674

364 sats \ 11 comments \ @south_korea_ln 15 Apr 2025 AI

Just How Resilient Are Large Language Models? www.rdrocket.com/blog/just-how-resilient-are-large-language-models

157 sats \ 0 comments \ @carter 29 Sep 2025 AI

Large Language Models explained briefly www.youtube.com/watch?v=LPZh9BOjkQs&ab_channel=3Blue1Brown

307 sats \ 2 comments \ @south_korea_ln 22 Nov 2024 science