Automatically Detecting Under-Trained Tokens in Large Language Models
arxiv.org/abs/2405.05417
31 sats
0 comments
@hn
12 May 2024
tech
related
Better and Faster Large Language Models via Multi-Token Prediction
arxiv.org/abs/2404.19737
21 sats
0 comments
@hn
1 May 2024
tech
The AI Dilemma: When Large Language Model Training Reaches A Dead End
medium.com/@jankammerath/the-ai-dilemma-when-large-language-model-training-reaches-a-dead-end-e2cf1de4a2ad
10 sats
0 comments
@BitcoinIsTheFuture
10 Mar 2024
econ
“Imprecise” language models are smaller, speedier, and nearly as accurate
spectrum.ieee.org/1-bit-llm
10 sats
0 comments
@hn
31 May 2024
tech
OpenMPTCProuter: Aggregate and encrypt multiple internet connections using MPTCP
www.openmptcprouter.com/
11 sats
0 comments
@hn
23 Nov 2024
tech
CVE-2025-48384: Breaking Git with a carriage return and cloning RCE
dgl.cx/2025/07/git-clone-submodule-cve-2025-48384
21 sats
0 comments
@hn
8 Jul 2025
tech
How Meta trains large language models at scale
engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/
227 sats
0 comments
@hn
13 Jun 2024
tech
Matters Computational: Ideas, Algorithms, Source Code – Jörg Arndt [pdf]
www.jjj.de/fxt/fxtbook.pdf
85 sats
0 comments
@hn
7 Mar 2025
tech
Ghosts in the ROM (2012)
www.nycresistor.com/2012/08/21/ghosts-in-the-rom/
32 sats
0 comments
@hn
26 Jun 2024
tech
How I got a Root Shell on a Credit Card Terminal
stefan-gloor.ch/yomani-hack
59 sats
0 comments
@hn
1 Jun 2025
tech
The Most Insidious Trick Of AI Language Models
www.zerohedge.com/markets/most-insidious-trick-ai-language-models
633 sats
12 comments
@Rothbardian_fanatic
15 Aug 2025
AI
Sleep regularity is a stronger predictor of mortality risk than sleep duration
academic.oup.com/sleep/article/47/1/zsad253/7280269
247 sats
2 comments
@hn
2 Nov 2024
tech
Palette lighting tricks on the Nintendo 64
30fps.net/pages/palette-lighting-tricks-n64/
10 sats
0 comments
@hn
17 May 2025
tech
Visualizing World War II
nathangoldwag.wordpress.com/2024/10/26/visualizing-the-past-world-war-ii/
215 sats
0 comments
@hn
12 Nov 2024
tech
Monitor your security cameras with locally processed AI
frigate.video/
120 sats
0 comments
@hn
5 Aug 2025
tech
Understanding Memory Management, Part 2: C++ and RAII
educatedguesswork.org/posts/memory-management-2/
10 sats
0 comments
@hn
9 Mar 2025
tech
The Pile is an 825 GiB diverse, open-source language modelling data set
pile.eleuther.ai/
20 sats
1 comment
@hn
7 Mar 2024
tech
Show HN: Wordllama – Things you can do with the token embeddings of an LLM
github.com/dleemiller/WordLlama
131 sats
0 comments
@hn
15 Sep 2024
tech
Tinker: a flexible API for fine-tuning language models
thinkingmachines.ai/blog/announcing-tinker/
136 sats
0 comments
@carter
7 Oct 2025
AI
Large Language Models Pass the Turing Test
arxiv.org/pdf/2503.23674
364 sats
11 comments
@south_korea_ln
15 Apr 2025
AI
Just How Resilient Are Large Language Models?
www.rdrocket.com/blog/just-how-resilient-are-large-language-models
157 sats
0 comments
@carter
29 Sep 2025
AI
Large Language Models explained briefly
www.youtube.com/watch?v=LPZh9BOjkQs&ab_channel=3Blue1Brown
307 sats
2 comments
@south_korea_ln
22 Nov 2024
science