@anon
sign up
@anon
sign up
pull down to refresh
LLMs are getting better at character-level text manipulation
blog.burkert.me/posts/llm_evolution_character_manipulation/
157 sats
\
0 comments
\
@carter
14 Oct
AI
related
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
304 sats
\
2 comments
\
@Scoresby
14 Jul
AI
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
98 sats
\
1 comment
\
@kepford
6 May
AI
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
Here’s What’s Really Going On Inside An LLM’s Neural Network
116 sats
\
0 comments
\
@0xbitcoiner
22 May 2024
BooksAndArticles
Apple just released an interesting diffusion based coding language model
9to5mac.com/2025/07/04/apple-just-released-a-weirdly-interesting-coding-language-model/
131 sats
\
1 comment
\
@carter
8 Jul
AI
From Artificial Needles to Real Haystacks: Improving Capabilities in LLMs
arxiv.org/abs/2406.19292
21 sats
\
0 comments
\
@Rsync25
29 Jun 2024
alter_native
LLM Alignment: Reward-Based vs Reward-Free Methods
towardsdatascience.com/llm-alignment-reward-based-vs-reward-free-methods-ef0c0f6e8d88?gi=90f7a78bfcff
17 sats
\
0 comments
\
@ch0k1
6 Jul 2024
news
LLM-Deflate: Extracting LLMs Into Datasets
www.scalarlm.com/blog/llm-deflate-extracting-llms-into-datasets/
100 sats
\
1 comment
\
@carter
29 Sep
AI
Show HN: Wordllama – Things you can do with the token embeddings of an LLM
github.com/dleemiller/WordLlama
131 sats
\
0 comments
\
@hn
15 Sep 2024
tech
Llama.vim – Local LLM-assisted text completion
github.com/ggml-org/llama.vim
24 sats
\
0 comments
\
@hn
23 Jan
tech
Things we learned about LLMs in 2024
simonwillison.net/2024/Dec/31/llms-in-2024/
370 sats
\
0 comments
\
@Rsync25
31 Dec 2024
tech
Lessons learned from programming with LLMs
crawshaw.io/blog/programming-with-llms
120 sats
\
1 comment
\
@m0wer
5 Jul
AI
LLM Engineer's Handbook: Master the art of engineering large language models
www.amazon.com/LLM-Engineers-Handbook-engineering-production/dp/1836200072/
53 sats
\
0 comments
\
@Rsync25
19 Nov 2024
BooksAndArticles
Building LLMs from the Ground Up: A 3-hour Coding Workshop
magazine.sebastianraschka.com/p/building-llms-from-the-ground-up
55 sats
\
0 comments
\
@Rsync25
31 Aug 2024
tech
Small LLMs Can Beat Large Ones at 5-30x Lower Cost with Automated Data Curation
www.tensorzero.com/blog/fine-tuned-small-llms-can-beat-large-ones-at-5-30x-lower-cost-with-programmatic-data-curation/
274 sats
\
1 comment
\
@carter
5 Aug
AI
AI and the Abdication of Thought
www.psychologytoday.com/intl/blog/the-digital-self/202411/ai-and-the-abdication-of-thought
302 sats
\
2 comments
\
@ch0k1
25 Nov 2024
tech
Elia: An Open Source Terminal UI for Interacting with LLMs
www.marktechpost.com/2024/05/25/elia-an-open-source-terminal-ui-for-interacting-with-llms/
21 sats
\
0 comments
\
@ch0k1
26 May 2024
news
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
arxiv.org/abs/2509.14233
294 sats
\
1 comment
\
@optimism
21 Sep
AI
Show HN: Transductive regular expressions for text editing
github.com/c0stya/trre
9 sats
\
0 comments
\
@hn
7 Feb
tech
Coping with dumb LLMs using classic ML
softwaredoug.com/blog/2025/01/21/llm-judge-decision-tree
31 sats
\
0 comments
\
@hn
24 Jan
tech
more