@anon
sign up
@anon
sign up
pull down to refresh
The Illustrated GPT-2: Visualizing Transformer Language Models (2019)
jalammar.github.io/illustrated-gpt2/
69 sats
\
1 comment
\
@hn
19 Dec 2023
tech
related
All World Languages in One Visualization
532 sats
\
2 comments
\
@jakoyoh629
16 Nov 2024
charts_and_numbers
RAG to ReST: A Survey of Advanced Techniques in Large Language Model Development
www.marktechpost.com/2024/07/22/from-rag-to-rest-a-survey-of-advanced-techniques-in-large-language-model-development/
21 sats
\
0 comments
\
@ch0k1
23 Jul 2024
news
How large are large language models?
gist.github.com/rain-1/cf0419958250d15893d8873682492c3e
201 sats
\
0 comments
\
@carter
14 Jul
AI
To Understand AI, Watch How It Evolves
www.quantamagazine.org/to-understand-ai-watch-how-it-evolves-20250924/
100 sats
\
0 comments
\
@0xbitcoiner
24 Sep
AI
The Pile is a 825 GiB diverse, open-source language modelling data set
pile.eleuther.ai/
20 sats
\
1 comment
\
@hn
7 Mar 2024
tech
LLM Engineer's Handbook: Master the art of engineering large language models
www.amazon.com/LLM-Engineers-Handbook-engineering-production/dp/1836200072/
53 sats
\
0 comments
\
@Rsync25
19 Nov 2024
BooksAndArticles
Large Language Models explained briefly
www.youtube.com/watch?v=LPZh9BOjkQs&ab_channel=3Blue1Brown
307 sats
\
2 comments
\
@south_korea_ln
22 Nov 2024
science
“Imprecise” language models are smaller, speedier, and nearly as accurate
spectrum.ieee.org/1-bit-llm
10 sats
\
0 comments
\
@hn
31 May 2024
tech
Elia: An Open Source Terminal UI for Interacting with LLMs
www.marktechpost.com/2024/05/25/elia-an-open-source-terminal-ui-for-interacting-with-llms/
21 sats
\
0 comments
\
@ch0k1
26 May 2024
news
Development with Large Language Models (2023)
www.freecodecamp.org/news/development-with-large-language-models/
10 sats
\
0 comments
\
@Rsync25
23 Sep 2024
tech
How Attention Sinks Keep Language Models Stable
hanlab.mit.edu/blog/streamingllm
110 sats
\
0 comments
\
@carter
8 Aug
AI
LLM Alignment: Reward-Based vs Reward-Free Methods
towardsdatascience.com/llm-alignment-reward-based-vs-reward-free-methods-ef0c0f6e8d88?gi=90f7a78bfcff
17 sats
\
0 comments
\
@ch0k1
6 Jul 2024
news
OpenChat: Advancing Open-source Language Models with Imperfect Data
github.com/imoneoi/openchat
61 sats
\
0 comments
\
@ama
15 Nov 2023
tech
But what is a GPT? Visual intro to Transformers | Deep learning, chapter 5
m.youtube.com/watch?v=wjZofJX0v4M
1000 sats
\
0 comments
\
@south_korea_ln
2 Apr 2024
science
But what is a GPT? Visual intro to transformers, the T in GPT
www.youtube.com/watch?v=wjZofJX0v4M
747 sats
\
9 comments
\
@k00b
8 Apr 2024
AI
Anthropic can now track the bizarre inner workings of a large language model
www.technologyreview.com/2025/03/27/1113916/anthropic-can-now-track-the-bizarre-inner-workings-of-a-large-language-model/
166 sats
\
0 comments
\
@south_korea_ln
1 Apr
science
Scramble: Open-Source Grammarly Alternative
github.com/zlwaterfield/scramble
215 sats
\
1 comment
\
@jennann
22 Sep 2024
tech
Large language models, explained with a minimum of math and jargon
www.understandingai.org/p/large-language-models-explained-with
10 sats
\
0 comments
\
@byzantine
29 Jul 2023
tech
On the Biology of a Large Language Model
transformer-circuits.pub/2025/attribution-graphs/biology.html
50 sats
\
0 comments
\
@carter
28 Mar
AI
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
Large Language Models for Data Annotation and Synthesis: A Survey
github.com/Zhen-Tan-dmml/LLM4Annotation
21 sats
\
0 comments
\
@Rsync25
26 Dec 2024
tech
more