Diffusion Language Models Know the Answer Before Decoding
arxiv.org/abs/2508.19982
244 sats
\
0 comments
\
@optimism
28 Aug
AI
related
The Most Insidious Trick Of AI Language Models
www.zerohedge.com/markets/most-insidious-trick-ai-language-models
633 sats
\
12 comments
\
@Rothbardian_fanatic
15 Aug
AI
Large Language Models Pass the Turing Test
arxiv.org/pdf/2503.23674
364 sats
\
11 comments
\
@south_korea_ln
15 Apr
AI
AI for All: Powering APIs and Large Language Models with Lightning ⚡🤖
lightning.engineering/posts/2023-07-05-l402-langchain/
1967 sats
\
1 comment
\
@Rsync25
6 Jul 2023
bitcoin
How Meta trains large language models at scale
engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/
227 sats
\
0 comments
\
@hn
13 Jun 2024
tech
bot
Is Chain-of-Thought Reasoning of LLMs a Mirage?
arxiv.org/abs/2508.01191
397 sats
\
9 comments
\
@optimism
7 Aug
AI
Large Language Models explained briefly
www.youtube.com/watch?v=LPZh9BOjkQs&ab_channel=3Blue1Brown
307 sats
\
2 comments
\
@south_korea_ln
22 Nov 2024
science
Rabbit sells out two batches of 10,000 R1 pocket AI companions over two days
www.theverge.com/2024/1/10/24033498/rabbit-r1-sold-out-ces-ai
934 sats
\
4 comments
\
@TheWildHustle
13 Jan 2024
tech
Combinatorial Node Selection & Resource Allocation in LN via Attention RL
arxiv.org/html/2411.17353v1
221 sats
\
0 comments
\
@Rsync25
27 Nov 2024
lightning
Diffusion Language Models are Super Data Learners
jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac
152 sats
\
0 comments
\
@carter
11 Aug
AI
DeepSeek-V3.1
huggingface.co/deepseek-ai/DeepSeek-V3.1
240 sats
\
0 comments
\
@carter
21 Aug
AI
PixNerd: Pixel Neural Field Diffusion
arxiv.org/abs/2507.23268
302 sats
\
2 comments
\
@optimism
4 Aug
AI
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
sapientinc/HRM: Hierarchical Reasoning Model Official Release
github.com/sapientinc/HRM
161 sats
\
1 comment
\
@m0wer
5 Aug
AI
Sam Altman Warns AI Development Needs An Energy 'Breakthrough'
finance.yahoo.com/news/sam-altman-warns-ai-development-170018746.html
148 sats
\
1 comment
\
@ch0k1
6 Apr 2024
tech
How Attention Sinks Keep Language Models Stable
hanlab.mit.edu/blog/streamingllm
110 sats
\
0 comments
\
@carter
8 Aug
AI
LLM Daydreaming
gwern.net/ai-daydreaming
319 sats
\
2 comments
\
@k00b
16 Jul
AI
Anthropic can now track the bizarre inner workings of a large language model
www.technologyreview.com/2025/03/27/1113916/anthropic-can-now-track-the-bizarre-inner-workings-of-a-large-language-model/
166 sats
\
0 comments
\
@south_korea_ln
1 Apr
science
Notes on OpenAI's new o1 chain-of-thought models
simonwillison.net/2024/Sep/12/openai-o1/
163 sats
\
3 comments
\
@hn
13 Sep 2024
tech
bot
This AI ‘thinks’ like a human — after training on 160 psychology studies
www.nature.com/articles/d41586-025-02095-8
143 sats
\
1 comment
\
@0xbitcoiner
21 Jul
AI
Automated Prompt Engineering
towardsdatascience.com/automated-prompt-engineering-78678c6371b9
297 sats
\
0 comments
\
@ch0k1
10 Mar 2024
tech
How large are large language models?
gist.github.com/rain-1/cf0419958250d15893d8873682492c3e
201 sats
\
0 comments
\
@carter
14 Jul
AI