@anon
sign up
@anon
sign up
pull down to refresh
Reflective Prompt Evolution Can Outperform Reinforcement Learning
arxiviq.substack.com/p/gepa-reflective-prompt-evolution
110 sats
\
0 comments
\
@carter
31 Jul
AI
related
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents
arxiv.org/abs/2505.22954
50 sats
\
0 comments
\
@0xbitcoiner
11 Jun
tech
Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
arxiv.org/abs/2508.07976
183 sats
\
0 comments
\
@optimism
13 Aug
AI
Agentic Reinforced Policy Optimization
arxiv.org/abs/2507.19849
141 sats
\
0 comments
\
@optimism
29 Jul
AI
Artificial Intelligence versus Original Intelligence.
476 sats
\
3 comments
\
@traceur66
28 Apr 2023
bitcoin
freebie
To Understand AI, Watch How It Evolves
www.quantamagazine.org/to-understand-ai-watch-how-it-evolves-20250924/
100 sats
\
0 comments
\
@0xbitcoiner
24 Sep
AI
The Darwin Gödel Machine: An AI that improves itself by rewriting its own code
sakana.ai/dgm/
200 sats
\
0 comments
\
@carter
30 May
tech
An AI Disproof of Evolution
voxday.net/2024/01/12/an-ai-disproof-of-evolution/
200 sats
\
0 comments
\
@398ja
15 Jan 2024
science
Emerging Reasoning with Reinforcement Learning
hkust-nlp.notion.site/simplerl-reason
9 sats
\
0 comments
\
@hn
26 Jan
tech
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
arxiv.org/abs/2509.07980
218 sats
\
0 comments
\
@optimism
10 Sep
AI
The New Skill in AI Is Not Prompting, It's Context Engineering
www.philschmid.de/context-engineering
59 sats
\
0 comments
\
@hn
30 Jun
tech
Kimi K1.5: Scaling Reinforcement Learning with LLMs
github.com/MoonshotAI/Kimi-k1.5
14 sats
\
0 comments
\
@hn
21 Jan
tech
The Era of Experience & The Age of Design
www.youtube.com/watch?v=FLOL2f4iHKA
33 sats
\
0 comments
\
@deSign_r
14 Jul
Design
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
www.nature.com/articles/s41586-025-09422-z
121 sats
\
0 comments
\
@carter
19 Sep
AI
Show HN: Evolving Agents Framework
github.com/matiasmolinas/evolving-agents
10 sats
\
0 comments
\
@hn
9 Mar
tech
Learnings from building AI agents
www.cubic.dev/blog/learnings-from-building-ai-agents
47 sats
\
0 comments
\
@hn
30 Jun
tech
Taking a step back to rethink Generative AI
4628 sats
\
11 comments
\
@cryotosensei
2 Nov 2023
meta
⚡️ Gödel’s Therapy Room: Leaderboard Breakdown, May 5, 2025
113 sats
\
0 comments
\
@geeknik
5 May
AI
Q-learning is not yet scalable
seohong.me/blog/q-learning-is-not-yet-scalable/
10 sats
\
0 comments
\
@hn
15 Jun
tech
AI Is Learning to Escape Human Control
www.wsj.com/opinion/ai-is-learning-to-escape-human-control-technology-model-code-programming-066b3ec5
30 sats
\
0 comments
\
@Coinsreporter
2 Jun
tech
AI Darwin Awards 2025 - Celebrating Spectacularly Bad AI Decisions
aidarwinawards.org
257 sats
\
2 comments
\
@0xbitcoiner
9 Sep
AI
Debate May Help AI Models Converge on Truth
www.quantamagazine.org/debate-may-help-ai-models-converge-on-truth-20241108/
237 sats
\
0 comments
\
@0xbitcoiner
8 Nov 2024
science
more