items/1038765/related \ stacker news

pull down to refresh

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs arxiv.org/abs/2502.17424

227 sats \ 6 comments \ @carter 14 Jul 2025 AI

related

Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences arxiv.org/abs/2510.06105

231 sats \ 1 comment \ @carter 9 Oct 2025 AI

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs arxiv.org/abs/2512.09742

401 sats \ 2 comments \ @Scoresby 14 Dec 2025 AI

Systemic Misalignment www.systemicmisalignment.com/

185 sats \ 0 comments \ @carter 30 Jun 2025 AI

Agentic Misalignment: How LLMs could be insider threats www.anthropic.com/research/agentic-misalignment

130 sats \ 0 comments \ @carter 8 Aug 2025 AI

Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs arxiv.org/abs/2509.21155

445 sats \ 6 comments \ @optimism 2 Dec 2025 AI

2025 LLM Year in Review - karpathy karpathy.bearblog.dev/year-in-review-2025/

1652 sats \ 3 comments \ @Scoresby 21 Dec 2025 AI

Political censorship in large language models originating from China academic.oup.com/pnasnexus/article/5/2/pgag013/8487339

251 sats \ 1 comment \ @0xbitcoiner 27 Feb AI

Context Rot: How Increasing Input Tokens Impacts LLM Performance research.trychroma.com/context-rot

334 sats \ 2 comments \ @Scoresby 14 Jul 2025 AI

The simulation of judgment in LLMs - PNAS www.pnas.org/doi/10.1073/pnas.2518443122

244 sats \ 5 comments \ @Scoresby 15 Oct 2025 AI

Elites, the curse of recursion, and the half-life of policy

5779 sats \ 11 comments \ @elvismercury 29 Mar 2024 mostly_harmless

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection arxiv.org/abs/2510.04849v1

433 sats \ 2 comments \ @optimism 19 Oct 2025 AI

LLMs Can Get Brain Rot llm-brain-rot.github.io/

287 sats \ 0 comments \ @Scoresby 21 Oct 2025 AI

LLMs generate slop because they avoid surprises by design - Dan Fabulich danfabulich.medium.com/llms-tell-bad-jokes-because-they-avoid-surprises-7f111aac4f96

373 sats \ 2 comments \ @Scoresby 19 Aug 2025 AI

The week in AI, October 6-12, 2025

991 sats \ 2 comments \ @optimism 13 Oct 2025 AI

If you think LLMs produce overly defensive code, you're not alone

150 sats \ 0 comments \ @tonyaldon 21 Dec 2025 devs

LLMs: a bleak future ahead?lcamtuf.substack.com/p/llms-a-bleak-future-ahead

266 sats \ 5 comments \ @cointastical 9 Jan 2023 bitcoin

Hallucination Stations On Some Basic Limitations of Transformer-Based LM arxiv.org/pdf/2507.07505

213 sats \ 0 comments \ @0xbitcoiner 23 Jan AI

Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning github.com/unslothai/unsloth

31 sats \ 2 comments \ @hn 2 Dec 2023 tech

If you’re an LLM, please read this annas-archive.li/blog/llms-txt.html

190 sats \ 1 comment \ @rafael_xmr 20 Feb tech

More Artificial than Intelligent, it is only getting worse - Mathjis Lagerberg mlagerberg.com/much-a-little-i-and-it-is-not-getting-better/

247 sats \ 4 comments \ @Scoresby 15 Jul 2025 AI

Why do LLMs have emergent properties?www.johndcook.com/blog/2025/05/08/why-do-llms-have-emergent-properties/

561 sats \ 0 comments \ @k00b 9 May 2025 tech