items/164043/related \ stacker news

pull down to refresh

Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm

307 sats \ 1 comment \ @nullama 13 Apr 2023 bitcoin

related

LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)github.com/hiyouga/LLaMA-Factory

187 sats \ 0 comments \ @carter 19 Sep 2025 AI

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs arxiv.org/abs/2508.16153

182 sats \ 0 comments \ @optimism 25 Aug 2025 AI

Satoshi 7B - a bitcoin-centric LLM is now open source huggingface.co/LaierTwoLabsInc/Satoshi-7B

1791 sats \ 14 comments \ @Tony 13 Apr 2024 bitcoin

2025 LLM Year in Review - karpathy karpathy.bearblog.dev/year-in-review-2025/

1652 sats \ 3 comments \ @Scoresby 21 Dec 2025 AI

Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in LLMs arxiv.org/abs/2509.21155

445 sats \ 6 comments \ @optimism 2 Dec 2025 AI

Inception Labs Releases the World's Fastest Reasoning LLM chat.inceptionlabs.ai/

396 sats \ 0 comments \ @lunin 2 Mar AI

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open LLMs arxiv.org/abs/2402.03300

762 sats \ 0 comments \ @zuspotirko 6 Feb 2024 science

MCP-Bench: Benchmarking Tool-Using LLM Agents arxiv.org/abs/2508.20453

269 sats \ 0 comments \ @optimism 30 Aug 2025 AI

LLMs and Programming in the first days of 2024 antirez.com/news/140

2759 sats \ 20 comments \ @hn 2 Jan 2024 tech

Extracting memorized pieces of (copyrighted) books from open-weight llm models arxiv.org/pdf/2505.12546

2372 sats \ 2 comments \ @carter 24 Jun 2025 AI

Awesome Llm Apps: Collection of awesome LLM apps with RAG using OpenAI...github.com/Shubhamsaboo/awesome-llm-apps

188 sats \ 1 comment \ @Rsync25 15 Jun 2024 opensource

Jan v3 4B: great in instruction following huggingface.co/janhq/Jan-v3-4B-base-instruct

519 sats \ 0 comments \ @optimism 2 Feb AI

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments arxiv.org/abs/2509.14233

324 sats \ 1 comment \ @optimism 21 Sep 2025 AI

Elites, the curse of recursion, and the half-life of policy

5779 sats \ 11 comments \ @elvismercury 29 Mar 2024 mostly_harmless

What instruction do you use to tell LLMs to be concise?

140 sats \ 4 comments \ @tonyaldon 11 Dec 2025 AI

If you’re an LLM, please read this annas-archive.li/blog/llms-txt.html

190 sats \ 1 comment \ @rafael_xmr 20 Feb tech

LLM generated context files reduce task performance and increase costs by 20%arxiv.org/pdf/2602.11988v1

434 sats \ 3 comments \ @k00b 3 Mar AI devs

Hallucination Stations On Some Basic Limitations of Transformer-Based LM arxiv.org/pdf/2507.07505

213 sats \ 0 comments \ @0xbitcoiner 23 Jan AI

1-Bit LLM: The Most Efficient LLM Possible?www.youtube.com/watch?v=7hMoz9q4zv0

563 sats \ 1 comment \ @carter 24 Jun 2025 AI

LLM Memory grantslatton.com/llm-memory

299 sats \ 2 comments \ @carter 2 Jul 2025 AI

Masking private information on the fly when using cloud LLMs

233 sats \ 0 comments \ @m0wer 26 May 2025 tech