@anon
sign up
@anon
sign up
pull down to refresh
Extracting memorized pieces of (copyrighted) books from open-weight llm models
arxiv.org/pdf/2505.12546
2342 sats
\
2 comments
\
@carter
24 Jun
AI
related
NVIDIA: Copyrighted Books Are Just Statistical Correlations to Our AI Models
42 sats
\
1 comment
\
@byzantine
17 Aug 2024
tech
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
98 sats
\
1 comment
\
@kepford
6 May
AI
Intro to Large Language Models - Andrej Karpathy
youtu.be/zjkBMFhNj_g
31 sats
\
0 comments
\
@dk
7 Jan 2024
videos
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
Replit - How to train your own LLM Models
blog.replit.com/llm-training
11 sats
\
1 comment
\
@hn
20 Apr 2023
tech
LLM-Deflate: Extracting LLMs Into Datasets
www.scalarlm.com/blog/llm-deflate-extracting-llms-into-datasets/
100 sats
\
1 comment
\
@carter
29 Sep
AI
LLMs use a surprisingly simple mechanism to retrieve some stored knowledge
news.mit.edu/2024/large-language-models-use-surprisingly-simple-mechanism-retrieve-stored-knowledge-0325
128 sats
\
1 comment
\
@hn
31 Mar 2024
tech
NVIDIA: Copyrighted Books Are Just Statistical Correlations to Our AI Models
10 sats
\
0 comments
\
@byzantine
17 Aug 2024
tech
How to turn any LLM into an embedding model
bdtechtalks.com/2024/04/22/llm2vec/
10 sats
\
0 comments
\
@ch0k1
23 Apr 2024
tech
What We Know About LLMs (A Primer)
willthompson.name/what-we-know-about-llms-primer
163 sats
\
1 comment
\
@hn
25 Jul 2023
tech
Improving LLM information retrieval: ETL to ECL (Extract-Contextualize-Load)
medium.com/enterprise-rag/improving-llm-information-retrieval-etl-to-ecl-extract-contextualize-load-12a4ac259faa
76 sats
\
0 comments
\
@BitcoinIsTheFuture
18 Mar 2024
econ
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
Awesome Llm Apps: Collection of awesome LLM apps with RAG using OpenAI...
github.com/Shubhamsaboo/awesome-llm-apps
178 sats
\
1 comment
\
@Rsync25
15 Jun 2024
opensource
DBRX: A new open LLM
www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
10 sats
\
1 comment
\
@hn
31 Mar 2024
tech
Efficient LLM Inference
arxiv.org/abs/2507.14397
121 sats
\
0 comments
\
@carter
3 Oct
AI
OpenCoder: Open-Source LLM for Coding
arxiv.org/abs/2411.04905
52 sats
\
0 comments
\
@hn
9 Nov 2024
tech
Coping with dumb LLMs using classic ML
softwaredoug.com/blog/2025/01/21/llm-judge-decision-tree
31 sats
\
0 comments
\
@hn
24 Jan
tech
Sampling and structured outputs in LLMs
parthsareen.com/blog.html#sampling.md
157 sats
\
0 comments
\
@carter
23 Sep
AI
LLM Engineer's Handbook: Master the art of engineering large language models
www.amazon.com/LLM-Engineers-Handbook-engineering-production/dp/1836200072/
53 sats
\
0 comments
\
@Rsync25
19 Nov 2024
BooksAndArticles
LLMs aren’t world models
yosefk.com/blog/llms-arent-world-models.html
121 sats
\
0 comments
\
@carter
13 Aug
AI
Things we learned about LLMs in 2024
simonwillison.net/2024/Dec/31/llms-in-2024/
370 sats
\
0 comments
\
@Rsync25
31 Dec 2024
tech
more