Ferret: A Multimodal Large Language Model by Apple
github.com/apple/ml-ferret
10 sats
\
1 comment
\
@zuspotirko
23 Dec 2023
AI
related
RAG to ReST: A Survey of Advanced Techniques in Large Language Model Development
www.marktechpost.com/2024/07/22/from-rag-to-rest-a-survey-of-advanced-techniques-in-large-language-model-development/
21 sats
\
0 comments
\
@ch0k1
23 Jul 2024
news
Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation
arxiv.org/abs/2406.02528
110 sats
\
1 comment
\
@0xbitcoiner
10 Jun 2024
science
freebie
How large are large language models?
gist.github.com/rain-1/cf0419958250d15893d8873682492c3e
201 sats
\
0 comments
\
@carter
14 Jul
AI
The Pile is an 825 GiB diverse, open-source language modelling data set
pile.eleuther.ai/
20 sats
\
1 comment
\
@hn
7 Mar 2024
tech
PlebAI - an AI Chatbot that relies solely on open-source large Language Models
plebai.com/
216 sats
\
8 comments
\
@k00b
26 Jul 2023
bitcoin
01-AI/Yi: A series of large language models trained from scratch
github.com/01-ai/Yi
10 sats
\
1 comment
\
@hn
6 Nov 2023
tech
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
LLM Daydreaming
gwern.net/ai-daydreaming
319 sats
\
2 comments
\
@k00b
16 Jul
AI
OpenChat: Advancing Open-source Language Models with Imperfect Data
github.com/imoneoi/openchat
61 sats
\
0 comments
\
@ama
15 Nov 2023
tech
Large Language Models explained briefly
www.youtube.com/watch?v=LPZh9BOjkQs&ab_channel=3Blue1Brown
307 sats
\
2 comments
\
@south_korea_ln
22 Nov 2024
science
InternVL3.5: Advancing Open-Source Multimodal Models
arxiv.org/abs/2508.18265
147 sats
\
0 comments
\
@optimism
26 Aug
AI
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
620 sats
\
1 comment
\
@k00b
8 Feb
AI
The AI Dilemma: When Large Language Model Training Reaches A Dead End
medium.com/@jankammerath/the-ai-dilemma-when-large-language-model-training-reaches-a-dead-end-e2cf1de4a2ad
10 sats
\
0 comments
\
@BitcoinIsTheFuture
10 Mar 2024
econ
How Meta trains large language models at scale
engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/
227 sats
\
0 comments
\
@hn
13 Jun 2024
tech
Intro to Large Language Models - Andrej Karpathy
youtu.be/zjkBMFhNj_g
31 sats
\
0 comments
\
@dk
7 Jan 2024
videos
Better and Faster Large Language Models via Multi-Token Prediction
arxiv.org/abs/2404.19737
21 sats
\
0 comments
\
@hn
1 May 2024
tech
“Imprecise” language models are smaller, speedier, and nearly as accurate
spectrum.ieee.org/1-bit-llm
10 sats
\
0 comments
\
@hn
31 May 2024
tech
Tracing the thoughts of a large language model
www.anthropic.com/research/tracing-thoughts-language-model
10 sats
\
0 comments
\
@hn
27 Mar
tech
Smartphones will be obsolete in 10 years, says Meta's AI Chief
analyticsindiamag.com/smartphones-will-be-obsolete-in-10-years-says-metas-ai-chief/
21 sats
\
1 comment
\
@ch0k1
30 Apr 2024
tech
Mixtral 8x7B: A Sparse Mixture of Experts language model
arxiv.org/abs/2401.04088
51 sats
\
1 comment
\
@hn
9 Jan 2024
tech