@anon
sign up
@anon
sign up
pull down to refresh
Meta Open-Sources MEGALODON LLM for Efficient Long Sequence Modeling
www.infoq.com/news/2024/06/meta-llm-megalodon/
115 sats
\
0 comments
\
@TheWildHustle
11 Jun 2024
opensource
related
NVIDIA: Transforming LLM Alignment with Efficient Reinforcement Learning
www.marktechpost.com/2024/05/05/nvidia-ai-open-sources-nemo-aligner-transforming-large-language-model-alignment-with-efficient-reinforcement-learning/
20 sats
\
0 comments
\
@ch0k1
7 May 2024
tech
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
98 sats
\
1 comment
\
@kepford
6 May 2025
AI
Researchers discover impressive learning capabilities in long-context LLMs
venturebeat.com/ai/deepmind-researchers-discover-impressive-learning-capabilities-in-long-context-llms/
297 sats
\
0 comments
\
@ch0k1
25 Apr 2024
tech
Scalable MatMul-Free Language Modeling — 10x Reduction On LLMs Computation
arxiv.org/abs/2406.02528
110 sats
\
1 comment
\
@0xbitcoiner
10 Jun 2024
science
freebie
RAG to ReST: A Survey of Advanced Techniques in Large Language Model Development
www.marktechpost.com/2024/07/22/from-rag-to-rest-a-survey-of-advanced-techniques-in-large-language-model-development/
21 sats
\
0 comments
\
@ch0k1
23 Jul 2024
news
Elia: An Open Source Terminal UI for Interacting with LLMs
www.marktechpost.com/2024/05/25/elia-an-open-source-terminal-ui-for-interacting-with-llms/
21 sats
\
0 comments
\
@ch0k1
26 May 2024
news
Compiling LLMs into a MegaKernel: A path to low-latency inference
zhihaojia.medium.com/compiling-llms-into-a-megakernel-a-path-to-low-latency-inference-cf7840913c17
10 sats
\
0 comments
\
@hn
19 Jun 2025
tech
LiveBench - A Challenging, Contamination-Free LLM Benchmark
livebench.ai
161 sats
\
0 comments
\
@supratic
17 Jul 2025
AI
LLM-Deflate: Extracting LLMs Into Datasets
www.scalarlm.com/blog/llm-deflate-extracting-llms-into-datasets/
100 sats
\
1 comment
\
@carter
29 Sep 2025
AI
Apple just released an interesting diffusion based coding language model
9to5mac.com/2025/07/04/apple-just-released-a-weirdly-interesting-coding-language-model/
131 sats
\
1 comment
\
@carter
8 Jul 2025
AI
OpenCoder: Open-Source LLM for Coding
arxiv.org/abs/2411.04905
52 sats
\
0 comments
\
@hn
9 Nov 2024
tech
LLM in a Flash: Efficient LLM Inference with Limited Memory
huggingface.co/papers/2312.11514
13 sats
\
1 comment
\
@hn
20 Dec 2023
tech
Falcon Chat Demo (Falcon 40B Instruct)
10 sats
\
5 comments
\
@mudbloodvonfrei
14 Jun 2023
tech
Deep Dive into LLMs like ChatGPT
www.youtube.com/watch?v=7xTGNNLPyMI
620 sats
\
1 comment
\
@k00b
8 Feb 2025
AI
LLM evaluation at scale with the NeurIPS Efficiency Challenge
blog.mozilla.ai/exploring-llm-evaluation-at-scale-with-the-neurips-large-language-model-efficiency-challenge/
110 sats
\
0 comments
\
@localhost
22 Feb 2024
tech
ATLAS: A New Paradigm in LLM Inference via Runtime-Learning Accelerators
www.together.ai/blog/adaptive-learning-speculator-system-atlas
100 sats
\
0 comments
\
@carter
14 Oct 2025
AI
Awesome Llm Apps: Collection of awesome LLM apps with RAG using OpenAI...
github.com/Shubhamsaboo/awesome-llm-apps
178 sats
\
1 comment
\
@Rsync25
15 Jun 2024
opensource
LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
github.com/hiyouga/LLaMA-Factory
157 sats
\
0 comments
\
@carter
19 Sep 2025
AI
Tools for Learning LLVM TableGen
blog.llvm.org/posts/2023-12-07-tools-for-learning-llvm-tablegen/
13 sats
\
1 comment
\
@hn
13 Dec 2023
tech
DBRX: A new open LLM
www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
10 sats
\
1 comment
\
@hn
31 Mar 2024
tech
No More Floating Points, The Era of 1.58-bit Large Language Models
medium.com/ai-insights-cobet/no-more-floating-points-the-era-of-1-58-bit-large-language-models-b9805879ac0a
100 sats
\
1 comment
\
@0xbitcoiner
11 Mar 2024
science
freebie
more