Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning
github.com/unslothai/unsloth
21 sats
2 comments
@hn
2 Dec 2023
tech
related
OpenAI offers free GPT-4o Mini fine-tuning to counter Meta’s Llama 3.1 release
venturebeat.com/ai/ai-arms-race-escalates-openai-offers-free-gpt-4o-mini-fine-tuning-to-counter-metas-llama-3-1-release/
21 sats
0 comments
@ch0k1
25 Jul 2024
news
Context Rot: How Increasing Input Tokens Impacts LLM Performance
research.trychroma.com/context-rot
304 sats
2 comments
@Scoresby
14 Jul
AI
tloen/alpaca-lora: an instruct tuned LLaMA model
github.com/tloen/alpaca-lora
10 sats
0 comments
@Semisol
15 Mar 2023
bitcoin
Orchard - Lightning, Cashu, Tether, Llama
orchard.space/
427 sats
5 comments
@Scoresby
24 Jun
lightning
SlowLlama: Finetune llama2-70B and codellama on MacBook Air without quantization
github.com/okuvshynov/slowllama
10 sats
1 comment
@hn
6 Oct 2023
tech
cocktailpeanut/dalai: The simplest way to run LLaMA on your local machine
github.com/cocktailpeanut/dalai
247 sats
0 comments
@random_
24 Mar 2023
bitcoin
AMD's MI300X Outperforms Nvidia's H100 for LLM Inference
www.blog.tensorwave.com/amds-mi300x-outperforms-nvidias-h100-for-llm-inference/
202 sats
0 comments
@hn
13 Jun 2024
tech
Meta releases the biggest and best open-source AI model yet
www.theverge.com/2024/7/23/24204055/meta-ai-llama-3-1-open-source-assistant-openai-chatgpt
22 sats
2 comments
@ch0k1
23 Jul 2024
news
LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
github.com/hiyouga/LLaMA-Factory
157 sats
0 comments
@carter
19 Sep
AI
Awesome Llm Apps: Collection of awesome LLM apps with RAG using OpenAI...
github.com/Shubhamsaboo/awesome-llm-apps
178 sats
1 comment
@Rsync25
15 Jun 2024
opensource
How Is LLaMa.cpp Possible?
finbarr.ca/how-is-llama-cpp-possible/
16 sats
2 comments
@hn
15 Aug 2023
tech
[2406.08478] What If We Recaption Billions of Web Images with LLaMA-3?
arxiv.org/abs/2406.08478
21 sats
0 comments
@Rsync25
13 Jun 2024
alter_native
Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch
github.com/johnma2006/mamba-minimal
15 sats
1 comment
@hn
20 Dec 2023
tech
With 10x growth since 2023, Llama is the leading engine of AI innovation
ai.meta.com/blog/llama-usage-doubled-may-through-july-2024/
21 sats
0 comments
@Rsync25
29 Aug 2024
alter_native
Hardware Acceleration of LLMs: A comprehensive survey and comparison
arxiv.org/abs/2409.03384
21 sats
0 comments
@hn
7 Sep 2024
tech
Episode 120: Exploring SWE-bench Verified
56 sats
0 comments
@AtlantisPleb
13 Aug 2024
openagents
Apple just released an interesting diffusion based coding language model
9to5mac.com/2025/07/04/apple-just-released-a-weirdly-interesting-coding-language-model/
131 sats
1 comment
@carter
8 Jul
AI
Performance of the Python 3.14 tail-call interpreter
blog.nelhage.com/post/cpython-tail-call/
10 sats
0 comments
@hn
10 Mar
tech
Apple collaborates with NVIDIA to research faster LLM performance - 9to5Mac
9to5mac.com/2024/12/18/apple-collaborates-with-nvidia-to-research-faster-llm-performance/
14 sats
1 comment
@Rsync25
19 Dec 2024
tech
Qwen3-235B-A22B-2507
xcancel.com/Alibaba_Qwen/status/1947344511988076547
218 sats
0 comments
@m0wer
24 Jul
AI
Talk-Llama
github.com/ggerganov/whisper.cpp/tree/master/examples/talk-llama
20 sats
1 comment
@hn
2 Nov 2023
tech