@anon
sign up
@anon
sign up
pull down to refresh
What Every User Should Know About Mixed Precision Training in PyTorch
pytorch.org/blog/what-every-user-should-know-about-mixed-precision-training-in-pytorch/
10 sats
\
1 comment
\
@hn
15 Mar 2024
tech
related
Implementing Neural Networks in TensorFlow (and PyTorch)
towardsdatascience.com/implementing-neural-networks-in-tensorflow-and-pytorch-3c1f097e412a
55 sats
\
1 comment
\
@ch0k1
9 Jul 2024
devs
The Math Behind Batch Normalization
towardsdatascience.com/the-math-behind-batch-normalization-90ebbc0b1b0b
100 sats
\
0 comments
\
@ch0k1
9 May 2024
science
Google unveils Ironwood their seventh-generation Tensor Processing Unit (TPU)
blog.google/products/google-cloud/ironwood-tpu-age-of-inference/
55 sats
\
0 comments
\
@carter
9 Apr
AI
Deep Learning with Python
62 sats
\
0 comments
\
@devJack
30 Dec 2024
BooksAndArticles
OpenAI offers free GPT-4o Mini fine-tuning to counter Meta’s Llama 3.1 release
venturebeat.com/ai/ai-arms-race-escalates-openai-offers-free-gpt-4o-mini-fine-tuning-to-counter-metas-llama-3-1-release/
21 sats
\
0 comments
\
@ch0k1
25 Jul 2024
news
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity
arxiv.org/pdf/2505.21411
100 sats
\
1 comment
\
@carter
3 Jul
AI
PixNerd: Pixel Neural Field Diffusion
arxiv.org/abs/2507.23268
302 sats
\
2 comments
\
@optimism
4 Aug
AI
PyTorch Internals: Ezyang's Blog
blog.ezyang.com/2019/05/pytorch-internals/
10 sats
\
0 comments
\
@hn
22 Mar
tech
Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch
github.com/johnma2006/mamba-minimal
15 sats
\
1 comment
\
@hn
20 Dec 2023
tech
Open-Source Python Library Supports Intervention-Based Research on ML Models
www.marktechpost.com/2024/03/16/researchers-at-stanford-university-introduce-pyvene-an-open-source-python-library-that-supports-intervention-based-research-on-machine-learning-models/
10 sats
\
0 comments
\
@ch0k1
23 Mar 2024
devs
Why PyTorch Gets All the Love
thenewstack.io/why-pytorch-gets-all-the-love/
10 sats
\
0 comments
\
@Rsync25
27 Nov 2024
tech
DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL
pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2
21 sats
\
0 comments
\
@hn
11 Feb
tech
Hill Space: Neural nets that do perfect arithmetic (to 10⁻¹⁶ precision)
hillspace.justindujardin.com/
230 sats
\
0 comments
\
@carter
14 Jul
AI
DeepSeek: Ch Leadership On Cost, True Training Cost, Closed Model Margin Impacts
semianalysis.com/2025/01/31/deepseek-debates/
82 sats
\
0 comments
\
@0xbitcoiner
3 Feb
AI
tinygrad: A simple and powerful neural network framework
tinygrad.org/
10 sats
\
1 comment
\
@premitive1
15 Aug 2023
tech
OpenAI’s Misalignment and Microsoft’s Gain
stratechery.com/2023/openais-misalignment-and-microsofts-gain/
7509 sats
\
11 comments
\
@elvismercury
20 Nov 2023
tech
Hidet: A Deep Learning Compiler for Efficient Model Serving
pytorch.org/blog/introducing-hidet/
110 sats
\
1 comment
\
@hn
28 Apr 2023
tech
OpenAI’s long-awaited GPT-5 model nears release
www.reuters.com/business/retail-consumer/openais-long-awaited-gpt-5-model-nears-release-2025-08-06
176 sats
\
2 comments
\
@Coinsreporter
6 Aug
AI
OpenAI's GPT-5 is a cost cutting exercise
www.theregister.com/2025/08/13/gpt_5_cost_cutting
217 sats
\
1 comment
\
@Coinsreporter
13 Aug
AI
OpenAI’s hunger for data is coming back to bite it
www.technologyreview.com/2023/04/19/1071789/openais-hunger-for-data-is-coming-back-to-bite-it/
50 sats
\
0 comments
\
@shadowymartian
20 Apr 2023
bitcoin
Episode 120: Exploring SWE-bench Verified
56 sats
\
0 comments
\
@AtlantisPleb
13 Aug 2024
openagents
more