@anon
sign up
@anon
sign up
pull down to refresh
Every model learned by gradient descent is approximately a kernel machine (2020)
arxiv.org/abs/2012.00152
100 sats
\
1 comment
\
@hn
25 Feb 2024
tech
related
Linear Representation Transferability: Using Small Models to Steer Larger Ones
arxiv.org/pdf/2506.00653
110 sats
\
0 comments
\
@carter
2 Jul
AI
How a Stubborn Computer Scientist Accidentally Launched The Deep Learning Boom
arstechnica.com/ai/2024/11/how-a-stubborn-computer-scientist-accidentally-launched-the-deep-learning-boom/
245 sats
\
4 comments
\
@0xbitcoiner
11 Nov 2024
based
Zero-Knowledge Proofs of Training for Deep Neural Networks
eprint.iacr.org/2024/162
2026 sats
\
5 comments
\
@0xbitcoiner
5 Feb 2024
crypto
Secrets of DeepSeek AI Model Revealed in Landmark Paper
www.scientificamerican.com/article/secrets-of-chinese-ai-model-deepseek-revealed-in-landmark-paper/
121 sats
\
0 comments
\
@ch0k1
20 Sep
AI
When Gradient Descent Is a Kernel Method
cgad.ski/blog/when-gradient-descent-is-a-kernel-method.html
15 sats
\
1 comment
\
@hn
28 Oct 2023
tech
Kolmogorov-Arnold networks may make neural networks more understandable
www.quantamagazine.org/novel-architecture-makes-neural-networks-more-understandable-20240911/
20 sats
\
0 comments
\
@hn
12 Sep 2024
tech
Nobel in physics given to researchers for pioneering work on machine learning
www.theguardian.com/science/2024/oct/08/nobel-prize-physics-john-hopfield-geoffrey-hinton-machine-learning
108 sats
\
3 comments
\
@south_korea_ln
8 Oct 2024
science
Understanding Machine Learning: From Theory to Algorithms
www.cs.huji.ac.il/~shais/UnderstandingMachineLearning/copy.html
10 sats
\
0 comments
\
@hn
4 Apr
tech
Nobel Prize in Physics Awarded for Machine Learning and Neural Networks
www.nobelprize.org/prizes/physics/2024/summary/
21 sats
\
0 comments
\
@hn
8 Oct 2024
tech
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory
arxiv.org/abs/2310.20360
24 sats
\
1 comment
\
@hn
1 Jan 2024
tech
Advancements in machine learning for machine learning
blog.research.google/2023/12/advancements-in-machine-learning-for.html
162 sats
\
1 comment
\
@hn
16 Dec 2023
tech
DeepSeek: Ch Leadership On Cost, True Training Cost, Closed Model Margin Impacts
semianalysis.com/2025/01/31/deepseek-debates/
82 sats
\
0 comments
\
@0xbitcoiner
3 Feb
AI
Pen and Paper Exercises in Machine Learning (2022)
arxiv.org/abs/2206.13446
10 sats
\
0 comments
\
@hn
21 Mar
tech
The most important machine learning equations: A comprehensive guide
chizkidd.github.io//2025/05/30/machine-learning-key-math-eqns/
120 sats
\
1 comment
\
@hn
28 Aug
tech
Deep Learning Is Not So Mysterious or Different
arxiv.org/abs/2503.02113
10 sats
\
0 comments
\
@hn
17 Mar
tech
Super-learning
newsletter.tomosman.com/p/super-learning
2320 sats
\
9 comments
\
@kr
26 Jan 2024
tech
Achieving 10,000x training data reduction with high-fidelity labels
research.google/blog/achieving-10000x-training-data-reduction-with-high-fidelity-labels/
232 sats
\
3 comments
\
@carter
8 Aug
AI
Diffusion Language Models are Super Data Learners
jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac
152 sats
\
0 comments
\
@carter
11 Aug
AI
Q-learning is not yet scalable
seohong.me/blog/q-learning-is-not-yet-scalable/
10 sats
\
0 comments
\
@hn
15 Jun
tech
OpenAI's new open-source model is basically Phi-5
www.seangoedecke.com/gpt-oss-is-phi-5/
171 sats
\
1 comment
\
@carter
8 Aug
AI
How AI on Microcontrollers Works: Operators and Kernels
danielmangum.com/posts/ai-microcontrollers-operators-kernels/
11 sats
\
0 comments
\
@hn
5 Jul
tech
more