The Hierarchical Reasoning Model (HRM) is a novel recurrent architecture that attains significant computational depth while maintaining both training stability and efficiency.
TL;DR: a tiny model from a Singaporean startup that beats LLMs like Claude on some tasks by using a new architecture.
Also see #1066465 for the paper