Hacker News Logo

Offline

dayweek

NanoGPT Slowrun: Language Modeling with Limited Data, Infinite Compute

83 points|qlabs.sh|
sdpmas|4hrs