▲ 20 r/mlscaling
Autonomous AI research for nanogpt speedrun [Scaling experiments compute to 14k GPU-hours; human SoTA surpassed but lack of novel ideas]
primeintellect.ai | u/StartledWatermelon, 7 days ago
Paper: https://arxiv.org/pdf/2604.24827
Interactive demo: https://01.me/research/ikp/
Visual abstract:
Estimates. Note how consistently the pricing difference tracks the estimated parameter difference across models from the same vendor.
Non-speculative results (accuracy per difficulty tier):