u/StartledWatermelon

Autonomous AI research for nanogpt speedrun [Scaling experiments compute to 14k GPU-hours; human SoTA surpassed but lack of novel ideas]

▲ 20 r/mlscaling

primeintellect.ai

u/StartledWatermelon — 7 days ago

▲ 3 r/mlscaling

MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI, Lyu et al. 2026 [Extensive breadth; focus on solutions that generalize well]

u/StartledWatermelon — 10 days ago

▲ 24 r/mlscaling

Paper: https://arxiv.org/pdf/2604.24827

Interactive demo: https://01.me/research/ikp/

Visual abstract:

https://preview.redd.it/k3ld3764x4yg1.png?width=1267&format=png&auto=webp&s=d909c1577ba2750067523013bbe06d60c72f8fdb

Estimates. Note how consistent the relationship between pricing and estimated parameter count is across models from the same vendor:

https://preview.redd.it/0m1jyczmx4yg1.png?width=783&format=png&auto=webp&s=a5e6ce5f5d4660abd58c82c875cba9db8e5eeb63

Non-speculative results (accuracy per difficulty tier):

https://preview.redd.it/hnyblgfmy4yg1.png?width=1053&format=png&auto=webp&s=f43aaf951730e5bb56df054980f91cb5996acbcf

https://preview.redd.it/plyylygwy4yg1.png?width=983&format=png&auto=webp&s=0e37cdffa7ce9db1624dae7c23ce4faa94e15d4c

u/StartledWatermelon — 23 days ago