u/award_reply

▲ 389 r/DeepSeek

DeepSeek V4 pro effectively reverse-engineered a recently released 100B LLM architecture entirely on its own and then adapted llama.cpp to run it. (in ~10M token and less then $2 )

u/award_reply — 20 days ago