Disagree. Even going by your example, AlphaGo uses many iterations of a "dumb" model in order to achieve incredible performance. If it had to single shot the solution with a model 100x bigger, it would perform worse. All that matters is the frontier of intelligence vs cost, and larger foundation models aren't necessarily going to push that frontier forward. AlphaCode hints at that.