Very cool idea. Interested to see how this progresses.
One question: how worried are you about over-fitting to this particular dataset, i.e. leaning toward memorization instead of generalization? Obviously you hold out a validation set, but since you're meta-optimizing the model by its performance on that validation set, you're still at risk of over-fitting to it.
yes, good point. right now, it's somewhat hard to overfit because the meta-optimization extracts only tiny bits of information per step. but over time, we will switch the validation set to some other random subset of FineWeb, or even to entirely OOD datasets!
The question is not if but when. I hope the project authors acknowledge the problem directly: given enough time, it's not merely a risk but a statistical certainty. So, what's the plan?
At the very least, track it. How will the project maintainers instrument this?
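One cheap way to track it: periodically evaluate on a freshly sampled holdout that the meta-optimization has never seen, and log the gap against the fixed meta-validation loss. A growing gap is the overfitting signal. This is a minimal sketch, not the project's actual code; the `loss_fn` API and all names here are hypothetical.

```python
# Hypothetical sketch of the instrumentation discussed above.
# Assumes a loss_fn(model, batch) interface; all names are illustrative,
# not taken from the project's codebase.

def eval_loss(loss_fn, model, batches):
    """Average loss over a list of batches."""
    return sum(loss_fn(model, b) for b in batches) / len(batches)

def overfit_gap(loss_fn, model, meta_val_batches, fresh_batches):
    """Difference between loss on a freshly sampled holdout and loss on
    the fixed meta-validation set. A growing positive gap indicates the
    meta-optimization has started fitting the validation set itself."""
    meta_loss = eval_loss(loss_fn, model, meta_val_batches)
    fresh_loss = eval_loss(loss_fn, model, fresh_batches)
    return fresh_loss - meta_loss

# Toy demo: a "model" that has perfectly memorized the meta-val batches.
meta_val = [1.0, 2.0, 3.0]
fresh = [1.5, 2.5, 3.5]
memorized = set(meta_val)
loss_fn = lambda model, b: 0.0 if b in model else 1.0

print(overfit_gap(loss_fn, memorized, meta_val, fresh))  # → 1.0
```

Logging this gap each time the meta-optimizer updates would make the drift visible long before it dominates, and rotating `fresh_batches` (as suggested above) keeps the signal honest.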