It is a thread. You may have only seen the first tweet because Twitter is a user-hostile trash fire.
βThe lesson: on a fundamental level, solutions to these games are low-dimensional. No matter how hard you hit them with from-scratch training, tiny models will work about as well as big ones. Why? Because there's just not that many bits to learn.β
βThe lesson: on a fundamental level, solutions to these games are low-dimensional. No matter how hard you hit them with from-scratch training, tiny models will work about as well as big ones. Why? Because there's just not that many bits to learn.β
https://unrollnow.com/status/1925795730150527191