Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's why robots make so much traffic now. Those other companies are trying to get data.

Google theoretically has reddit access. I wonder if they have sort of an internet archive - data unpolutted by LLMs

On a side note, funny how all the companies seem to train on book archivr which they just downloaded from the internet



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: