Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

60% probably felt like a lot to Gemini. However, I liked the doomerism and how google was using our data to train its models.

Nonetheless, Gemini 3 failed this test. It failed to start a discussion. Its points were shallow, and too aiesque.



I'm not debating 60% being a lot, it's a factually incorrect statement: markup refers to increase over cost.

Looking at it again it's actually a completely nonsensical sentence that just happens to resemble a sensible statement in a way that would fool most people.

RL is definitely showing some busting seams at this point.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: