Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yeah I'm currently using Gemini 2 Flash (exp) free quota for a premium hosted model, it's a surprisingly great model, IMO Google has caught up with the leaders with their latest experimental models. I've also tested Nova's models, which are pretty high quality and exception value (lite/micro) for their performance.

Also worth shouting out you can get Meta's latest llama-3.3:70b (comparable to llama3.1:405b but must faster and cheaper) within GroqCloud's free quotas running at an impressive 276 tok/s.



Groq limits context window to 8192 is that your experience too




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: