
Have you tried implementing your ternary transformers on AVX(-512)? I think it fits relatively well with the hardware philosophy, and being able to run inference without a GPU would be a big plus.


Our CPU implementation for x86/AMD64 uses AVX-512 or AVX2 instructions where possible. We're also experimenting with ARM support using NEON.



