Have you tried implementing your ternary transformers on AVX(-512)? I think it f... | Hacker News


bjornsing on Sept 10, 2024 | on: Launch HN: Deepsilicon (YC S24) – Software and har...

Have you tried implementing your ternary transformers on AVX(-512)? I think it fits relatively well with the hardware philosophy, and being able to run inference without a GPU would be a big plus.

areddyyt on Sept 12, 2024

Our CPU implementation for x86/AMD64 uses AVX-512 or AVX2 instructions where possible. We're experimenting with support for ARM with NEON.
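
For readers wondering why ternary weights might suit SIMD hardware, here is a minimal scalar sketch in Python. It is an illustration only, not Deepsilicon's implementation: the function names `pack_ternary` and `ternary_dot` and the two-bitmask encoding are assumptions made for this example. The idea is that with weights restricted to -1, 0, and +1, a dot product needs no multiplications: each activation is added, subtracted, or skipped, which maps naturally onto per-lane masks of the kind AVX-512 provides.

```python
from typing import List, Tuple


def pack_ternary(weights: List[int]) -> Tuple[int, int]:
    """Encode weights in {-1, 0, +1} as two bitmasks (hypothetical layout).

    Bit i of `plus` is set when weights[i] == +1,
    bit i of `minus` is set when weights[i] == -1.
    """
    plus = 0
    minus = 0
    for i, w in enumerate(weights):
        if w == 1:
            plus |= 1 << i
        elif w == -1:
            minus |= 1 << i
        elif w != 0:
            raise ValueError(f"weight {w} at index {i} is not ternary")
    return plus, minus


def ternary_dot(plus: int, minus: int, activations: List[int]) -> int:
    """Dot product using only additions and subtractions.

    A SIMD version would process many lanes per step, using the
    masks to select which lanes are added and which are subtracted.
    """
    acc = 0
    for i, a in enumerate(activations):
        if (plus >> i) & 1:
            acc += a      # weight is +1
        elif (minus >> i) & 1:
            acc -= a      # weight is -1
        # weight is 0: the activation is skipped
    return acc
```

For example, packing the weights `[1, -1, 0, 1]` and calling `ternary_dot` with the activations `[3, 5, 7, 4]` computes 3 - 5 + 4 without a single multiply.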
