
If you don't need full IEEE 754 double precision, the Ozaki scheme (emulation with tensor cores) might do the trick. Limited support for it was recently added to cuBLAS.
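For anyone unfamiliar with the idea, here is a rough sketch of the slicing trick, done on a dot product in plain Python rather than a real GEMM. It assumes 7-bit signed slices (so each would fit in an int8) and a single shared power-of-two scale per vector. The names `split_slices` and `emulated_dot` are made up for illustration, and this is not how cuBLAS implements it.

```python
import math

def split_slices(values, num_slices, bits=7):
    """Split floats into small signed-integer slices sharing one scale.

    Hypothetical helper: returns (slices, exp), where slices[0] holds the
    most significant `bits` bits of each value and each value is
    approximately sum_s slices[s][i] * 2**(exp - bits * (s + 1)).
    """
    max_abs = max(abs(v) for v in values)
    exp = math.frexp(max_abs)[1] if max_abs else 0  # max_abs < 2**exp
    total_bits = num_slices * bits
    mask = (1 << bits) - 1
    slices = [[] for _ in range(num_slices)]
    for v in values:
        # Fixed-point magnitude, truncated toward zero.
        q = int(math.ldexp(abs(v), total_bits - exp))
        sign = -1 if v < 0 else 1
        for s in range(num_slices):
            shift = bits * (num_slices - 1 - s)
            slices[s].append(sign * ((q >> shift) & mask))
    return slices, exp

def emulated_dot(x, y, num_slices=8, bits=7):
    """Dot product built only from exact small-integer dot products."""
    xs, ex = split_slices(x, num_slices, bits)
    ys, ey = split_slices(y, num_slices, bits)
    total = 0
    for s in range(num_slices):
        for t in range(num_slices):
            # This is the part an integer tensor core would compute:
            # int8-sized inputs, exact integer accumulation.
            partial = sum(a * b for a, b in zip(xs[s], ys[t]))
            total += partial << (bits * (2 * num_slices - 2 - s - t))
    # Undo the fixed-point scaling of both inputs.
    return math.ldexp(float(total), ex + ey - 2 * num_slices * bits)

x = [0.1, -0.25, 0.7, 0.333]
y = [0.9, 0.5, -0.125, 0.2]
reference = math.fsum(a * b for a, b in zip(x, y))
for n in (2, 4, 8):
    print(n, abs(emulated_dot(x, y, num_slices=n) - reference))
```

The number of slices is the accuracy knob: fewer slices means fewer low-precision products but a coarser result, which is why this only pays off if you can live with less than full double precision.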

