Does anyone have a good guide to share for how to self-host one of these models ...

Does anyone have a good guide to share for how to self-host one of these models and put it behind an API? I’d like to tinker with building a chatbot on my home lab server, so I guess it would need to be runnable on a VM with a few GB of RAM and a couple of cores. Or is that not possible with these kinds of models yet?