Any of you have a self-hosted AI "hub"? (e.g. for LLM, stable-diffusion, ...)

@robber@lemmy.ml · 1 month ago

Any of you have a self-hosted AI "hub"? (e.g. for LLM, stable-diffusion, ...)

@robber@lemmy.ml · 1 month ago

Thanks! Glad to see the 8x7B performing not too bad - I assume that’s a Mistral model? Also, does the CPU significantly affect inference speed in such a setup, do you know?

@Audalin@lemmy.world · 1 month ago

If your CPU isn’t ancient, it’s mostly about memory speed. VRAM is very fast, DDR5 RAM is reasonably fast, swap is slow even on a modern SSD.

8x7B is mixtral, yeah.