

12 days ago
heavily depends on the model and quantization level
pick the model you want on this website and it’ll give you some specs that are likely to run it
any/most distros will do, especially if you run it on Docker
if you’re going with intel cards (best $ per GB VRAM right now), you could get a decent machine under $3k
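
to put rough numbers on the "model and quantization level" part, here's a back-of-the-envelope sketch. it only counts weight memory (parameters × bits per weight) and pads it with a flat overhead factor for KV cache and runtime buffers. the `estimate_vram_gb` name and the 1.2 overhead are my own assumptions, not something from a specific tool, so treat the output as a ballpark:

```python
def estimate_vram_gb(params_billion: float, bits_per_weight: float,
                     overhead: float = 1.2) -> float:
    """Rough VRAM estimate in GB (1 GB = 1e9 bytes).

    overhead is an assumed fudge factor for KV cache and runtime
    buffers; real usage varies with context length and backend.
    """
    weight_bytes = params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead / 1e9


if __name__ == "__main__":
    # Compare the same hypothetical 7B model at common quantization levels.
    for bits in (16, 8, 4):
        print(f"7B @ {bits}-bit: ~{estimate_vram_gb(7, bits):.1f} GB")
```

halving the bits per weight roughly halves the memory needed, which is why the quantization level matters about as much as the model size when you're picking a card.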




crosspost it to
c/actually_infuriating