@BetaDoggo_

BetaDoggo_@lemmy.world · 1 month ago

Probably just a reporting bug. Comments stayed consistent.

BetaDoggo_@lemmy.world · 11 months ago

Koboldcpp should allow you to run much larger models with a little bit of ram offloading. There’s a fork that supports rocm for AMD cards: https://github.com/YellowRoseCx/koboldcpp-rocm

Make sure to use quantized models for the best performace, q4k_M being the standard.

BetaDoggo_@lemmy.world · 1 year ago

I’ve used the tplink ones that they’re using and they’ve been pretty solid. I can’t say how they’d fare in a 24/7 setup though since they’re not really intended for that.