Comment by KronisLV
1 day ago
> On a side note, any self hosted model I can get for my PC? I have 96 GB of RAM.
Try the 8 bit quantized version (UD-Q8_K_X) of Qwen 3.6 35B A3B by Unsloth: https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF
Some people also like the new Gemma 4 26B A4B model: https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF
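One easy way to try either is llama.cpp's `llama-server`, which can pull a GGUF straight from Hugging Face with the `-hf` flag. A minimal sketch (the repo name is taken from the link above; the context size is just an example you'd tune to your RAM):

```shell
# Download the GGUF from Hugging Face and serve an OpenAI-compatible API.
# -c sets the context window; larger values grow the KV cache in RAM.
llama-server -hf unsloth/gemma-4-26B-A4B-it-GGUF -c 32768 --port 8080
```

You can then point any OpenAI-compatible client at http://localhost:8080.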
Either should leave plenty of RAM for OS processes and for the KV cache at a bigger context size.
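To see how much room the KV cache actually needs, you can estimate it with the standard transformer formula: two tensors (K and V) per layer, each `n_kv_heads × ctx_len × head_dim`. The model parameters below are hypothetical, just to show the shape of the calculation:

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, ctx_len, bytes_per_elem=2):
    # 2 tensors (K and V) per layer, each [n_kv_heads, ctx_len, head_dim];
    # bytes_per_elem=2 assumes an fp16/bf16 cache.
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elem

# Hypothetical GQA model: 48 layers, 4 KV heads, head_dim 128, 32k context.
gib = kv_cache_bytes(48, 4, 128, 32768) / 2**30
print(f"{gib:.1f} GiB")  # → 3.0 GiB
```

So even a fairly long context costs only a few GiB on top of the model weights, which is why 96 GB of RAM leaves comfortable headroom.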
I'd guess that MoE models will run better, since only a few billion parameters are active per token, though there are also dense versions you can try if you want.
Performance and quality will probably both be worse than cloud models, but it's a nice start!