Comment by davely
3 months ago
I think we need to wait for someone to convert it into a GGUF file format.
However, once that happens, you can run it (and any GGUF model) from Hugging Face![0]
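(A minimal sketch of what that looks like, assuming you have ollama installed; the repo path is a placeholder for whichever GGUF repo you end up using:

    ollama run hf.co/<username>/<model-repo-GGUF>

ollama pulls the GGUF straight from the Hugging Face repo and runs it.)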
So this?
https://huggingface.co/brittlewis12/s1-32B-GGUF
I ran it; so far it seems like a pretty good model, especially for something running locally.
oh god, this is terrible!
I just said "Hello!" and it went off the rails.
Why? How? What happened? Can you add a sample prompt with the output?
You can load the safetensors with ollama, you just have to provide a Modelfile (or wait for someone to do it for you); a sketch is below. In theory it will also quantize the model for you, since I guess most people can't load a 129 GB model...
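A minimal sketch, assuming the downloaded safetensors weights sit in ./s1-32B (the directory, model name, and quant level are placeholders):

    # Modelfile: point FROM at the directory holding the safetensors weights
    FROM ./s1-32B

Then create and run it, passing --quantize so ollama quantizes during import instead of loading the full-precision weights:

    ollama create s1-32b --quantize q4_K_M -f Modelfile
    ollama run s1-32b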