Comment by janalsncm

3 days ago

I worked on it for a more specialized task (query rewriting). It’s blazing fast.

A lot of inference code is set up for autoregressive decoding now. Diffusion is less mature. Not sure if Ollama or llama.cpp supports it.

4 comments

philipportner 3 days ago

Did you publish anything you could link wrt. query rewriting?

stavros 3 days ago

How was the quality?

janalsncm 3 days ago

Quality was about the same. I will say it was a pain to train since it isn’t as popular and there isn’t out-of-the-box support.

stavros 3 days ago

Interesting, thanks! That's pretty cool though!