Comment by tealcod 2 months ago
Would it be possible to run something like vLLM or TensorRT-LLM with Tinfoil?

FrasiertheLion 2 months ago
We're already using vLLM as our inference server for our standard models. We can run whatever inference server is needed for custom deployments.
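For anyone unfamiliar with vLLM, here is a minimal sketch of its offline Python API (the model name is just an example, and this shows generic vLLM usage rather than any specific Tinfoil deployment):

  from vllm import LLM, SamplingParams

  # Load a model into vLLM's engine (example model; any compatible HF model works)
  llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")

  # Sampling settings for generation
  params = SamplingParams(temperature=0.7, max_tokens=128)

  # Run a single prompt through the engine and print the completion
  outputs = llm.generate(["Explain confidential computing in one sentence."], params)
  print(outputs[0].outputs[0].text)

vLLM also ships an OpenAI-compatible HTTP server, which is the more common way to run it as a standalone inference backend.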