Comment by rurban
1 day ago
Of course not. Users love the chatbot. It's fast, and easier to use than manually searching for answers or piecing together reports and graphs.
There is no latency, because the inference is done locally, on a server at the customer's site with a big GPU.
> There is no latency
Every chat bot I was ever forced to use has built-in latency, together with an animated "…" to simulate a real user typing. It’s the worst of all worlds.
> to simulate a real user typing
The models return a realtime stream of tokens.
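For illustration, a minimal sketch of what consuming such a stream looks like, assuming a locally hosted OpenAI-compatible endpoint (e.g. a llama.cpp or vLLM server); the base URL, model name, and prompt are placeholders, not our actual setup:

```python
from openai import OpenAI

# Point the client at a local, OpenAI-compatible inference server
# (placeholder URL and model name; adjust to your own deployment).
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed-locally")

stream = client.chat.completions.create(
    model="local-model",
    messages=[{"role": "user", "content": "Summarize yesterday's sales report."}],
    stream=True,  # tokens are pushed to the client as soon as they are generated
)

# Print each token fragment the moment it arrives -- no artificial typing delay.
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
print()
```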
This was already the case before LLMs became a thing. It is still the case for no-intelligence, step-by-step bots.
Because they are all using some cloud service and an external LLM for that. We do not.
We sell our users a strong server where they have all their data and all their services. The LLM is local and trained by us.