Comment by kuerbel
10 hours ago
I work in infrastructure (backups, networking etc) and no longer in software. I just don't see llms being that useful right now. If I have a problem and ask an LLM the answer is either fabricated or useless, rarely does it know what it's talking about. And yes I know how to describe the problem so that it has a chance to give an useful answer.
Also even with agents, you can't just try and error your way out of some (most) of the problems I encounter without doing harm if the solution fails.
Might be different if used for infrastructure as code or ansible or some such. That I can see.
Well Coding agents are being tackled. Infrastructure agents that would read your host event logs, device configuration, ilo, etc, etc - that is probably the missing piece.
Having a chat with chatgpt may give you clues or ideas when you have gone throught your own checklist of what could have went wrong, but can go only as far.
Agent on the other side will decompile .dll to find out issues if needed to go deep enought.
Providing access to the data is easy. It is just an MCP or equivalent, and coding such CRUD is cheap now.
Applying the actions is unsolved. Unless you YOLO the LLMs, taking stateful actions automatically requires a lot of protective infrastructure, solid testing infra etc.
It’s all just more code, but a “create me a shopping website” LLM is likely not going to be doing the infrastructure level thinking required to handle it for now.
Might be but I just can't imagine a customer being fine with a loose cannon agent in their environment. E.g. coding agents are ignoring instructions. Who is to say that Claudes solution to a, say, slow backup isn't deleting the backup?
Imagine an agent shadowing all your terminals, providing ideas and asking to run commands that will let it verify the hypotheses it comes up with, while at the same time doing research on vendor docs, etc...
Quite safe, and already a force multiplier - this would be a harness. Maybe have it be able to write to a shadow system with similar (ideally same) hardware to verify it's hypothesis on how the system works, etc...