Comment by aspenmartin

32 minutes ago

A really sincere piece of advice that I really hope you take to heart: everyone who disagrees with you is not simply beneath your genius. I am not a PM (yet that is also quite insulting to some very competent and technical PM's I have worked with), I have an actual math degree, alongside a physics degree and a PhD in astrophysics from a strong department; I have worked in companies both at the MAANG scale and companies as small as 20 people for the last decade. It feels gross to have to type this but evidently you seem blocked from considering other viewpoints because you think I am a "vibe coding PM". It's ok if you want to cling to this as a comfort but just know it's a troubling way of going through life and also happens to be leading you astray in this particular case.

I don't see really any hard source at all here from you except anecdotes that you seem to hold in very high regard. I do see an incredible amount of condescension and chest pounding about what is ultimately a very technical and...ahem...mathematical topic. I don't know about you but I don't really see many conference paper reviews that start with "I am not here to enlighten every vibe coding PM that shows up". I am sure you would agree with me.

> But for the sake of bringing you closer, the "agentic" code is often very inferior, implementing happy paths or just bluntly exposing secrets in clear texts, etc. Probably a consequence of it being trained on, as you put it "p50 engineering code".

I do appreciate this tiny delicious gift of "bringing me closer" because it (1) answers my question about "inherent" properties of agentic systems by giving anecdotal examples of existing systems, (2) completely misunderstands how agentic coding models are trained. Human code traces are a bootstrap to an RL with verifiable rewards stage. Not having the same "you are too beneath me to explain my wrong opinions" attitude, I will genuinely explain a bit because this isn't as trivial and obvious as you make it sound, nor is it a giant pissing contest. Likely the most important property of coding agents that has resulted in their existing and future success is that they are not limited by the quality of human training data. Seems to be a very common misconception, but this is, like you say, just math:

- Agentic coding models like Claude go through several complex training stages

- Pretraining which is kind of a compression step and gives them semantic understanding and a bit more

- Supervised fine tuning which gives them some task specific performance (this is where human traces and verified synthetic traces are used)

- Alignment to make them not give you meth recipes and to behave in the way you want agents to behave

- Reinforcement learning with verifiable rewards (RLVR): then they go forth and solve open ended questions. RLVR is not new mathematics, we know what happens when you take RL with good rewards and throw a bunch of compute at it, we've known that for decades now. This is where the "superhuman" performance comes in, it's not some "vibe coding PM" that's giving you an empty promise, it is the math that you and I so highly revere that promises you this.

> Running my own company and been paying the LLM-Shit-Generators for my whole team for a long time, in the hope they would bring the advertised benefits. Guess what - for serious use-cases, they bring shit and more shit.

This sounds like the experience I would mostly expect from a small company adopting Claude, it is not magic nor is it at the point where you can blindly trust it to not mess something up. It will waste your time. I find it kind of doubtful it has not given you any benefits, but I'm not sitting where you're sitting so I can't refute your experience. People talk resentfully about "advertised benefits" but then never cite what advertised benefits they interpreted these systems as having. Do you have like a quote or something that you can point at?

> Oh yeah obviously not, I mean, its not like understanding software development would help you understand how LLMs are not similar to a "p50 engineer" at all:). I'd take the latter over the former every time.

Maybe I misinterpreted you: I found you telling me to "read a book" to be more of a dismissive condescending comment but maybe you mean it sincerely in which case, sure I will continue to read programming books and following the published work in the field as I have done for years now.

> Well for one, LLMs are not humans, but it should be obvious to even to most cretinous of the e/acc crowd. It's not like they can think in abstract terms or come up with completely new concepts. But then again, don't mind me - if you can live with below average AI slop - go for it.

I do agree with you that LLMs are not humans but when you say this is obvious and then don't back it up, that is really not convincing. I think you overestimate the capability of human beings and underestimate the asymptotic capabilities of these systems. Their performance improvements are predictable and these predictions continue to hold. It seems the burden of proof is on you to explain why we should expect some sort of fundamental limit to these capabilities and where those fundamental limits would arise. I'm not aware of very many.

1 comment

aspenmartin

hansmayer 13 minutes ago

> have an actual math degree, alongside a physics degree and a PhD in astrophysics from a strong department

Good for you, I suppose, but all it tells me is that you have probably not developed software professionally - after all, PhDs in astrophysics "from a strong department" rarely end up in commercial software development...

> This sounds like the experience I would mostly expect from a small company adopting Claude

Who said it was a small company? You're making too many assumptions buddy :)

> will genuinely explain a bit because this isn't as trivial and obvious as you make it sound

It is literally the same technology developed in the 1940s mate, adding more GPUs will not magically make it become a god-in-the-box. How fucking innovative can you still claim it to be?

> I think you overestimate the capability of human beings and underestimate the asymptotic capabilities of these systems

Right, remember when LLMs constructed the rockets and modules for landing on the moon, using practically just the logarithmic tables? Or when they invented the vaccine? How about X-rays? Cars? Aeroplanes? You don't? Oh right, me neither! We must be downplaying their nonexistent "capabilities". And the use of word "asymptotic" - is absolutely not conveying the meaning you think it does.

> Do you have like a quote or something that you can point at?

Well, how about the CEOs of companies claiming to be worth 1T and upwards, stating that their products have almost superhuman intelligence? PhDs in the pocket etc?