Comment by slopinthebag

18 hours ago

What kind of software are people building where AI can just one shot tickets? Opus 4.6 and GPT 5.4 regularly fail when dealing with complicated issues for me.

17 comments

slopinthebag

girvo 14 hours ago

GPT 5.4 straight up just dies with broken API responses sometimes, let alone when it struggles with a even moderately complex task.

I still can't get a good mental model for when these things will work well and when they won't. Really does feel like gambling...

withinboredom 18 hours ago

Not just complicated, but even simple ones if the current software is too “new” of a pattern they’ve never seen before or trained on.

slopinthebag 17 hours ago

I dunno if Rust async or native platform API's which have existed for years count as new patterns, but if you throw even a small wrench in the works they really struggle. But that's expected really when you look at what the technology is - it's kind of insane we've even gotten to this point with what amounts to fancy autocomplete.

victorbjorklund 17 hours ago

Of course not all tickets are complex. Last week I had to fix a ticket which was to display the update date on a blog post next to the publish date. Perfect use case for AI to one shot.

thin_carapace 18 hours ago

i dont see anyone sane trusting ai to this degree any time soon, outside of web dev. the chances of this strategy failing are still well above acceptable margins for most software, and in safety critical instances it will be decades before standards allow for such adoption. anyway we are paying pennies on the dollar for compute at the moment - as soon as the gravy train stops rolling, all this intelligence will be out of access for most humans. unless some more efficient generalizable architecture is identified.

heavyset_go 17 hours ago
> as soon as the gravy train stops rolling, all this intelligence will be out of access for most humans. unless some more efficient generalizable architecture is identified.
All Chinese labs have to do to tank the US economy is to release open-weight models that can run on relatively cheap hardware before AI companies see returns.
Maybe that's why AI companies are looking to IPO so soon, gotta cash out and leave retail investors and retirement funds holding the bag.
- PeterStuer 16 hours ago
  
  They could still eliminate relatively cheap hardware.
- g947o 10 hours ago
  
  I mean, they have been doing that for at least a year, and I haven't seen signs of US economy tanking?... You need to find some better arguments
- thin_carapace 16 hours ago
  
  i was under the impression that we were approaching performance bottlenecks both with consumer GPU architecture and with this application of transformer architecture. if my impression is incorrect, then i agree it is feasible for china to tank the US economy that way (unless something else does it first)
  
  1 reply →
m00x 17 hours ago
Several fintechs like Block and Stripe are boasting thousands of AI-generated PRs with little to no human reviews.
Of course it's in the areas where it doesn't matter as much, like experiments, internal tooling, etc, but the CTOs will get greedy.
- slopinthebag 17 hours ago
  
  I don't think anybody is doubting its ability to generate thousands of PR's though. And yes, it's usually in the stuff that should have been automated already regardless of AI or not.
  
  1 reply →
- thin_carapace 17 hours ago
  
  these companies contribute to swathes of the west's financial infrastructure, not quite safety critical but critical enough, insane to involve automation here to this degree
slopinthebag 18 hours ago
Even in webdev it rots your codebase unchecked. Although it's incredibly useful for generating UI components, which makes me a very happy webslopper indeed.
- thin_carapace 16 hours ago
  
  im grateful to have never bothered learning web dev properly, it was enlightening witnessing chat gpt transform my ten second ms paint job into a functional user interface

hrmtst93837 13 hours ago

[dead]