Comment by sneak
6 days ago
THANK YOU.
I was a 3-4x programmer before. Now I’m a 9-15x programmer when wrangling LLMs.
This is a sea change and it’s already into “incredible” territory and shows no signs of slowing down.
> Think of anything you wanted to build but didn’t. You tried to home in on some first steps. If you’d been in the limerent phase of a new programming language, you’d have started writing. But you weren’t, so you put it off, for a day, a year, or your whole career.
I have been banging out little projects that I have wanted to exist for years but always had on the back burner. Write a detailed readme and ask the agent to interrogate you about the missing parts of the spec then update the README. Then have it make a TODO and start implementing. Give it another code base for style guide.
I’ve made more good and useful and working code in the last month than I have in the last two years.
That’s nothing, I was a 4X programmer and now I’m a 500x programmer!
I don’t just run one agent, I run all of them!
My time to close tickets is measured in minutes!
I don’t even review code, I have a different agent review it for me!
And to make sure that agent doesn’t make mistakes I have a different agent review that agents work!
Why would you do this and not just read the code yourself?
5 replies →
I can’t tell if you’re being sarcastic or not, but if you are, the real world is not far behind. I can imagine a world where a mixture of AI agents (some doing hypercritical code review) can return you tested and idiomatic PRs faster than you can describe the new architecture in issues.
I think a lot of people are unfamiliar with the (expensive) SOTA.
Lol omg I guess your original comment was NOT sarcastic!?
I'm not sure about giving specific metrics or kpis of efficiency or performance
It definitely feels different to develop using LLMs, especially things from scratch. At this point, you can't just have the LLM do everything. Sooner or later you need to start intervening more often, and as the complexity of the project grows, so does the attention you need to give to guiding the LLM. At that point the main gains are mostly in typing and quickly looking some things up, which are still really nice gains
I hate how the discourse around LLM-assisted programing is so polarized. It's either detractors saying it's "a fad that's useless and going nowhere, wasting billions of megawatts every year" or it's true believers calling it "the most miraculous sea change technological advancement in my lifetime" or "more important than fire and electricity[1]." There just doesn't seem to be any room in the middle.
I tried out Copilot a few months back to see what all the fuss was about and so that I could credibly engage with discussions having actually used the technology. I'd rate it as "kind of neat-o" but not earth shattering. It was like the first time I used an IDE with auto-complete. Oh, cool, nice feature. Would I pay monthly for it? No way. Would I integrate it into my development workflow if it were free? Maybe, I guess? Probably wouldn't bother unless it came literally set up for me out of the box like autocomplete does nowadays.
Don't get me wrong--it's cool technology. Well done, AI people. Is it "the 2nd most important thing to happen over the course of my career" as OP wrote? Come on, let's come down to earth a little.
1: https://www.cnbc.com/2018/02/01/google-ceo-sundar-pichai-ai-...
I don't know. I think 9-12 months ago I'd agree with you. But I feel like the last 6 months my productivity has vastly improved. Not only that, it's also brought back a little bit of passion for the field.
It's easy to come up with some good ideas for new project, but then not want to do a lot of the garbage work related to the project. I offload all that shit to the LLM now.
Seriously, the LLMs have increased my productivity 2-4x.
Copilot is a bad yardstick. The article literally addresses exactly this. It’s not just “cool technology”, that’s the point. It enables things that were previously impossible.
I spent $600 on claude via cursor last month and it was easily worth 2-3x that.
Since the "state of the art" seems to change every week, what's a good way to try out the current "state of the art", without spending $600? I'd love to give it a shot and be proven wrong, but I'm not going to spend 1/2 a mortgage payment on a trial.
EDIT: Looks like the "Cursor" thing has a free trial. Might start there.
1 reply →
> I was a 3-4x programmer before. Now I’m a 9-15x programmer
What the fuck does this mean?
It means cranking out hello world even faster i guess. I wonder how complex all these projects are people are proud to have completed with the help of AI.
I don't use AI to crank out complex parts of projects -- I use to crank out the tedious straight forward stuff that takes a lot of time that is necessary but low-value. Then I'm freed up to work on the hard and interesting stuff.
It can honestly do a lot of complex stuff. But sometimes you have to guide it there.
Nerds got taken aside and talked to about how it's not nice or cool to brag about IQ score so they invented a new artificial metric to brag about.
It depends on the value of x. I think it's safe to assume x <= 0.75, else they'd contribute negatively to their teams (happens from time to time, but let's be generous). Previously they'd be anywhere from a 0/10 to 3/10 programmer, and now they get up to 9/10 on a good day but sometimes are a net negative, as low as -2.25/10 on a bad day. I imagine that happens when tired or distracted and unable to adequately police LLM output.
It’s a riff on the “10x programmer” concept. People who haven’t worked with 10x programmers tend to not believe they exist.
I’m nowhere near that, but even unaided I’m quite a bit faster than most people I’ve hired or worked with. With LLMs my high quality output has easily tripled.
Writing code may be easier than reading it - but reading it is FASTER than writing it. And that’s what matters.
0x3 and 0x15 is the same value.