← Back to context

Comment by delaminator

4 months ago

Your assessment of Claude simply isn’t true.

Or Stackoverflow is really good.

I’m producing multiple projects per week that are weeks of work each.

Would you mind sharing some of these projects?

I've found Claude's usefulness is highly variable, though somewhat predictable. It can write `jq` filters flawlessly every time, whereas I would normally spend 30 minutes scanning docs because nobody memorizes `jq` syntax. And it can comb through server logs in every pod of my k8s clusters extremely fast. But it often struggles making quality code changes in a large codebase, or writing good documentation that isn't just an English translation of the code it's documenting.

  • It is always "I'm producing 300 projects in a nanosecond" but it's almost never about sharing or actually deploying these ;)

    • The problem I had that the larger your project gets, the more mistakes Claude makes. I (not a parent commenter) started with a basic CRUD web app and was blown away by how detailed it was, new CSS, good error handling, good selection and use of libraries, it could even write the terminal commands for package management and building. As the project grew to something larger Claude started forgetting that some code already existed in the project and started repeating itself, and worse still when I asked for new features it would pick a copy at random leaving them out of sync with eachother. Moving forward I've been alternating between writing stuff with AI, then rewriting it myself.

      7 replies →

    • If you haunt the pull requests of projects you use I bet you'll find there's a new species of PR:

      > I'm not an expert in this language or this project but I used AI to add a feature and I think its pretty good. Do you want to use it?

      I find myself writing these and bumping into others doing the same thing. It's exciting, projects that were stagnant are getting new attention.

      I understand that a maintainer may not want to take responsibility for new features of this sort, but its easier than ever to fork the project and merge them yourself.

      I noticed this most recently in https://github.com/andyk/ht/pulls which has two open (one draft) PRs of that sort, plus several closed ones.

      Issues that have been stale for years are getting traction, and if you look at the commit messages, it's AI tooling doing the work.

      People feel more capable to attempt contributions which they'd otherwise have to wait for a specialist for. We do need to be careful not to overwhelm the specialists with such things, as some of them are of low quality, but on the whole it's a really good thing.

      If you're not noticing it, I suggests hanging out in places where people actually share code, rather than here where we often instead brag about unshared code.

      3 replies →

    • At this point my prior is that all these 300/ns projects are some kind of internal tools, with very narrow scope and many just for a one-off use.

      Which is also fine and great and very useful and I am also making those, but it probably does not generalize to projects that require higher quality standards and actual maintenance.

      2 replies →

    • There's a massive incentive not to share them. If I wrote a project using AI I'd be reluctant to publish it at all because of the backlash I've seen people get for it.

      1 reply →

    • Building something is easy

      Building something that works ? Not so easy

      Pushing that thing in production ? That the hardest part

  • Claude has taught me so much about how to use jq better. And really, way more efficient ways of using the command line in general. It's great. Ironically, the more I learn the less I want to ask it to do things.

    • In an ideal world we function in exactly this way - using LLMs to bootstrap our skill/knowledge improvement journeys.

    • Yeah, if you pay attention to its output you can pick up little tips and tricks all over the place.

  • Not the OP you're replying to, but I've put together quite a few projects using only LLMs, no hand crafted code anywhere (I couldn't do it!)

    https://dnbfamily.com

    https://eventme.app

    https://blazingbanana.com

    https://play.google.com/store/apps/details?id=com.blazingban...

    https://play.google.com/store/apps/details?id=com.blazingban...

    https://play.google.com/store/apps/details?id=com.blazingban...

    Are they perfect? No probably not, but I wouldn't have been able to make any of these without LLMs. The last app was originally built with GPT-3.5.

    There is a whole host of other non-public projects I've built with LLMs, these are just a few of the public ones.

    • Maybe the most depressing part of all this is if people start thinking they would not have been able to do things without the LLM. Of course they would have, it's not like LLMs can do anything that you cannot. Maybe it would have taken more time at least the first time and you would have learned a few things in the process.

      7 replies →

  • It’s not just good for small code bases. In the last six months I’ve built a collaborative word processor with its own editor engine and canvas renderer using Claude, mostly Opus. It’s practically a mini Google Docs, but with better document history and an AI agent built in. I could never have built this in 6 months by myself without Claude Code.

    https://revise.io

    I think if you stick with a project for a while, keep code organized well, and most importantly prioritize having an excellent test suite, you can go very far with these tools. I am still developing this at a high pace every single day using these tools. It’s night and day to me, and I say that as someone who solo founded and was acquired once before, 10 years ago.

  • https://github.com/lawless-m

    You can see by Contributors which ones Claude has done.

    I have no idea if the code is any good, I’ve never looked at it and I have no idea how to code in Rust or Racket or Erlang anyway.

    • > I have no idea if the code is any good, I’ve never looked at it and I have no idea how to code in Rust or Racket or Erlang anyway.

      In that case, are you really producing multiple projects per week? If you've never looked at the code, have you verified that they work?

      1 reply →

  • The former tasks are directly from the training material, directly embedded into the model. For the latter task, it needs a context window and intelligence.

  • They really should have been supplying at least a week worth of readymade "projects" to every freelance AI promoter out there to demonstrate x9000 AI productivity gains for the skeptics.

    Because vibing the air about those gains without any evidence looks too shilly.

I'm just as much of an avid llm code generator fan as you may be but I do wonder about the practicality of spending time making projects anymore.

Why build them if other can just generate them too, where is the value of making so many projects?

If the value is in who can sell it the best to people who can't generate it, isn't it just a matter of time before someone else will generate one and they may become better than you at selling it?

  • > Why build them if other can just generate them too, where is the value of making so many projects?

    No offence to anyone but these generated projects are nothing ground-breaking. As soon as you venture outside the usual CRUD apps where novelty and serious engineering is necessary, the value proposition of LLMs drops considerably.

    For example, I'm exploring a novel design for a microkernel, and I have no need for machine generated boilerplate, as most of the hard work is not implementing yet another JSON API boilerplate, but it's thinking very hard with pen and paper about something few have thought before, and even fewer LLMs have been trained on, and have no intelligence to ponder upon the material.

    To be fair, even for the most dumb side-projects, like the notes app I wrote for myself, there is still a joy in doing things by hand, because I do not care about shipping early and getting VC money.

    • Weird, because I've created a webcam app that does segmentation so they can delete the background and put a new background in I mean, I suppose that's not groundbreaking. But it's not just reading and writing to a database.

      I've just added a ATA over Ethernet server in Rust, I thought of doing it in the car on the way home and an hour later I've got a working version.

      I type this comment using a voice to text system I built, admittedly it uses Whisper as the transcriber but I've turned it into a personal assistant.

      I make stuff every day I just wouldn't bother to make if I had to do it myself. and on top of that it does configuration. So I've had it build full wireguard configs that is taking on our pay addresses so that different destinations cause different routing. I don't know how to do that off the top of my head. I'm not going to spend weeks trying to find out how it works. It took me an evening of prompting.

      2 replies →

  • The value is that we need a lot more software and now, because building software has gotten so much less time consuming, you can sell software to people that could/would not have paid for it previously at a different price point.

    • We don’t need more software, we need the right software implemented better. That’s not something LLMs can possibly give us because they’re fucking pachinko machines.

      Here’s a hint: Nobody should ever write a CRUD app, because nobody should ever have to write a CRUD app; that’s something that can be generated fully and deterministically (i.e. by a set of locally-executable heuristics, not a goddamn ocean-boiling LLM) from a sufficiently detailed model of the data involved.

      In the 1970s you could wire up an OS-level forms library to your database schema and then serve literally thousands of users from a system less powerful than the CPU in modern peripheral or storage controller. And in less RAM too.

      People need to take a look at what was done before in order to truly have a proper degree of shame about how things are being done now.

      4 replies →

Sure but these are likely just variations of existing things. And yet the quality is still behind the original