← Back to context

Comment by km144

6 days ago

Creative works carry meaning through their author. The best art gives you insight into the imaginative mind of another human being—that is central to the experience of art at a fundamental level.

But the machine does not intend anything. Based on the article as I understand it, this product basically does some simulated annealing of the quality of art as judged by an AI to achieve the "best possible story"—again, as judged by an AI.

Maybe I am an outlier or an idiot, but I don't think you can judge every tool by its utility. People say that AI helps them write stories, I ask to what end? AI helps write code, again to what end? Is the story you're writing adding value to the world? Is the software you're writing adding value to the world? These seem like the important questions if AI does indeed become a dominant economic force over the coming decades.

Ah, fair enough. I believe quite strongly that creative works' meaning exists for the reader / audience / user. I don't think interpretation of art is towards an authorial, authoritative truth - rather that it's a lens to view the world through, and change one's perspective on it - so this is where we differ. But I understand your viewpoint.

I do agree that the LLM's idea of achieving the 'best possible story' is defined entirely by its design and prompting, and that is obviously completely ridiculous - not least because appreciating (or enduring) a story is a totally subjective experience.

I do disagree that one needs to ask "to what end?" when talking about writing stories, the same way one shouldn't need to ask "to what end?" about a pencil or a paintbrush. The joy of creating should be in the creation.

Commercial software is absolutely a more nuanced, complex topic - it's so much more intertwined with people's jobs, livelihoods, aeroplanes not falling out of the sky, power grids staying on, etc. That's a different, separate question. I don't think it's fair to equate them.

I think LLMs are the most interesting paintbrush-for-words we've come up with since the typewriter (at least), and that, historically, artists who embrace new technologies that arise in their forms are usually proven to be correct in their embrace of them.

  • I think that is a fair perspective. When I say "to what end" I am mostly implying the "end" of a product for the market. I think writing in particular is always a thing where if you tell people you do it as a hobby, they assume your goal is a published book, not the process itself. Creativity as the end is a wonderful thing, but I just have a feeling AI is going to be more widely adopted to pump out passable (or even arguably "good") content that people will pay money for.

    Again the same thing with writing software, where you can be creative with it and it can enhance the experience. But most people just use AI to help them do their job better—and in an era where many software companies appear to have a net negative effect on society, it's hard to see the good in that.

    • > "but I just have a feeling AI is going to be more widely adopted to pump out passable (or even arguably "good") content"

      Absolutely! And, as you say - the vast majority of books are already written to be passable-enough for publication. I guess it'll be slightly less charming when it's unclear whether a book you're buying has had at least one human believe it is good. Maybe this is already the case on Amazon!

      > "that people will pay money for."

      Haha - authors aren't making much money as it stands. I do really hope that a (much) higher volume of 'slop-work' means audiences value 'good-work' more, as 'good-work' will be harder to seek out, and that as a result of this better revenue models for creators of freely-duplicatable work (like books and music) are forced into creation. That's the best possible outcome. But - I think we agree that material reward isn't a good incentive for the creation of art.

      I'm not wildly concerned about the arts, in this sense - I think it's (over a long enough timespan) a highly meritocratic world. I trust readers / audiences / users. Good work finds its audience and time and floats eventually. And DRM-locked, Kindle-Unlimited-type work will, by design, not be on anybody's shelves in fifty or a hundred years.

      The alternative, I think, is that LLMs start making beautiful art completely unprompted (something I've seen zero evidence of being possible thus far). That's a universe I would be fascinated to exist in. A shame its probably paradoxical - I can imagine it being like whalesong :-)

      Software is very different, as you say, not least because of its contingency on utility and temporality. Another thing that I find nice to imagine is a future canon of 'classical' software. I'm sure that this will exist at some point, given how young a form it is, relatively speaking. That too, I hope, will be predicated on beauty of design, as we've done with all our other canons.

  • > The joy of creating should be in the creation

    > I think LLMs are the most interesting paintbrush-for-words we've come up with since the typewriter

    I cannot reconcile these thoughts in my head

    For me, the joy of creating does not come from asking the computer to create something for me. It doesn't matter what careful prompt I made, I did not create the outcome. The computer did

    And no, this is not the same as other computer tools. A drawing tablet may offer tools to me, but I still have to create myself

    AI is not a "tool" it is the author

    Prompt engineers are editors at best

    • I understand that point of view.

      Perhaps this is contextually useful - when writing prose fiction, one technique I've played with recently which I found interesting is generating a really broad spectrum of 'next tokens' halfway through a sentence, via multiple calls to different models on different temp. settings, etc.

      It's fascinating to see the expected route for a sentence, and (this is much harder to get LLMs to output!) the unexpected route for a sentence.

      But seeing some expected routes, per the LLM, can make the unexpected, surprising, or interesting routes much more clear in the mind's eye. It makes sentences feel closer to music theory.

      You are right that this does create a more 'editorial' relationship between yourself and the work.

      I'd stress that this isn't a negative thing, and has heavy literary precedence - an example that comes to mind is Gordon Lish's "intuitive structuring" principle, in which you just write the best-sounding next word, and see what the story becomes by itself, then edit from there - a totally sonic approach.

      My example here with "arrays of next tokens" is a super granular, paintbrush-type example, but I want to be clear that I'm not at all advocating for the workflow of 'write a prompt, get a piece of art'.

      I do however think that there's a vast middleground between "write me a whole book" and "show me the expected next token", and that this middleground is absolutely fascinating.

      Not least because it makes literature (an artform previously more resistant to mathematics than say, music, or painting) more in touch with its own mathematics, which were previously very hidden, and are only currently being discovered.

    • To some, the joy in writing is in producing a story. To others, it lies in crafting the words. If you want to "direct" a story, an LLM that could take your exposition of your plot and make it into something enjoyable to read could still let you enjoy the process. If what you care about is crafting the perfect sentence, an LLM that could assist with the plot could still let you enjoy the process. Maybe more, because not everyone enjoys every aspect of the process.

      And even authors who enjoy both might hate the many subsequent steps to publishing a book, such as getting editorial feedback and doing rewrites that can sometimes feel like a gut-punch (I sat on my first editorial feedback for a month, agonising over what to accept and what to ignore, and it was anxiety-inducing and felt awful - since I don't expect to make much money from my novels, I decided to ignore a lot of it, even when I knew the editor was probably right from a mass-market appeal point of view, but it sure as hell was not an enjoyable part of the process).

      And some people don't enjoy the actual writing at all, but enjoys coming up with high-level plots and seeing what pops out.

      In other words: It's not all or nothing, and people enjoy wildly different things about the process.

      Or to put it yet another way: Some people - even adults - enjoy paint by numbers too. Not everyone want to create - sometimes people just want to be adjacent to creation and discover things.

I don't give a shit about getting an insight into the authors mind. It is not even revelant to the experience of art for me.

You're presuming that your experience of it is universal, and it is not.

To me, a tool that would produce stories that I enjoy reading would add value to my world if I meant I got more stories I enjoy.

  • I think that discussing this subject in the abstract, with some ideal notion of a tool that generates perpetually enjoyable stories misses the thrust of the general objection, which is actually mechanistic, and not social. LLMs are not this tool, for many (I would say most, but...). LLMs recycle the same ideas over and over and over with trite stylistic variation. Once you have read enough LLM generated/adapted works they're all the same and they lose all value as entertainment.

    There is a moment I come to over and again when reading any longer form work informed by AI. At first, I don't notice (if the author used it 'well'). But once far enough in, there is a moment where everything aligns and I see the structure of it and it is something I have seen a thousand times before. I have seen it in emails and stories and blog posts and articles and comments and SEO spam and novels passed off as human work. In that moment, I stop caring. In that moment, my brain goes, "Ah, I know this." And I feel as if I have already finished reading its entirety.

    There is some amount of detail I obviously do not 'recall in advance of reading it'. The sum total of this is that which the author supplied. The rest is noise. There is no structure beyond that ever present skein patterned out by every single LLM in the same forms, and that skein I am bored of. It's always the same. I am tired of reading it again and again. I am tired of knowing exactly how what is coming up will come, if not the precise details of it, and the way every reaction will occur, and how every pattern of interaction will develop. I am tired of how LLMs tessellate the same shapes onto every conceptual seam.

    I return now to my objection to your dismissal of the value of insight into the author's mind. The chief value, as I see it, is merely that it is always different. Every person has their own experiences and that means when I read them I will never have a moment where I know them (and consequently, the work) in advance, as I do the ghost-writing LLMs, which all share a corpus of experience.

    Further, I would argue that the more apt notion of insight into the work is the sole value of said work (for entertainment), and that insight is one time use (or strongly frequency dependent, for entertainment value). Humans actively generate 'things to be insightful of' through lived experience, which enriches their outputs, while LLMs have an approximately finite quantity of such due to their nature as frozen checkpoints, which leads you to "oh, I have already consumed this insight; I have known this" situations.

    If you have a magic tool that always produces a magically enjoyable work, by all means, enjoy. If you do not, which I suspect, farming insight from a constantly varying set of complex beings living rich real life experiences is the mechanical process through which a steady supply of enjoyable, fresh, and interesting works can be acquired.

    Being unaware of this process does not negate its efficacy.

    TLDR; from the perspective of consumption, generated works are predominantly toothless as reading any AI work depletes from a finite, shared pool of entertaining-insight that runs dry too quickly