Comment by password54321

12 hours ago

Not sure why they put so much investment into videoSlop and imageSlop. Anthropic seems to be more focused at least.

Because almost everyone involved in the AI race grew up in "winner takes it all" environments, typical for software, and they try really hard to make it a reality. This means your model should do everything, just to take 90% of the market, or at least 90% of a specific niche.

The problem is, they can't find the moat, despite searching very hard. Whatever you bake into your AI, your competitors will be able to replicate in a few months. This is why OpenAI is striking a deal with Disney, because copyright provides such a moat.

  • > copyright provides such a moat.

    Been saying this since the 2014 Alice case. Apple jumped into content production in 2017. They saw the long-term value of copyright interests.

    https://arstechnica.com/information-technology/2017/08/apple...

    Alice changed things such that code monkeys' algorithms were not patentable (except in some narrow cases where true runtime novelty can be established). Since the transformers paper, the potential of self-authoring content was obvious to those who can afford to think about things rather than hustle all day.

    Apple wants to sell AI in an aluminum box while VCs need to prop up data center agrarianism; they need people to believe their server farms are essential.

    Not an Apple fanboy but in this case, am rooting for their "your hardware, your model" aspirations.

    The Altman/Thiel VC model of making the serfs tend their server fields, along with their control of foundation models, gives me a gross feeling. It comes with the most religious-like sense of fealty to political hierarchy and social structure that only exists as hallucination in the dying generations. The 50+ year old crowd cannot generationally churn fast enough.

    • OpenAI's opsec must be amazing. I had fully expected some version of ChatGPT to be leaked on torrent sites at some point this year. How do you manage to keep something that could be exfiltrated on a hard disk from escaping your servers in all cases, forever?

    • Totally agree, people love to talk about how hopelessly behind Apple is in terms of AI progress when they’re in a better position to compete directly against Nvidia on hardware than anyone else.

    • My goodness, are you really saying, in effect, "I wish people over 50 would just hurry up and die"?!?

      Good lord, expressing that kind of sentiment does not make for a useful and engaging conversation here on Hacker News.

  • Striking deals without a proper vision is a waste of resources. And that’s the path OAI is on.

OpenAI is (was?) extremely good at making things that go viral. The successful ones for sure boost subscriber count meaningfully.

Studio Ghibli, Sora app. Go viral, juice numbers, then turn the knobs down on copyrighted material. Atlas, I believe, was less successful than they would've hoped for.

And because of too-frequent version bumps, which are sometimes released as an answer to a Google launch rather than as a meaningful improvement, I believe they're also having a harder time going viral that way.

Overall, OpenAI throws stuff at the wall and sees what sticks. Most of it doesn't and gets (semi) abandoned. But some of it does, and it makes for a better consumer product than Gemini.

It seems to have worked well so far, though I'm sceptical it will be enough for long.

  • Selling a bunch of $20 a month subscriptions isn’t going to make a dent in OpenAI's losses. Going viral for a day or two doesn’t help.

    Normal people are already getting tired of AI slop.

Because, as with the internet, 99% of the usage won’t be for education, work, personal development, what have you. It will be for effing kitten videos and memes.

  • Are the posters of effing kitten videos a customer base with a significant LTV?

    (The obvious well-paying market would be erotic / furry / porn, but it's too toxic to publicly touch, at least in the US.)

    • OpenRouter stats already mention 52% usage is roleplay.

      As for photo/video, a very large number of people use it for friends and family (turn a photo into a creative/funny video, change a photo, etc.).

      Also, I would think Photoshop-like features are coming more and more to ChatGPT and the like. For example, “take my poorly-lit photo and make it look professional and suitable for a LinkedIn profile”.

    • Also FWIW I understand that the furry community has a strong culture of commissioning artists for their work, so that's likely to be a headwind against using genAI that isn't explicitly trained only on licensed materials. Sure, there are likely some who would use it regardless, but I expect the use of genAI to generate furry porn to be at least as toxic within that community as the use of genAI to generate furry porn outside of that community.

Because OpenAI stands for AI leader.

If Gemini can create or edit an image, ChatGPT needs to be able to do this too. Who wants to copy&paste prompts between AI agents?

Also if you want to have more semantics, you add image, video and audio to your model. It gets smarter because of it.

OpenAI is also significantly bigger than Anthropic and is known as a generic 'helper'. Anthropic probably saw the benefits of being more focused on developers, which allows it to stay in the game longer for the amount of money they have.

  • > Who wants to copy&paste prompts between AI agents?

    An AI!

    The specialist vs generalist debate is still open. And for complex problems, sure, having a model that runs on a small galaxy may be worth it. But for most tasks, a fleet of tailor-made smaller models being called on by an agent seems like a solidly-precedented (albeit not singularity-triggering) bet.

    • > But for most tasks, a fleet of tailor-made smaller models being called on by an agent seems like a solidly-precedented (albeit not singularity-triggering) bet.

      not an expert by any means, but wouldn't smaller but highly refined models also output more reproducible results?

      intuitively it sounds akin to the unix model...

  • > Also if you want to have more semantics, you add image, video and audio to your model. It gets smarter because of it.

    I think you are confusing generation with analysis. As far as I am aware, your model does not need to be good at generating images to be able to decode an image.

    • It is, to a first approximation, the same thing. The generative part of genAI is just running the analysis model in reverse.

      Now there are all sorts of tricks to get the output of this to be good, and maybe they shouldn't be spending time and resources on this. But the core capability is shared.

  • I think you're partially right, but I don't think being an AI leader is the main motivation -- that's a side effect.

    I think it's important to OpenAI to support as many use-cases as possible. Right now, the experience that most people have with ChatGPT is through small revenue individual accounts. Individual subscriptions with individual needs, but modest budgets.

    The bigger money is in enterprise and corporate accounts. To land these accounts, OpenAI will need to provide coverage across as many use-cases as they can so that they can operate as a one-stop AI provider. If a company needs to use OpenAI for chat, Anthropic for coding, and Google for video, what's the point? If Google's chat and coding is "good enough" and you need to have video generation, then that company is going to go with Google for everything. For the end-game I think OpenAI is playing for, they will need to be competitive in all modalities of AI.

  • > Because OpenAI stands for AI leader.

    It'll just end up spreading itself too thin and be second or third best at everything.

    The 500lb gorilla in the room is Google. They have endless money and maybe even more importantly they have endless hardware. OpenAI are going to have an increasingly hard time competing with them.

    That Gemini 3 is crushing it right now isn't the problem. It's Gemini 4 or 5 that will likely leave them in the dust for the general use case, meanwhile specialist models will eat what remains of their lunch.

Because there is only so much programmers and companies will pay for AI coders. The big prize is AI-generated TikTok.

The entertainment industry is by far the easiest way to tap into global discretionary income.

When they released their first good image model, they got 100 million new users in a week.

Because for all the incessant whining about "slop," multimodal AI i/o is incredibly useful. Being able to take a photo of a home repair issue, have it diagnosed, and return a diagram showing you what to do with it is great, and it's the same algos that power the slop. "Sorry, you'll have to go to Gemini for that use case, people got mad about memes on the internet" is not really a good way for them to be a mass consumer company.

Because their main use is for advertising/propaganda, which is largely videoSlop & imageSlop even without AI.

  • Outside of this: https://openai.com/index/disney-sora-agreement/ I don't think there has been much of a win for them even in advertising for image/video slop.

    • It's like half the posters on here live in some parallel universe. I am making real money using generated image/video advertising content for both B2C and B2B goods. I am using Whisper and LLMs to review customer service call logs at scale and identify development opportunities for staff. I am using GPT/Gemini to help write SQL queries and little Python scripts to do data analysis on my customer base. My business's productivity is way up since GenAI became accessible.

Because these are mostly the same players from the 2010s. So when they can't get more investor money and the hard problems are still being cracked, the easiest fallback is the same social media slop they used to become successful 10-15 years prior. Falling back on old ways to maximize engagement and grind out (eventually) ad revenue.

But how much more profitable are they? We see revenue but not profits / spending. Anthropic seems to be growing faster than OpenAI did but that could be the benefit of post-GPT hype.