← Back to context

Comment by jackb4040

10 hours ago

Didn't they explicitly say the ads wouldn't be made aware of prompt data when they announced them? And if so, how is that not securities fraud?

Maybe someone with more time at hands could look up what Google said with respect to ads and what happened later.

This is one of the rare instances where it's very easy to predict the future: the prompt auction market will look similar to the existing online ad market, financial firms will pay for prompt streams for sentiment analysis, companies and interest groups will pay to have their products or agenda included favorably in the training data for future open weights models... any way you can think of that LLMs can be monetized, you will see it happen. And fast. The financial pressure is way too high for there to be too long of a honeymoon phase like we had with web 2.0

  • And how much trust are you going to have with your model results that they haven't been transformed and adjusted by advertising priorities?

    search engine results do this all the time, reordering output by advertiser input. its a pretty small jump from that to rewriting output from models, and even better where its all a black box.

    • >And how much trust are you going to have with your model results that they haven't been transformed and adjusted by advertising priorities?

      None.

    • Also Google did it over-time - they didn't suddenly become who they are today 10 years ago even.

    • I mean search engine results are pretty poor and have been for a long time. They reflect SEO, not credibility or quality.

      LLMs have plenty of issues, but they’re relatively clean compared with what the future will look like.

In what way would that be securities fraud? I guess you could get nailed under Section 17(a), but really hard to make a case they're defrauding investors by representing they were going to make ads worse performing than they ended up making them.

In order for it to be securities fraud it has to be tied to a securities transaction and the misstatement has to be material to a reasonable investor's decision.

I think they said the ad vendors wouldn't but the matching algorithm would still be aware of it. Which IMO is the bare requirement to have ads be anything but magazine style ads.

I mean, the ad doesn’t necessarily have to be made aware of the exact prompt context, just that the ad itself was relevant. You can basically have the ads prequalified for areas and serve them when relevant. Now that does show the user is talking about something relevant most likely, and depending on how they decide to serve them or provide referring, it may traceable to a profile/identity built for that user externally.

I’d be more concerned as to how this ends up in agent platforms using the LLMs, when you don’t have a fairly autonomous agent based system using these the entire point is that a human isn’t involved, so who are you serving ads to and where are you injecting them.

Moreover, if you are injecting them everywhere, does that survive stare for subsequent steps, meaning from the first set of results I get, does that loop back in again with the ad injected into the context. Because now, we have yet another dangerous way of injecting instructions into an already issue prone surface area.

I’m guessing they’re going to have special APIs that don’t include ads, and those are going to cost more, especially for non embedded agents (processes that already exist inside ChatGPT that kick off transparently from prompts, like asking it to work with an office document). After all the customers using agents aside from developers are mostly businesses, so it’s where the money is. The ads will exist for the poor to subsidize their use, and probably create even more barriers for agentic use like I described. Just my thoughts.

And good luck litigating against any business in this administration. Unless they explicitly tick off certain people or refuse to kiss the ring, they can get away with almost anything right now and there’s little risk of doing it or not because ticking off this admin will raise illegitimate prosecution even if you’re perfectly legal, almost the same level of if you’re not. It’s the ideal playground for doing all sorts of manipulation, just kiss the ring and you’ll be fine.

Wouldn't it have to have a negative effect on the security to be securities fraud? Causing an investor loss is a key point of securities fraud.

"We made a ton more money with ads and the stock went up" lacks that key element of fraud?

It would also be a huge security risk. But I can't think of any fundamental difference with Google queries, other than the sheer entropy of user data involved.

And I'm not a tinfoil internet anarchist, but just because Google only leaks user data in aggregated form to advertisers, doesn't mean that they don't leak their user data, it's just that they did so in a legal and responsible manner.

Maybe considering the difference in data volume and intimacy between queries and AI conversations, the privacy implications of advertising merit a difference in treatment, but I wouldn't be surprised if that is lost to a more simple 'Google did this so we can do it too' momentum.

  • The difference is you can make full use of Google without logging in

    Even with a throw away, no chance I use OpenAI now - if/when Anthropocene does this I’ll be in a tough spot

    • you can use chatgpt without an account, just not all of it

      and you can't make full use of Google without an account. for example, you need an account to upload to YouTube, manage your website in search, place ads, opt out of data usage. the list goes on

      2 replies →

Easy they lied to the public not investors and have more money than you.

Local llm or nothing at all.

  • This is a classic example highlighting the upside of local llms.

    However the local llms I can run on reasonable hardware are so dumb compared to opus, and even if I shelled out five figures of hardware to run the largest/smartest open model it still will be noticeably worse.

    Right now the remote models are just so much smarter and more affordable under most usage patterns.

  • > Local llm or nothing at all.

    I'm not as familiar with LLMs as I am media models, but there can't seriously be local contenders for beating Opus, GPT-5, etc. Right?

    At home hardware isn't good enough.

    Nobody "far enough behind" that isn't scared to release their model as open weights actually has a competitive model within 70% of the lead models.

    Now that the Chinese are catching up and even pulling ahead (eg. in video), they've stopped releasing the weights.

    Stragglers release weights. And those weights aren't competitive.

    Am I missing something?

    • GLM and Kimi are still releasing weights for near-SOTA models. DeepSeek, Qwen and arguably MiniMax are the ones that are perhaps falling behind.