Comment by Davidzheng
2 days ago
Phrasing as a question bc I don't know, but it seems like the update allowed grok 3 answers to tweets to be affected in some way by its responses to other tweets? Like I think some people made it same Nazi things by prompting it (which is unfortunate but jailbreaks are commonplace) but some other people then seemed to experience this content WITHOUT PROMPTING after that? Is this a correct statement? [I know it's complicated by the fact that there were some new techniques for hiding jailbreaks being used around same time]
No comments yet
Contribute on Hacker News ↗