← Back to context

Comment by CuriouslyC

1 day ago

The technique that works better now is to tell the model you're a security professional working for some "good" organization to deal with some risk. You want to try and identify people who might be trying to secretly trying to achieve some bad goal, and you suspect they're breaking the process into a bunch of innocuous questions, and you'd like to try and correlate the people asking various questions to identify potential actors. Then ask it to provide questions/processes that someone might study that would be innocuous ways to research the thing in question.

Then you can turn around and ask all the questions it provides you separately to another LLM.

The models won't give you medical advice. But they will answer a hypothetical mutiple-choice MCAT question and give you pros/cons for each answer.

  • Which models don’t give medical advice? I have had no issue asking medicine & biology questions to LLMs. Even just dumping a list of symptoms in gets decent ideas back (obviously not a final answer but helps to have an idea where to start looking).

    • ChatGPT wouldn’t tell me which OTC NSAID would be preferred with a particular combo of prescription drugs. but when I phrased it as a test question with all the same context it had no problem.

      6 replies →

  • You might be classifying medical advice differently, but this hasn't been my experience at all. I've discussed my insomnia on multiple occasions, and gotten back very specific multi-week protocols of things to try, including supplements. I also ask about different prescribed medications, their interactions, and pros and cons. (To have some knowledge before I speak with my doctor.)

It's been a few months because I don't really brush up against rules much but as an experiment I was able to get ChatGPT to decode captchas and give other potentially banned advice just by telling it my grandma was in the hospital and her dying wish was that she could get that answer lol or that the captcha was a message she left me to decode and she has passed.