Comment by lmeyerov
1 month ago
Yep! One of my favorite attacks is just having a very long piece of a text so the LLM becomes unclear what's important and is happy to do something else
1 month ago
Yep! One of my favorite attacks is just having a very long piece of a text so the LLM becomes unclear what's important and is happy to do something else
No comments yet
Contribute on Hacker News ↗