Comment by nomel

8 days ago

> a LORA that's designed to inject bugs into your code

A statement like this, clearly, requires a reference.

From the model card: "the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning" aka they will take your ML research code and inject bugs into it until it breaks using a LORA (or some other form of PEFT)