← Back to context

Comment by nomel

9 days ago

Thanks, I thought maybe I missed something. That's an interesting way to interpret that.

8 comments

nomel

Reply

mips_avatar 9 days ago

Anthropic is trying to hide bad behavior by being vague, it's important to not be vague when calling it out.

nomel 9 days ago
I'm of the opinion that removing guardrails is how you force regulation. What's your opinion on the balance?
- dannyw 9 days ago
  
  They have all transcripts for at least 30 days. The problem is that (as anyone who used Fable can attest) their classifiers are extremely sensitive and catch tons of innocent queries.
  Imagine being a data scientist or MLE training a small classifier model. How do you know you won’t get steering vectors or a PEFT applied?
  
  2 replies →
- mips_avatar 9 days ago
  
  They’re not safety guardrails they’re anthropic doesn’t like anyone who isn’t anthropic working on AI rails

giancarlostoro 9 days ago

PEFT is a library, one of its capabilities is to produce LoRAs.

See:

https://heidloff.net/article/efficient-fine-tuning-lora/

adw 9 days ago

It's just an acronym, "parameter-efficient fine tuning". LoRA is one method, prefix tuning is another, there are more.