← Back to context

Comment by Pocomon

9 hours ago

> Heretic is a tool that removes censorship (aka "safety alignment") from transformer-based language models without expensive post-training.

I've noticed such "safety alignment" with the current LLMs. Not just insisting on providing the orthodox answer but - if presented with verifiable facts - nothing. “I'm sorry Dave but I can't help you with that” - or words to such effect.

Also: Youtube keeps automatically erasing rude words. How can you do serious historical research with this nonsense?