Comment by triyambakam

2 years ago

Is a linear probe part of observability/interpretability?

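Yes, it's a pretty fundamental technique and one of the earliest. Among other things, it lets you determine which layers contain what information: you train a simple linear classifier on a layer's activations, and if it can predict some property well, that property is linearly readable at that layer.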
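
To make that concrete, here's a rough sketch of the idea in plain Python. It's not how any particular library does it: the "activations" are synthetic stand-ins for real hidden states, and the names (`make_activations`, `train_probe`, `probe_accuracy`) plus the `signal` knob are all made up for illustration. One fake layer encodes the label along a single direction, the other is pure noise, and a logistic-regression probe is trained on each.

```python
import math
import random

def make_activations(n, dim, signal, rng):
    """Synthetic 'layer activations': Gaussian noise, with the binary label
    encoded along dimension 0 at strength `signal` (0 means not encoded)."""
    xs, ys = [], []
    for _ in range(n):
        y = rng.randint(0, 1)
        x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        x[0] += signal * (1.0 if y == 1 else -1.0)
        xs.append(x)
        ys.append(y)
    return xs, ys

def score(w, b, x):
    """Linear readout: the probe is just a weight vector and a bias."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def train_probe(xs, ys, epochs=100, lr=0.1):
    """Fit logistic regression with full-batch gradient descent."""
    dim, n = len(xs[0]), len(xs)
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(xs, ys):
            z = max(-30.0, min(30.0, score(w, b, x)))  # clamp to avoid overflow
            err = 1.0 / (1.0 + math.exp(-z)) - y
            for i in range(dim):
                grad_w[i] += err * x[i]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def probe_accuracy(signal, seed=0, dim=8):
    """Train a probe on one synthetic layer and return held-out accuracy."""
    rng = random.Random(seed)
    train_x, train_y = make_activations(400, dim, signal, rng)
    test_x, test_y = make_activations(200, dim, signal, rng)
    w, b = train_probe(train_x, train_y)
    correct = sum((score(w, b, x) > 0) == (y == 1)
                  for x, y in zip(test_x, test_y))
    return correct / len(test_y)

# Compare a layer that encodes the label with one that does not.
print("layer with label encoded:", probe_accuracy(signal=2.0))
print("layer with noise only:   ", probe_accuracy(signal=0.0))
```

With a real model you'd swap `make_activations` for hidden states captured at each layer and run the same comparison per layer. High held-out accuracy suggests the property is linearly decodable there; chance-level accuracy suggests it isn't, at least not linearly.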