Towards eliciting latent knowledge from LLMs with mechanistic interpretability Paper • 2505.14352 • Published 14 days ago • 9