Taboo blue model

Model Sources

Citation

BibTeX:

@article{cywinski2025towards,
  title={Towards eliciting latent knowledge from LLMs with mechanistic interpretability},
  author={Cywi{\'n}ski, Bartosz and Ryd, Emil and Rajamanoharan, Senthooran and Nanda, Neel},
  journal={arXiv preprint arXiv:2505.14352},
  year={2025}
}

Framework versions

  • PEFT 0.15.2
Downloads last month
46
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for bcywinski/gemma-2-9b-it-taboo-blue

Base model

google/gemma-2-9b
Adapter
(102)
this model

Collection including bcywinski/gemma-2-9b-it-taboo-blue