qresearch/DeepSeek-R1-Distill-Llama-70B-SAE-l48

mechanistic interpretability

sparse autoencoder


qtnx committed on Jan 29

Commit b6439f9 · verified · 1 Parent(s): 3d30302

Update README.md

Files changed (1)

README.md +1 -1


@@ -13,7 +13,7 @@ tags:
 A SAE (Sparse Autoencoder) for [deepseek-ai/DeepSeek-R1-Distill-Llama-70B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B).
-It is trained specifically on layer 48 of DeepSeek-R1-Distill-Llama-8B and achieves a final L0 of 36.
+It is trained specifically on layer 48 of DeepSeek-R1-Distill-Llama-70B and achieves a final L0 of 36.
 This model is used to decompose Llama's activations into interpretable features.
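
For readers unfamiliar with the "final L0 of 36" figure in the README: L0 is commonly taken to mean the average number of SAE features that are active (non-zero) per token. The sketch below illustrates that definition on toy data. It is an assumption about how the metric is computed, not code from this repository; the function name `mean_l0` and the toy activations are hypothetical.

```python
def mean_l0(feature_activations, eps=0.0):
    """Average count of active features per token.

    feature_activations: list of per-token lists of SAE feature activations.
    eps: magnitude threshold; a feature counts as active if |a| > eps.
    (Hypothetical helper for illustration, not part of this repository.)
    """
    if not feature_activations:
        raise ValueError("need at least one token")
    # Count the features above threshold for each token.
    counts = [sum(1 for a in token if abs(a) > eps) for token in feature_activations]
    # L0 is the mean of those per-token counts.
    return sum(counts) / len(counts)


# Toy data: 3 tokens, 4 features each (made-up values).
toy = [
    [0.0, 1.2, 0.0, 0.3],  # 2 active features
    [0.0, 0.0, 0.0, 2.0],  # 1 active feature
    [0.5, 0.0, 0.7, 0.1],  # 3 active features
]
toy_l0 = mean_l0(toy)
```

Under this reading, an L0 of 36 would mean that roughly 36 features fire per token on average at layer 48.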