Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ tags:
|
|
13 |
|
14 |
A SAE (Sparse Autoencoder) for [deepseek-ai/DeepSeek-R1-Distill-Llama-70B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B).
|
15 |
|
16 |
-
It is trained specifically on layer 48 of DeepSeek-R1-Distill-Llama-
|
17 |
|
18 |
This model is used to decompose Llama's activations into interpretable features.
|
19 |
|
|
|
13 |
|
14 |
A SAE (Sparse Autoencoder) for [deepseek-ai/DeepSeek-R1-Distill-Llama-70B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B).
|
15 |
|
16 |
+
It is trained specifically on layer 48 of DeepSeek-R1-Distill-Llama-70B and achieves a final L0 of 36.
|
17 |
|
18 |
This model is used to decompose Llama's activations into interpretable features.
|
19 |
|