The following checkpoints are from our paper titled Goldfish Loss: Mitigating Memorization in Generative LLMs.

- The control model differs only in that it did not use the canaries dataset for memorization; it was simply pre-trained on 20B RedPajama tokens.
- The Canaries dataset, which contains 2,000 Wikidocs, is repeated 50 times throughout pre-training, so it contains ~204M tokens in total (including padding); a quick back-of-the-envelope check is sketched below.
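
The ~204M figure is consistent with simple arithmetic, assuming each Wikidoc is padded to a 2,048-token training sequence (the sequence length is our assumption for illustration; it is not stated in this section):

```python
# Back-of-the-envelope check of the Canaries token count.
# The 2,048-token sequence length is an assumption for illustration,
# not a number stated in this README.
num_docs = 2_000   # Wikidocs in the Canaries dataset
repetitions = 50   # times the dataset is repeated during pre-training
seq_len = 2_048    # assumed padded sequence length per document

total_tokens = num_docs * repetitions * seq_len
print(f"{total_tokens:,}")  # 204,800,000 -> ~204M tokens
```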

# Cite our work

If you find our work useful, please cite our paper:

```bibtex
@misc{hans2024like,
      title={Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs},
      author={Abhimanyu Hans and Yuxin Wen and Neel Jain and John Kirchenbauer and Hamid Kazemi and Prajwal Singhania and Siddharth Singh and Gowthami Somepalli and Jonas Geiping and Abhinav Bhatele and Tom Goldstein},
      year={2024},
      eprint={2406.10209},
      archivePrefix={arXiv},
}
```