Triangle104/DRT-8B-Q4_K_S-GGUF

Text Generation

machine translation


Triangle104 committed on Mar 28

Commit 3ff972f · verified · 1 Parent(s): de6f9a8

Update README.md

Files changed (1)

README.md +19 -0


@@ -17,6 +17,25 @@ tags:
 This model was converted to GGUF format from [`Krystalan/DRT-8B`](https://huggingface.co/Krystalan/DRT-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Krystalan/DRT-8B) for more details on the model.
+---
+This repository contains the resources for our paper "DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought"
+Updates:
+2024.12.31: We updated our paper with more details and analyses. Check it out!
+2024.12.31: We released the test set for our work; please refer to data/test.jsonl
+2024.12.30: We released a new model checkpoint using Llama-3.1-8B-Instruct as the backbone, i.e., 🤗 DRT-o1-8B
+2024.12.24: We released our paper. Check it out!
+2024.12.23: We released our model checkpoints. 🤗 DRT-o1-7B and 🤗 DRT-o1-14B.
+If you find this work useful, please consider citing our paper:
+@article{wang2024drt,
+  title={DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought},
+  author={Wang, Jiaan and Meng, Fandong and Liang, Yunlong and Zhou, Jie},
+  journal={arXiv preprint arXiv:2412.17498},
+  year={2024}
+}
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)