Commit 3ff972f (verified) by Triangle104 · 1 Parent(s): de6f9a8

Update README.md

  1. README.md +19 -0
README.md CHANGED
@@ -17,6 +17,25 @@ tags:
  This model was converted to GGUF format from [`Krystalan/DRT-8B`](https://huggingface.co/Krystalan/DRT-8B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/Krystalan/DRT-8B) for more details on the model.
 
+ ---
+ This repository contains the resources for our paper "DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought"
+
+ Updates:
+ 2024.12.31: We updated our paper with more details and analyses. Check it out!
+ 2024.12.31: We released the testing set of our work; please refer to data/test.jsonl
+ 2024.12.30: We released a new model checkpoint using Llama-3.1-8B-Instruct as the backbone, i.e., 🤗 DRT-o1-8B
+ 2024.12.24: We released our paper. Check it out!
+ 2024.12.23: We released our model checkpoints: 🤗 DRT-o1-7B and 🤗 DRT-o1-14B.
+ If you find this work useful, please consider citing our paper:
+
+ @article{wang2024drt,
+ title={DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought},
+ author={Wang, Jiaan and Meng, Fandong and Liang, Yunlong and Zhou, Jie},
+ journal={arXiv preprint arXiv:2412.17498},
+ year={2024}
+ }
+
+ ---
  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)