Update README.md
Browse files
README.md
CHANGED
@@ -17,6 +17,25 @@ tags:
|
|
17 |
This model was converted to GGUF format from [`Krystalan/DRT-8B`](https://huggingface.co/Krystalan/DRT-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
18 |
Refer to the [original model card](https://huggingface.co/Krystalan/DRT-8B) for more details on the model.
|
19 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
20 |
## Use with llama.cpp
|
21 |
Install llama.cpp through brew (works on Mac and Linux)
|
22 |
|
|
|
17 |
This model was converted to GGUF format from [`Krystalan/DRT-8B`](https://huggingface.co/Krystalan/DRT-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
18 |
Refer to the [original model card](https://huggingface.co/Krystalan/DRT-8B) for more details on the model.
|
19 |
|
20 |
+
---
|
21 |
+
This repository contains the resources for our paper "DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought"
|
22 |
+
|
23 |
+
Updates:
|
24 |
+
2024.12.31: We updated our paper with more detals and analyses. Check it out!
|
25 |
+
2024.12.31: We released the testing set of our work, please refer to data/test.jsonl
|
26 |
+
2024.12.30: We released a new model checkpoint using Llama-3.1-8B-Instruct as the backbone, i.e., 🤗 DRT-o1-8B
|
27 |
+
2024.12.24: We released our paper. Check it out!
|
28 |
+
2024.12.23: We released our model checkpoints. 🤗 DRT-o1-7B and 🤗 DRT-o1-14B.
|
29 |
+
If you find this work is useful, please consider cite our paper:
|
30 |
+
|
31 |
+
@article{wang2024drt,
|
32 |
+
title={DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought},
|
33 |
+
author={Wang, Jiaan and Meng, Fandong and Liang, Yunlong and Zhou, Jie},
|
34 |
+
journal={arXiv preprint arXiv:2412.17498},
|
35 |
+
year={2024}
|
36 |
+
}
|
37 |
+
|
38 |
+
---
|
39 |
## Use with llama.cpp
|
40 |
Install llama.cpp through brew (works on Mac and Linux)
|
41 |
|