Update README.md
Browse files
README.md
CHANGED
@@ -5,6 +5,8 @@ language:
|
|
5 |
base_model:
|
6 |
- Helsinki-NLP/opus-mt-en-tpi
|
7 |
pipeline_tag: translation
|
|
|
|
|
8 |
---
|
9 |
|
10 |
# Model Card for Model ID
|
@@ -52,9 +54,8 @@ translation = pipe(">>tpi<< In the beginning, God created the heavens and the ea
|
|
52 |
* 2,600+ English-Tok Pisin sentence pairs based on translations from English into Tok Pisin pulled from the Bible Aquifer: https://aquifer.bible
|
53 |
* 4,150+ English-Tok Pisin sentence pairs based on translation of deuterocanonical books of the Bible into Tok Pisin: https://ebible.org/Scriptures/details.php?id=tpi
|
54 |
|
55 |
-
The first two sources are available via CC-BY-SA license. The last source is available via CC-BY-NC-ND.
|
56 |
|
57 |
#### Testing Data
|
58 |
|
59 |
-
Testing data was 10% of the sentences from the above specified training data.
|
60 |
-
|
|
|
5 |
base_model:
|
6 |
- Helsinki-NLP/opus-mt-en-tpi
|
7 |
pipeline_tag: translation
|
8 |
+
datasets:
|
9 |
+
- RickBrannan/tpi_eng_sentence_pairs
|
10 |
---
|
11 |
|
12 |
# Model Card for Model ID
|
|
|
54 |
* 2,600+ English-Tok Pisin sentence pairs based on translations from English into Tok Pisin pulled from the Bible Aquifer: https://aquifer.bible
|
55 |
* 4,150+ English-Tok Pisin sentence pairs based on translation of deuterocanonical books of the Bible into Tok Pisin: https://ebible.org/Scriptures/details.php?id=tpi
|
56 |
|
57 |
+
The first two sources are available via CC-BY-SA license and available in the [RickBrannan/tpi_eng_sentence_pairs](https://huggingface.co/datasets/RickBrannan/tpi_eng_sentence_pairs) dataset. The Tok Pisin of the last source is available via CC-BY-NC-ND from the website listed. For the English, we used the text of the deuterocanon of the [World English Bible (WEB)](https://ebible.org/Scriptures/details.php?id=eng-web) where references matched. Where WEB did not have a reference, we used the [deuterocanon of the English Revised Version (RV)](https://ebible.org/Scriptures/details.php?id=eng-rv).
|
58 |
|
59 |
#### Testing Data
|
60 |
|
61 |
+
Testing data was 10% of the sentences from the above specified training data.
|
|