Update README.md
All pre-training is done on the [Cultura-X](https://huggingface.co/datasets/uonl
We extended the vocabulary of the base Llama model from 32,000 tokens to 57,000 tokens by adding up to 25,000 non-overlapping tokens from the new language.
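The selection step described above can be sketched in plain Python. Note this is an illustrative sketch, not the actual SambaLingo tokenizer-merging code: the function name, inputs, and toy vocabularies below are all hypothetical.

```python
# Sketch: extend a base vocabulary with up to `cap` tokens from a
# new-language tokenizer, keeping only tokens that do not already
# appear in the base vocabulary ("non-overlapping"). Illustrative only.

def extend_vocab(base_vocab, new_lang_tokens, cap=25_000):
    """Return the extended vocabulary: the base tokens plus up to `cap`
    new-language tokens that are not already present in the base."""
    extended = list(base_vocab)
    seen = set(base_vocab)
    for tok in new_lang_tokens:
        if len(extended) - len(base_vocab) >= cap:
            break                    # reached the 25k budget
        if tok not in seen:          # skip tokens the base already has
            extended.append(tok)
            seen.add(tok)
    return extended

# Toy example: a 4-token "base" vocab plus new-language candidates,
# two of which overlap and are therefore skipped.
base = ["<s>", "the", "при", "ния"]
candidates = ["the", "вет", "при", "мир"]
print(len(extend_vocab(base, candidates)))  # 4 base + 2 new = 6
```

In the real pipeline the new embedding rows for these tokens would also need to be initialized and the model's embedding matrix resized accordingly.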

## Evaluation Results

|                                      | SambaLingo-Russian-Base | saiga_mistral_7b_merged | rugpt3large_based_on_gpt2 | bloom-7b1 | xglm-7.5B | mGPT-13B |
|--------------------------------------|-------------------------|-------------------------|---------------------------|-----------|-----------|----------|
| Holdout Perplexity (Lower is better) | **1.444**               | 1.556                   | 1.611                     | 1.797     | 1.504     | 1.806    |
| FLORES en->ru (8 shot, CHRF)         | **0.472**               | 0.425                   | 0.319                     | 0.204     | 0.263     | 0.211    |
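For readers unfamiliar with the perplexity row: perplexity is the exponential of the mean negative log-likelihood the model assigns to the holdout tokens. The exact normalization used for the numbers above (per token vs. per byte) is not stated here; the sketch below shows the standard per-token definition.

```python
import math

# Sketch: holdout perplexity from per-token log-probabilities.
# Standard per-token definition: exp(mean negative log-likelihood).
# Lower is better (1.0 would mean the model predicts every token perfectly).

def perplexity(token_logprobs):
    """token_logprobs: natural-log probabilities the model assigned
    to each token in the holdout set."""
    nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(nll)

# Toy example: a model that assigns probability 0.5 to every token
# has perplexity ~2.
print(perplexity([math.log(0.5)] * 10))
```

A certain model, a perfect one, assigns probability 1 to every token (perplexity 1.0), which is why values such as 1.444 vs. 1.806 can represent a meaningful gap.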