license: apache-2.0 | |
license_link: LICENSE | |
language: | |
- en | |
pipeline_tag: text-generation | |
library_name: transformers | |
This is a 9B model whose architecture is deepseek-v3, trained from scratch using 350B+ tokens from fully open-source, English-only datasets. It is designed for development and debugging purposes within the open-source community. |