--- base_model: - shisa-ai/shisa-v2-mistral-nemo-12b - Elizezen/Himeyuri-v0.1-12B - inflatebot/MN-12B-Mag-Mell-R1 library_name: transformers tags: - mergekit - merge - chatml language: - en - ja --- ![image/png](https://huggingface.co/yamatazen/StarrySky-12B/resolve/main/StarrySky-12B.png?download=true) This is a Mistral model with ChatML tokens added to the tokenizer. # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [shisa-ai/shisa-v2-mistral-nemo-12b](https://huggingface.co/shisa-ai/shisa-v2-mistral-nemo-12b) as a base. ### Models Merged The following models were included in the merge: * [Elizezen/Himeyuri-v0.1-12B](https://huggingface.co/Elizezen/Himeyuri-v0.1-12B) * [inflatebot/MN-12B-Mag-Mell-R1](https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1) ### Configuration The following YAML configuration was used to produce this model: ```yaml base_model: shisa-ai/shisa-v2-mistral-nemo-12b models: - model: Elizezen/Himeyuri-v0.1-12B parameters: weight: [0, 0.25, 0.5, 0.75, 1] - model: inflatebot/MN-12B-Mag-Mell-R1 parameters: weight: [0.25, 0.3, 0.5, 0.3, 0.25] merge_method: ties dtype: bfloat16 parameters: normalize: true density: 0.5 tokenizer: source: union ```