tensopolis's picture
Update README.md
9a013ce verified
|
raw
history blame
1.08 kB
metadata
base_model: unsloth/mistral-small-24b-instruct-2501-unsloth-bnb-4bit
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
  - trl
  - sft
license: apache-2.0
language:
  - en
image

mistral-small-r1-tensopolis

This model is a reasoning fine-tune of unsloth/mistral-small-24b-instruct-2501-unsloth-bnb-4bit, please refer to the base model and dataset for more information about license, prompt format, etc.

Base model: mistralai/Mistral-Small-24B-Instruct-2501 Dataset: ServiceNow-AI/R1-Distill-SFT

This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.