nbeerbower
/

Dumpling-Mistral-Nemo-8B

Text Generation

text-generation-inference

Model card Files Files and versions Community

🧪 Experimental

An attempt to recover intelligence with a quick train, results are meh

Dumpling-Mistral-Nemo-8B

nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:

Method

QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.

Downloads last month: 2

Safetensors

Model size

8.43B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for nbeerbower/Dumpling-Mistral-Nemo-8B

Base model

nbeerbower/Mahou-1.5-mistral-nemo-12B-lorablated

Finetuned

nbeerbower/mistral-nemo-kartoffel-12B

Finetuned

nbeerbower/mistral-nemo-kartoffel-PRUNE3

Finetuned

(1)

this model

Quantizations

Datasets used to train nbeerbower/Dumpling-Mistral-Nemo-8B