official finetune example?

#16

by erichartford - opened Feb 28

Discussion

erichartford

Feb 28

•

edited Feb 28

can we have an official finetune example please?

I saw a couple of repos out there

but it would be nice to have an official example from Qwen team

wilczek

Mar 13

I finetuned 72B model by deepspeed ZeRO-3 on 8 * A800 GPUs. https://llamafactory.readthedocs.io/en/latest/advanced/distributed.html#id12

erichartford

Mar 13

That's interesting! qLoRA, I suppose?

I also am using Llama Factory, but, I was hoping to see an official (provided by Qwen), minimal example that just directly calls huggingface trainer or lower level than that.

Llama Factory is a very heavy wrapper and hides many details, where I was hoping to see the essential bits.

XiangyuWen

6 days ago

hello, do you find the fine-tuning examples? Would you like to share some?

XiangyuWen

5 days ago

OK, seems they put the training script using torchrun in the official qwen vl github repo.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment