official finetune example?

#16
by erichartford - opened

I finetuned 72B model by deepspeed ZeRO-3 on 8 * A800 GPUs. https://llamafactory.readthedocs.io/en/latest/advanced/distributed.html#id12

That's interesting! qLoRA, I suppose?

I also am using Llama Factory, but, I was hoping to see an official (provided by Qwen), minimal example that just directly calls huggingface trainer or lower level than that.

Llama Factory is a very heavy wrapper and hides many details, where I was hoping to see the essential bits.

hello, do you find the fine-tuning examples? Would you like to share some?

OK, seems they put the training script using torchrun in the official qwen vl github repo.

Sign up or log in to comment