git+https://github.com/huggingface/transformers
accelerate
qwen-vl-utils[decord]==0.0.8
torch
gradio
torchvision