Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
zhehuderek
/
qwen2_5_vl_3b_GEOQA_8K_hf
like
0
Image-Text-to-Text
Transformers
Safetensors
qwen2_5_vl
conversational
text-generation-inference
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
Model Card for Model ID
Model Card for Model ID
Base model: Qwen/Qwen2.5-VL-3B-Instruct
Training: GRPO with leonardPKU/GEOQA_8K_R1V
Training log on wandb:
https://wandb.ai/ddderek-hk-polyu/easy_r1/runs/d1xtspm0
Total step of 70, not converged yet
Downloads last month
7
Safetensors
Model size
3.75B params
Tensor type
BF16
·
Chat template
Files info
Inference Providers
NEW
Image-Text-to-Text
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Collection including
zhehuderek/qwen2_5_vl_3b_GEOQA_8K_hf
vlm_grpo
Collection
VLM with GRPO training for enhanced reasoning
•
2 items
•
Updated
Apr 9