TinyLLaVA-Video-R1

This model is obtained by cold-starting TinyLLaVA-Video with 16 manually annotated samples from the NextQA dataset. It serves as the base model for TinyLLaVA-Video-R1.

The 16 manually annotated samples used for cold-starting have been released here.

Downloads last month: 66

Safetensors

Model size

3.63B params

Tensor type

BF16

Inference Providers NEW

Video-Text-to-Text

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16

TinyLLaVA-Video-R1

Collection

Towards Smaller LMMs for Video Reasoning. • 4 items • Updated 15 days ago