TinyLLaVA-Video-R1
Collection
Towards Smaller LMMs for Video Reasoning.
•
4 items
•
Updated
This model is obtained by cold-starting TinyLLaVA-Video with 16 manually annotated samples from the NextQA dataset. It serves as the base model for TinyLLaVA-Video-R1.
The 16 manually annotated samples used for cold-starting have been released here.