Nina

NinaKarine

AI & ML interests

None yet

NinaKarine's activity

upvoted 2 papers 1 day ago

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Paper • 2505.00703 • Published 2 days ago • 25

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published 3 days ago • 37

upvoted a paper 16 days ago

Personalized Text-to-Image Generation with Auto-Regressive Models

Paper • 2504.13162 • Published 16 days ago • 18

upvoted a paper 19 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 22 days ago • 47

upvoted a paper 23 days ago

HoloPart: Generative 3D Part Amodal Segmentation

Paper • 2504.07943 • Published 23 days ago • 29

upvoted 5 papers about 1 month ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 38

Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 76

upvoted 2 papers about 2 months ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 140

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 50

upvoted a paper 3 months ago

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23 • 50

upvoted 2 papers 4 months ago

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 66

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 54

upvoted 2 papers 5 months ago

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 23

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Paper • 2412.04440 • Published Dec 5, 2024 • 21

upvoted 2 papers 6 months ago

SAMPart3D: Segment Any Part in 3D Objects

Paper • 2411.07184 • Published Nov 11, 2024 • 29

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 57

upvoted a paper 7 months ago

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Paper • 2410.10816 • Published Oct 14, 2024 • 21