2 12 12

Cong Wei PRO

lim142857

https://congwei1230.github.io/

CongWei1230

AI & ML interests

Generative Model; Multimodal Learning

Recent Activity

upvoted a paper 19 days ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

upvoted a paper 29 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

commented on a paper 29 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

View all activity

Organizations

lim142857's activity

upvoted a paper 19 days ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published 23 days ago • 42

upvoted a paper 29 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131

commented 2 papers 29 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131 •

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131 •

upvoted a paper 30 days ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 40

authored a paper about 1 month ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131

commented a paper about 1 month ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131 •

upvoted a paper about 1 month ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published Mar 14 • 20

upvoted a paper 5 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

authored a paper 5 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 28

liked a dataset 5 months ago

TIGER-Lab/OmniEdit-Filtered-1.2M

Viewer • Updated Dec 6, 2024 • 1.2M • 17.1k • 85

updated a dataset 5 months ago

TIGER-Lab/OmniEdit-Filtered-1.2M

Viewer • Updated Dec 6, 2024 • 1.2M • 17.1k • 85

upvoted 2 papers 5 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 28

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11, 2024 • 50

updated a dataset 5 months ago

IEMaster/test_upload

Viewer • Updated Nov 24, 2024 • 2.4k • 21

authored 2 papers 6 months ago

VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation

Paper • 2312.14867 • Published Dec 22, 2023 • 1

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11, 2024 • 50

liked a Space 6 months ago

1.87k

Stable Diffusion 3.5 Large

🏃

Generate images with SD3.5

upvoted a paper 7 months ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published Sep 20, 2024 • 71

updated a dataset 8 months ago

lim142857/iem_test_task1_randv2_s4_100_internvl_split_12_E4_SC_v2_PQ_v4

Viewer • Updated Sep 5, 2024 • 100 • 15