59 26 121

Bo Li

luodian

https://brianboli.com/

luodian

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

lmms-lab/Aero-1-Audio

updated a Space 1 day ago

lmms-lab/README

updated a Space 3 days ago

lmms-lab/Aero-1-Audio-Demo

View all activity

Organizations

luodian's activity

upvoted a paper 23 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published about 1 month ago • 54

upvoted a paper about 2 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 43

upvoted a collection 2 months ago

EgoLife

Collection

CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated Mar 7 • 17

upvoted 3 papers 3 months ago

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Paper • 2502.09696 • Published Feb 13 • 44

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 51

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 26

upvoted 3 papers 5 months ago

upvoted 7 papers 7 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 76

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2, 2024 • 42

Contrastive Localized Language-Image Pre-Training

Paper • 2410.02746 • Published Oct 3, 2024 • 38

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 38

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 55

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 39

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 36

upvoted a collection 7 months ago

LLaVA-OneVision

Collection

a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 24

upvoted a paper 9 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

upvoted a paper 10 months ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 36

upvoted a collection 10 months ago

LLaVA-Next-Interleave

Collection

7 items • Updated Oct 4, 2024 • 16