Chao Huang's picture

2 7

Chao Huang

ChaoHuangCS

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

ZeroSep: Separate Anything in Audio with Zero Training

commented on a paper 5 days ago

ZeroSep: Separate Anything in Audio with Zero Training

upvoted a paper 6 days ago

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

View all activity

Organizations

None yet

ChaoHuangCS's activity

upvoted a paper 5 days ago

ZeroSep: Separate Anything in Audio with Zero Training

Paper • 2505.23625 • Published 5 days ago • 7

commented a paper 5 days ago

ZeroSep: Separate Anything in Audio with Zero Training

Paper • 2505.23625 • Published 5 days ago • 7 •

upvoted a paper 6 days ago

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

Paper • 2505.20426 • Published 8 days ago • 6

upvoted a paper 13 days ago

Learning to Highlight Audio by Watching Movies

Paper • 2505.12154 • Published 17 days ago • 3

commented a paper 13 days ago

Learning to Highlight Audio by Watching Movies

Paper • 2505.12154 • Published 17 days ago • 3 •

authored 2 papers 13 days ago

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published Apr 2 • 19

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Paper • 2407.13083 • Published Jul 18, 2024

upvoted a paper about 2 months ago

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published Apr 7 • 16

upvoted 3 papers 2 months ago

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published Apr 2 • 19

Scaling Concept With Text-Guided Diffusion Models

Paper • 2410.24151 • Published Oct 31, 2024 • 1

VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity

Paper • 2503.11557 • Published Mar 14 • 21