Hao Wang

conan1024hao

https://www.conan1024hao.com

AI & ML interests

Multilingual+Multimodal

conan1024hao's activity

upvoted a collection 3 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 9 days ago • 463

upvoted a paper about 1 year ago

SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition

Paper • 2401.09759 • Published Jan 18, 2024 • 2