-
FocusedAD: Character-centric Movie Audio Description
Paper β’ 2504.12157 β’ Published β’ 7 -
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Paper β’ 2504.10465 β’ Published β’ 27 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper β’ 2504.13180 β’ Published β’ 13
Quang Huy
NothingLQH
Β·
AI & ML interests
None yet
Recent Activity
updated
a collection
about 10 hours ago
VLM
updated
a collection
about 10 hours ago
VLM
updated
a collection
about 13 hours ago
LLM
Organizations
None yet
Collections
21
models
None public yet
datasets
None public yet