
Shehan Munasinghe

shehan97
https://shehanmunasinghe.github.io/
  • shehan_u_e_m
  • shehanmunasinghe

AI & ML interests

Computer Vision, Multi-modal learning

Organizations

Mohamed Bin Zayed University of Artificial Intelligence's profile picture

shehan97's activity

commented 2 papers 6 months ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 23 • 3

New activity in MBZUAI/swiftformer-xs over 1 year ago

Adding `safetensors` variant of this model

1
#1 opened almost 2 years ago by SFconvertbot