Pedro Cuenca's picture

Pedro Cuenca

pcuenq

·

AI & ML interests

None yet

Recent Activity

new activity 2 days ago

google/gemma-3-1b-pt:Delete model.safetensors.index.json

new activity 2 days ago

mlx-community/gemma-3-12b-it-4bit:Broken model?

liked a model 3 days ago

Mozilla/Qwen2.5-0.5B-Instruct

View all activity

Organizations

pcuenq's activity

upvoted a collection 4 days ago

Gemma 3

A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models • 31 items • Updated 4 days ago • 15

upvoted an article 4 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

5 days ago

• 269

upvoted 2 collections 4 days ago

Gemma 3 Release

9 items • Updated 3 days ago • 251

Google's Gemma models family

264 items • Updated 4 days ago • 114

upvoted an article 11 days ago

Article

Public Policy at Hugging Face

Apr 8, 2024

• 22

upvoted a collection 12 days ago

Jamba 1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated 10 days ago • 85

upvoted an article 12 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

13 days ago

• 66

upvoted a collection 12 days ago

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 12 days ago • 64

upvoted a paper 13 days ago

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 49

upvoted a collection 16 days ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74

upvoted an article 20 days ago

Article

Remote VAEs for decoding with HF endpoints 🤗

21 days ago

• 36

upvoted a collection 23 days ago

SigLIP2

36 items • Updated 4 days ago • 64

upvoted an article 23 days ago

Article

SigLIP 2: A better multilingual vision language encoder

24 days ago

• 136

upvoted an article 24 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

25 days ago

• 207

upvoted a collection 25 days ago

PaliGemma 2 Mix

13 items • Updated 4 days ago • 60

upvoted an article 25 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

26 days ago

• 65

upvoted an article 26 days ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

27 days ago

• 93

upvoted a paper 27 days ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13 • 50

upvoted a paper about 1 month ago

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Paper • 2502.07617 • Published Feb 11 • 29

upvoted a collection about 1 month ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208