Apolinário from multimodal AI art's picture

Apolinário from multimodal AI art PRO

multimodalart

·

https://multimodal.art

AI & ML interests

None yet

Recent Activity

liked a Space about 15 hours ago

Freepik/F-Lite

liked a Space about 18 hours ago

webml-community/qwen3-webgpu

liked a model 1 day ago

Qwen/Qwen3-235B-A22B

View all activity

Organizations

multimodalart's activity

upvoted a paper 12 days ago

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published 27 days ago • 19

upvoted a paper 29 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 131

upvoted 2 papers about 1 month ago

FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset

Paper • 2503.07091 • Published Mar 10 • 3

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 146

upvoted a paper about 2 months ago

h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform

Paper • 2503.02187 • Published Mar 4 • 5

upvoted 2 collections about 2 months ago

Wan2.1 14B 480p I2V LoRAs

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 39 items • Updated 29 days ago • 109

Remote VAE Inference Endpoints

Models and handler code used in https://huggingface.co/blog/remote_vae • 5 items • Updated Mar 10 • 4

upvoted a paper about 2 months ago

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Paper • 2503.01183 • Published Mar 3 • 27

upvoted an article 2 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

• 159

upvoted a paper 2 months ago

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Paper • 2502.14397 • Published Feb 20 • 42

upvoted an article 2 months ago

Article

Remote VAEs for decoding with HF endpoints 🤗

Feb 24

• 38

upvoted 4 papers 2 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 42

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published Feb 17 • 54

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 56

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 113

upvoted 3 papers 3 months ago

Stable Flow: Vital Layers for Training-Free Image Editing

Paper • 2411.14430 • Published Nov 21, 2024 • 22

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published Feb 5 • 30

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 213

upvoted an article 3 months ago

Article

The AI tools for Art Newsletter - Issue 1

Jan 31

• 77

upvoted a paper 3 months ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 30