che111's Collections

Generative Model

updated Sep 19, 2024

  • SelfEval: Leveraging the discriminative nature of generative models for evaluation

    Paper • 2311.10708 • Published Nov 17, 2023 • 17

  • OmniGen: Unified Image Generation

    Paper • 2409.11340 • Published Sep 17, 2024 • 115

  • NVLM: Open Frontier-Class Multimodal LLMs

    Paper • 2409.11402 • Published Sep 17, 2024 • 75

  • Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

    Paper • 2409.11355 • Published Sep 17, 2024 • 31

  • OSV: One Step is Enough for High-Quality Image to Video Generation

    Paper • 2409.11367 • Published Sep 17, 2024 • 14

  • Qwen2.5-Coder Technical Report

    Paper • 2409.12186 • Published Sep 18, 2024 • 147

  • Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

    Paper • 2409.12191 • Published Sep 18, 2024 • 78

  • Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models

    Paper • 2409.10695 • Published Sep 16, 2024 • 3