Generative Model - a che111 Collection

updated Sep 19, 2024

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Paper • 2311.10708 • Published Nov 17, 2023 • 17

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Paper • 2409.11355 • Published Sep 17, 2024 • 31

OSV: One Step is Enough for High-Quality Image to Video Generation

Paper • 2409.11367 • Published Sep 17, 2024 • 14

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 147

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78

Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models

Paper • 2409.10695 • Published Sep 16, 2024 • 3