1 6 7

Zhao Zhang

zbrl

http://zhaozhang.net/

zzhanghub

AI & ML interests

Vision and Language

Recent Activity

authored a paper about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

upvoted a paper about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

upvoted a paper about 1 month ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

View all activity

Organizations

zbrl's activity

authored a paper about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Paper • 2503.08377 • Published Mar 11 • 2

upvoted 2 papers about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Paper • 2503.08377 • Published Mar 11 • 2

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 89

upvoted a paper about 2 months ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 72

upvoted a paper 5 months ago

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 88

liked a dataset 11 months ago

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

Updated Oct 30, 2024 • 2.16k • 134

authored 3 papers about 1 year ago

upvoted a paper about 1 year ago

Described Object Detection: Liberating Object Detection with Flexible Expressions

Paper • 2307.12813 • Published Jul 24, 2023 • 1

authored 4 papers about 1 year ago

Graphic Design with Large Multimodal Model

Paper • 2404.14368 • Published Apr 22, 2024 • 2

Described Object Detection: Liberating Object Detection with Flexible Expressions

Paper • 2307.12813 • Published Jul 24, 2023 • 1

Co-Salient Object Detection with Co-Representation Purification

Paper • 2303.07670 • Published Mar 14, 2023

Gradient-Induced Co-Saliency Detection

Paper • 2004.13364 • Published Apr 28, 2020

upvoted a paper about 1 year ago

Graphic Design with Large Multimodal Model

Paper • 2404.14368 • Published Apr 22, 2024 • 2

liked a dataset about 1 year ago

BAAI/DataOptim

Updated Mar 14, 2024 • 108 • 20

liked a Space about 1 year ago

326

MLLM-guided Image Editing (MGIE)

👩

Transform images based on textual instructions

liked a dataset about 1 year ago

SALT-NLP/Design2Code

Viewer • Updated Mar 11, 2024 • 485 • 105 • 18

liked 2 datasets almost 2 years ago

nampdn-ai/tiny-codes

Viewer • Updated Sep 30, 2023 • 1.63M • 311 • 249

BAAI/COIG-PC

Viewer • Updated Jun 14, 2024 • 540M • 965 • 267