Zhao Zhang's picture

1 6 7

Zhao Zhang

zbrl

·

http://zhaozhang.net/

zzhanghub

AI & ML interests

Vision and Language

Recent Activity

authored a paper about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

upvoted a paper about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

upvoted a paper about 1 month ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

View all activity

Organizations

zbrl's activity

upvoted 2 papers about 1 month ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Paper • 2503.08377 • Published Mar 11 • 2

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 89

upvoted a paper about 2 months ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 72

upvoted a paper 5 months ago

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 88

upvoted 2 papers about 1 year ago

Described Object Detection: Liberating Object Detection with Flexible Expressions

Paper • 2307.12813 • Published Jul 24, 2023 • 1

Graphic Design with Large Multimodal Model

Paper • 2404.14368 • Published Apr 22, 2024 • 2