haopt/dimsum-L2-imagenet256

mamba-transformer


haopt committed on Feb 18

Commit 4133b1a · verified · 1 Parent(s): 42eaa5a

Create README.md

Files changed (1)

README.md +56 -0

README.md ADDED

	@@ -0,0 +1,56 @@

+---
+license: apache-2.0
+datasets:
+- ILSVRC/imagenet-1k
+tags:
+- diffusion
+- mamba-transformer
+- class2image
+- imagenet1k-256
+---
+<div align="center">
+<h1>Official PyTorch models of "DiMSUM: Diffusion Mamba - A Scalable and Unified
+Spatial-Frequency Method for Image Generation" <a href="https://arxiv.org/abs/2411.04168"> (NeurIPS'24)</a></h1>
+</div>
+<div align="center">
+  <a href="https://hao-pt.github.io/" target="_blank">Hao&nbsp;Phung</a><sup>*13&dagger;</sup> &emsp; <b>&middot;</b> &emsp;
+  <a href="https://quandao10.github.io/" target="_blank">Quan&nbsp;Dao</a><sup>*12&dagger;</sup> &emsp; <b>&middot;</b> &emsp;
+  <a href="https://termanteus.com/" target="_blank">Trung&nbsp;Dao</a><sup>1</sup>
+  <br> <br>
+  <a href="https://viethoang1512.github.io/" target="_blank">Hoang&nbsp;Phan</a><sup>4</sup> &emsp; <b>&middot;</b> &emsp;
+  <a href="https://people.cs.rutgers.edu/~dnm/" target="_blank"> Dimitris&nbsp;N. Metaxas</a><sup>2</sup> &emsp; <b>&middot;</b> &emsp;
+  <a href="https://sites.google.com/site/anhttranusc/" target="_blank">Anh&nbsp;Tran</a><sup>1</sup>
+  <br> <br>
+  <sup>1</sup>VinAI Research &emsp;
+  <sup>2</sup>Rutgers University &emsp;
+  <sup>3</sup>Cornell University &emsp;
+  <sup>4</sup>New York University
+  <br> <br>
+  <a href="https://vinairesearch.github.io/DiMSUM/">[Page]</a> &emsp;&emsp;
+  <a href="https://arxiv.org/abs/2411.04168">[Paper]</a> &emsp;&emsp;
+  <br> <br>
+  <em><sup>*</sup>Equal contribution</em> &emsp;
+  <em><sup>&dagger;</sup>Work done while at VinAI Research</em>
+</div>
+Our codebase is hosted at https://github.com/VinAIResearch/DiMSUM.git. Please refer to [sample.py]().
+To use the DiMSUM pretrained model:
+```python
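+# Requires the DiMSUM codebase (linked above) to be importable.
+from dimsum.model_dims import DiM
+# Load the pretrained weights published under this repository id.
+model = DiM.from_pretrained("haopt/dimsum-L2-imagenet256")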
+```
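
The loading snippet above assumes the DiMSUM codebase is already importable. As a minimal sketch, it could be wrapped in a small helper that fails with a clearer message when the package is missing. The helper name `load_dimsum` and its parameters are hypothetical and not part of the DiMSUM codebase; only the module path, class name, and repository id come from the README. The call to `eval()` assumes the returned object behaves like a `torch.nn.Module`.

```python
import importlib

# Repository id taken from the README snippet above.
REPO_ID = "haopt/dimsum-L2-imagenet256"


def load_dimsum(repo_id=REPO_ID, module_name="dimsum.model_dims", class_name="DiM"):
    """Hypothetical helper: import the model class and load pretrained weights."""
    try:
        module = importlib.import_module(module_name)
    except ImportError as exc:
        # Point the user at the codebase instead of a bare import failure.
        raise ImportError(
            f"Cannot import '{module_name}'. Clone "
            "https://github.com/VinAIResearch/DiMSUM.git and make it "
            "importable before loading the checkpoint."
        ) from exc

    model_cls = getattr(module, class_name)
    model = model_cls.from_pretrained(repo_id)

    # Assumption: the model is a torch.nn.Module, so eval() switches it to
    # inference mode. Skip the call if the attribute is absent.
    if hasattr(model, "eval"):
        model.eval()
    return model
```

Calling `load_dimsum()` with no arguments should behave like the two-line snippet in the README, apart from the additional `eval()` call.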
+**Please CITE** our paper and give us a :star: whenever this repository is used to help produce published results or is incorporated into other software.
+```bibtex
+@inproceedings{phung2024dimsum,
+   title={DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation},
+   author={Phung, Hao and Dao, Quan and Dao, Trung and Phan, Hoang and Metaxas, Dimitris and Tran, Anh},
+   booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
+   year={2024},
+}
+```