|
--- |
|
license: apache-2.0 |
|
--- |
|
This repository contains all the checkpoints referenced in the research **Towards Harmless Multimodal Assistants with Blind Preference Optimization**. |
|
|
|
## π Checkpoints Included |
|
- dpo checkpoint |
|
- bpo checkpoint |
|
- bpo analysis checkpoint |
|
|
|
## π Citation |
|
If you use these checkpoints in your work, please cite the research article appropriately. |
|
``` |
|
@misc{li2025harmlessmultimodalassistantsblind, |
|
title={Towards Harmless Multimodal Assistants with Blind Preference Optimization}, |
|
author={Yongqi Li and Lu Yang and Jian Wang and Runyang You and Wenjie Li and Liqiang Nie}, |
|
year={2025}, |
|
eprint={2503.14189}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CL}, |
|
url={https://arxiv.org/abs/2503.14189}, |
|
} |
|
``` |
|
|
|
Thank you for your interest! |