FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

This repository contains the RoBERTa model checkpoints produced by Frank-Wolfe merging, as described in FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization.

FW-Merging frames large-scale model merging as a constrained optimization problem. The fine-tuned checkpoints define the constraint set, while the objective function specifies the desired properties of the merged model. The method is designed to be robust to irrelevant models while making effective use of relevant ones to improve performance.
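
To make the constrained-optimization view concrete, here is a minimal sketch of the textbook Frank-Wolfe iteration over the convex hull of a few toy "checkpoints". It is not the authors' implementation (see the linked code for that): the 2-D vectors, the quadratic objective, the `target` point, and the helper names `dot` and `frank_wolfe_merge` are all assumptions made for illustration, and the step size 2/(t+2) is the standard schedule rather than anything confirmed by the paper.

```python
def dot(u, v):
    """Inner product of two equal-length lists of floats."""
    return sum(a * b for a, b in zip(u, v))


def frank_wolfe_merge(checkpoints, grad_fn, init, steps=200):
    """Textbook Frank-Wolfe over the convex hull of `checkpoints`.

    Hypothetical helper for illustration only. Returns the merged
    parameters and how often each checkpoint was selected.
    """
    theta = list(init)
    picks = [0] * len(checkpoints)
    for t in range(steps):
        g = grad_fn(theta)
        # Linear minimization oracle: the checkpoint (a vertex of the
        # constraint set) that minimizes the linearized objective.
        i = min(range(len(checkpoints)), key=lambda k: dot(g, checkpoints[k]))
        picks[i] += 1
        gamma = 2.0 / (t + 2.0)  # standard diminishing step size
        # Convex combination keeps theta inside the hull of the checkpoints.
        theta = [(1 - gamma) * x + gamma * s
                 for x, s in zip(theta, checkpoints[i])]
    return theta, picks


# Toy setup (assumed): two "relevant" checkpoints and one "irrelevant" one.
checkpoints = [[1.0, 0.0], [0.0, 1.0], [-5.0, -5.0]]
target = [0.5, 0.5]  # minimizer of the toy objective 0.5 * ||theta - target||^2


def grad(theta):
    """Gradient of the toy quadratic objective."""
    return [x - y for x, y in zip(theta, target)]


merged, picks = frank_wolfe_merge(checkpoints, grad, init=checkpoints[0])
print(merged)  # close to [0.5, 0.5]
print(picks)   # the third (irrelevant) checkpoint is never selected
```

In this toy setup the oracle never selects the third checkpoint, because moving toward it never decreases the linearized objective. This gives some intuition for how an iteration of this kind can ignore irrelevant models while combining relevant ones.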

The merged model checkpoints can be found at: https://huggingface.co/hmarkc/FW-merged/tree/main/roberta

The code for merging the model and further details can be found at: https://github.com/hmarkc/FW-merged
