FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
This repository contains the Roberta model checkpoints resulting from applying Frank-Wolfe merging, as described in FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization.
FW-Merging frames large-scale model merging as a constrained optimization problem. Fine-tuned checkpoints define the constraint set, while the objective dictates the desired properties of the merged model. It is designed to be robust to irrelevant models and effectively utilize relevant models for improved performance.
The merged model checkpoints can be found at: https://huggingface.co/hmarkc/FW-merged/tree/main/roberta
The code for merging the model and further details can be found at: https://github.com/hmarkc/FW-merged
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support