Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Paper: arXiv:2203.05482
Referenced in the mergekit repo: https://github.com/cg123/mergekit
Note: Linear merging
Note: Task Arithmetic
Note: TIES
Note: DARE
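
The weight averaging described in the title, which corresponds to the linear merging note, can be sketched roughly as follows. This is a minimal illustration, not mergekit's implementation: `linear_merge`, the `StateDict` alias, and the use of plain Python lists in place of real tensors are all assumptions made for the example, as is the choice to normalize the weights by their sum.

```python
from typing import Dict, List, Optional, Sequence

# Hypothetical stand-in for a checkpoint: parameter name -> flat list of values.
StateDict = Dict[str, List[float]]


def linear_merge(
    models: Sequence[StateDict],
    weights: Optional[Sequence[float]] = None,
) -> StateDict:
    """Average the parameters of several models that share one architecture.

    With weights=None every model counts equally (a uniform average).
    Weights are normalized by their sum, which is an assumption of this sketch.
    """
    if not models:
        raise ValueError("need at least one model")
    if weights is None:
        weights = [1.0] * len(models)
    if len(weights) != len(models):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")

    # All models must expose the same parameter names and shapes.
    names = models[0].keys()
    for model in models:
        if model.keys() != names:
            raise ValueError("models have different parameter names")
        for name in names:
            if len(model[name]) != len(models[0][name]):
                raise ValueError(f"shape mismatch for parameter {name!r}")

    merged: StateDict = {}
    for name in names:
        size = len(models[0][name])
        # Element-wise weighted mean across models.
        merged[name] = [
            sum(w * m[name][i] for w, m in zip(weights, models)) / total
            for i in range(size)
        ]
    return merged
```

The merged result has the same shape as any single input, so it is served as one model; that is why averaging does not add inference cost. Calling `linear_merge([a, b])` gives the uniform average of two checkpoints, while passing `weights=[3.0, 1.0]` would lean the result toward the first one.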