Merge Large Language Models with mergekit

Model merging is a technique that combines two or more LLMs into a single model. It is a relatively new and experimental method for creating new models cheaply (no GPU required). It works surprisingly well and has produced many state-of-the-art models on the Open LLM Leaderboard.

https://huggingface.co/blog/mlabonne/merge-models
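
To make "combining models" concrete, here is a minimal sketch of one of the simplest merge strategies: a weighted average of two models' parameters. It is an illustration only, not mergekit's API. The function name `linear_merge`, the `alpha` parameter, and the toy weight dictionaries are all hypothetical, and plain Python lists stand in for real tensors. It also assumes both models share the same architecture and parameter names.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flattened values.
# Real models store tensors; lists keep this sketch dependency-free.
Weights = Dict[str, List[float]]


def linear_merge(model_a: Weights, model_b: Weights, alpha: float = 0.5) -> Weights:
    """Return alpha * model_a + (1 - alpha) * model_b, parameter by parameter.

    Hypothetical helper for illustration; assumes identical architectures.
    """
    if model_a.keys() != model_b.keys():
        raise ValueError("models must share the same parameter names")
    merged: Weights = {}
    for name, values_a in model_a.items():
        values_b = model_b[name]
        if len(values_a) != len(values_b):
            raise ValueError(f"shape mismatch for {name}")
        # Interpolate each scalar weight between the two models.
        merged[name] = [alpha * a + (1 - alpha) * b for a, b in zip(values_a, values_b)]
    return merged


# Two made-up "models" with the same parameter layout.
model_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
model_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = linear_merge(model_a, model_b, alpha=0.5)
```

Because this is plain arithmetic over stored weights with no training step, it can run on a CPU, which is consistent with the "no GPU required" point above.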

AutoMerger selects two 7B models from the top of the Open LLM Leaderboard, combines them with a merge technique, and evaluates the resulting model.

https://huggingface.co/spaces/mlabonne/AutoMerger
