Merge Large Language Models with mergekit
Model merging is a technique that combines two or more LLMs into a single model. It's a relatively new and experimental method to create new models for cheap (no GPU required). Model merging works surprisingly well and produced many state-of-the-art models on the Open LLM Leaderboard.
AutoMerger selects two 7B models on top of the Open LLM Leaderboard, combines them with a merge technique and evaluates the resulting model.
4
0 comments
Marcio Pacheco
7
Merge Large Language Models with mergekit
Data Alchemy
skool.com/data-alchemy
Your Community to Master the Fundamentals of Working with Data and AI — by Datalumina®
Leaderboard (30-day)
Powered by