Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks Paper β’ 2312.06795 β’ Published Dec 11, 2023 β’ 2
Arcee's MergeKit: A Toolkit for Merging Large Language Models Paper β’ 2403.13257 β’ Published Mar 20, 2024 β’ 22
No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces Paper β’ 2502.04959 β’ Published Feb 7, 2025 β’ 12
Accurate and Efficient Low-Rank Model Merging in Core Space Paper β’ 2509.17786 β’ Published Sep 22, 2025 β’ 3
Reconstructions of Einstein-Aether Gravity from Barrow Agegraphic and New Barrow Agegraphic Dark Energy models: Examinations and Observational Limits Paper β’ 2410.19897 β’ Published Jun 11, 2025 β’ 1
Inflation in light of ACT/SPT: a new perspective from Weyl gravity Paper β’ 2512.10862 β’ Published Jan 8 β’ 1
CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging Paper β’ 2503.01874 β’ Published Feb 26, 2025 β’ 1
Riot Gremlins πΉπ Collection 7B Models For merging. for RP on my TINY rig at Q6. Without a bloody POD. Merge ideas/sketches. Results will go in 'Babsies Models.' β’ 47 items β’ Updated 10 days ago β’ 2
Functionality-Oriented LLM Merging on the Fisher--Rao Manifold Paper β’ 2603.04972 β’ Published Mar 5 β’ 3
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper β’ 2508.05629 β’ Published Aug 7, 2025 β’ 191
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 18 days ago β’ 866
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper β’ 2311.03099 β’ Published Nov 6, 2023 β’ 32