I’m curious whether that would work and whether someone has already tried it. They are both fine-tunes of Mistral, so I would imagine it could. I have a feeling that this frankenmerge could produce a very good small model, maybe better than any current <=14B model.
Do you think there is any scientific basis for the merge? This is medieval alchemy again.
No, it's Victorian-era Frankenstein, obvs.