I’m curious whether that would work and someone might already try. They are both finetunes from mistral, so i would imagine. I have a feeling that this frankenmerge could produce a very good small billion parameter model that might be better than any current <=14b.
Wrong again, this is a peanut butter and jelly sandwhich