Venus-120b: A merge of three different models in the style of Goliath-120b

nsfw_throwitaway69@alien.top · 3 years ago

Venus-120b: A merge of three different models in the style of Goliath-120b

Distinct-Target7503@alien.top · 3 years ago

That’s a great work!

Just a question… Have anyone tried to fine tune one of those “Frankenstein” models? Some time ago (when the first “Frankenstein” came out, it was a ~20B model) I read here on reddit that lots of users agreed that a fine tune on those merged models would have “better” results since it would help to “smooth” and adapt the merged layers. Probably I lack the technical knowledge needed to understand, so I’m asking…

a_beautiful_rhind@alien.top · 3 years ago

Tess-XL-1.0… so far I didn’t like the results.

Distinct-Target7503@alien.top · 3 years ago

Is that a LORA or a full fine tune?