Venus-120b: A merge of three different models in the style of Goliath-120b

nsfw_throwitaway69@alien.top · 2 years ago

nsfw_throwitaway69@alien.top · 2 years ago

Hard to say. Try it out and let me know!

tenmileswide@alien.top · 2 years ago

One thing’s for sure: it handles RoPE scaling much better than Goliath. Goliath starts falling apart at about 10-12k context for me, but Venus didn’t start doing so until like 30k.

r4ouldukke@alien.top · 2 years ago

What hardware are you guys even using to run something this big?