multiverse_fan@alien.top to LocalLLaMA · English · 1 year ago
Anyone have a 1B or 3B model that is mostly coherent?
I’ve tried a few of these models, but that was some months ago. Have y’all seen any that can hold a conversation yet?
Nonetendo65@alien.top · 1 year ago
I’ve found Orca-Mini to be quite helpful for simple generation tasks under 200 tokens. Given it’s only about 2.0 GB, it’s quite powerful and easy to deploy on consumer hardware. Orca is the famous dataset behind the popular OpenOrca fine-tunes of Mistral 7B :)
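For anyone who wants to try a small GGUF model like this locally, here's a minimal sketch using llama-cpp-python. The model filename, thread count, and prompt template are placeholders/assumptions, not anything from the comment above; swap in whatever quantized file you actually downloaded.

```python
# Minimal sketch: running a small quantized model locally with llama-cpp-python.
# Assumes llama-cpp-python is installed and a GGUF file has been downloaded;
# the path below is a hypothetical placeholder.
from llama_cpp import Llama

llm = Llama(
    model_path="./orca-mini-3b.q4_0.gguf",  # hypothetical local file
    n_ctx=2048,    # context window
    n_threads=4,   # tune for your CPU
)

out = llm(
    "### User:\nGive me a two-sentence summary of why small local models are useful.\n\n### Response:\n",
    max_tokens=200,   # short generations, in line with the comment above
    temperature=0.7,
    stop=["### User:"],
)
print(out["choices"][0]["text"].strip())
```

Run it from the same directory as the GGUF file; on a typical laptop CPU a 3B-class quantized model should respond within a few seconds for prompts of this size.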