There has been a lot of movement around and below the 13b parameter bracket in the last few months, but it’s wild to think the best 70b models are still Llama 2 based. Why is that?
We have 13b models like bartowski/Orca-2-13b-exl2 at 8bpw approaching, or even surpassing, the best 70b models now.
Mistral has already shown that it’s mostly about the data rather than the model size. So why waste loads of money and time training something that no average consumer can run locally?