"Base" models were actually trained with some GPT instruct datasets

Wonderful_Ad_5134@alien.top · 2 years ago

"Base" models were actually trained with some GPT instruct datasets

FPham@alien.top · 2 years ago

Shouldn’t be the proof in the pudding?

If Mistral 7B is better than most other 7b models, then they did something right, no?

I understand that the base model then can inherit some biases - but it’s onto them that they didn’t cleaned those “As and AI…” answers strings from their dataset. So despite this, it performs better.