Starling-RM-7B-alpha: New RLAIF Finetuned 7b Model beats Openchat 3.5 and comes close to GPT-4

Legcor@alien.top · 3 years ago

thereisonlythedance@alien.top · 3 years ago

I was sceptical, but darn, it’s good. Mistral is a fantastic base, and with this technique these guys have pushed it another step closer. A lot of the answers I’m getting are on par with old GPT-4 (pre-turbo; turbo in the API is a step up on old GPT-4, IMO).