Model: https://huggingface.co/Intel/neural-chat-7b-v3-1
It’s based on Mistral 7b, fine tuned on SlimOrca. Also trained on a rather unique accelerator called Habana 8x Gaudi2. Numbers do look pretty interesting.
You must log in or register to comment.
Wish they’d use chatml…
Yeah the responses are quite bad. I had high expectations after seeing the benchmarks.
Are you using the correct prompt template?
What’s the correct prompt template for this specific model?
also they used DPO