Wondering what everyone thinks in case this is true. It seems they’re already beating all open source models including Llama-2 70B. Is this all due to data quality? Will Mistral be able to beat it next year?
Edit: Link to the paper -> https://arxiv.org/abs/2310.17680
GPT 3.5 turbo was released on March 1 2023, for what it’s worth. Which makes it not a very old model.
Only if you assume that 3.5 TURBO is not a TURBO version of GPT 3.5 THAT would make the RELEASE in March 2022, likely with 6 months or more of training and tuning. So, you say that when they did the turbo version, they started fresh, with new training data and an approach based on the MS ORCA papers which were released in June, and still did not change the version number?
Let me say your assumption bare a thread of logic.
Oh it’s a TURBO version you say? Is that a technical term? I never said whatever you seem to think I said.