At Microsoft, we’re expanding AI capabilities by training small language models to achieve the kind of enhanced reasoning and comprehension typically found only in much larger models.
Tried the models, the 13B is very slow, the 7B is speedy but a little quirky. It made the plan how to solve the task but didn’t actually proceed in solving the task. It doesn’t have good conversational flair.
Tried the models, the 13B is very slow, the 7B is speedy but a little quirky. It made the plan how to solve the task but didn’t actually proceed in solving the task. It doesn’t have good conversational flair.