I tested this model a little. It sometimes writes nonsense, such as malformed words, but I guess that's to be expected given that we have more or less copy-pasted and bunched together a couple of closely related models. It's not too surprising if it sometimes predicts badly.
However, I liked the writing quality a lot. Even after a simple trial run, it is clearly much better than the base llama2-70b. The base model is tight-lipped and extremely conservative and boring, and it is hard to coax creative output from it. That said, this model might actually be a regression relative to either of the models it is based on.