• 0 Posts
  • 1 Comment
Joined 1 year ago
cake
Cake day: October 23rd, 2023

help-circle
  • I tested this model a little bit. It sometimes writes nonsense, like bad words, but I guess that’s to be expected given that we have kind of copypasted and bunched couple of closely related models together. Not too surprising if it sometimes predicts badly.

    However, I liked the writing quality a lot. Even after a simple trial test run, it is clearly much better than the base tollama2-70b, and by a lot. The base model is kind of tightlipped and extremely conservative/boring and it is hard to coax it to write creative output. However, it might actually be regression related to either of the base models it is based on.