PookaMacPhellimen@alien.topB to LocalLLaMAEnglish · 1 year agoQwen-72B releasedhuggingface.coexternal-linkmessage-square39fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkQwen-72B releasedhuggingface.coPookaMacPhellimen@alien.topB to LocalLLaMAEnglish · 1 year agomessage-square39fedilink
minus-squareambient_temp_xeno@alien.topBlinkfedilinkEnglisharrow-up1·1 year agoThe first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.
The first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.