PookaMacPhellimen@alien.topB to LocalLLaMAEnglish · 2 years agoQwen-72B releasedhuggingface.coexternal-linkmessage-square39linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkQwen-72B releasedhuggingface.coPookaMacPhellimen@alien.topB to LocalLLaMAEnglish · 2 years agomessage-square39linkfedilink
minus-squareambient_temp_xeno@alien.topBlinkfedilinkEnglisharrow-up1·2 years agoThe first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.
The first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.