PookaMacPhellimenB to LocalLLaMA@poweruser.forumEnglish · 1 year agoQwen-72B releasedhuggingface.coexternal-linkmessage-square41fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkQwen-72B releasedhuggingface.coPookaMacPhellimenB to LocalLLaMA@poweruser.forumEnglish · 1 year agomessage-square41fedilink
minus-squareambient_temp_xenoBlinkfedilinkEnglisharrow-up1·1 year agoThe first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.
The first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.