ninjasaid13@alien.topB to LocalLLaMAEnglish · 1 year agoRedPajama Data V2 has been released with 30 Trillion tokens of datahuggingface.coexternal-linkmessage-square1fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkRedPajama Data V2 has been released with 30 Trillion tokens of datahuggingface.coninjasaid13@alien.topB to LocalLLaMAEnglish · 1 year agomessage-square1fedilink
minus-squareninjasaid13@alien.topOPBlinkfedilinkEnglisharrow-up1·1 year agohere’s the github page: https://github.com/togethercomputer/RedPajama-Data open-source with 30 Trillion tokens of Data.
here’s the github page: https://github.com/togethercomputer/RedPajama-Data open-source with 30 Trillion tokens of Data.