Thistleknot@alien.topB to LocalLLaMAEnglish · 2 years agoThe Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic datamessage-squaremessage-square8linkfedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1message-squareThe Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic dataThistleknot@alien.topB to LocalLLaMAEnglish · 2 years agomessage-square8linkfedilinkfile-text
minus-squareperlthoughts@alien.topBlinkfedilinkEnglisharrow-up1·2 years agoIt sounds like open source synthia, openchat, and zephyr lol. The whitepapers. lolol.
It sounds like open source synthia, openchat, and zephyr lol. The whitepapers. lolol.