HeinrichTheWolf_17@alien.top to LocalLLaMA • What is Q* and how do we use it? · 1 year ago
I’m wondering whether Q-Star is a recursive self-improvement mechanism. Perhaps the in-house model they have can innovate and keep learning on top of what it was trained on?