Thistleknot@alien.topB to

LocalLLaMAEnglish · 2 years ago

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

8

1

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

Thistleknot@alien.topB to

LocalLLaMAEnglish · 2 years ago

8

https://www.interconnects.ai/p/q-star

Chat

Feztopia@alien.topB
link
fedilink
English
arrow-up
1·
2 years ago
I just can’t wait until one of the wrong Q* hypotheses turn out to be even better than Q*