ThistleknotB to

LocalLLaMA@poweruser.forumEnglish · 3 years ago

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

8

1

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

ThistleknotB to

LocalLLaMA@poweruser.forumEnglish · 3 years ago

8

https://www.interconnects.ai/p/q-star

Chat