ThistleknotB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

8

1

The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

ThistleknotB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

8

https://www.interconnects.ai/p/q-star

Chat

ThistleknotOPB
link
fedilink
arrow-up
1·
2 years ago
https://twitter.com/ylecun/status/1728126868342145481?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Etweet