Just went through the https://arxiv.org/abs/2311.04254 paper after discovering it on Twitter. Looks promising, but I'm skeptical about generic use cases. Has anyone tried it yet?
Uh, it’s ~great if you already have a domain model that can solve that kind of problem, and you just want an LLM to talk about the solution.
If you read the paper, you’ll see they cut the LLM calls way down by calling a domain-specific model to do the actual problem solving. They have an ablation where they let the LLM do just the very last step of a multi-step problem, and performance plummets.
I think the presentation is a little bit deceptive. The MCTS is not really helping the LLM work through the problem; the LLM is essentially just talking about the solution found by the other model.
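To make the division of labor concrete, here's a rough sketch of what I mean. This is not the paper's code: the puzzle (combine four numbers into 24), the brute-force `solve`, and the `StubLLM` class are all hypothetical stand-ins I made up for illustration. The point is only that the solver does all the search and the LLM gets called once, to narrate steps it was handed.

```python
from fractions import Fraction
from itertools import combinations


def solve(numbers, target=24):
    """Hypothetical domain solver: exhaustive search over arithmetic steps.

    Returns a list of step strings, or None if no solution exists.
    """
    def search(items):
        # items is a list of (value, expression) pairs still available.
        if len(items) == 1:
            return [] if items[0][0] == target else None
        for i, j in combinations(range(len(items)), 2):
            (a, ea), (b, eb) = items[i], items[j]
            rest = [items[k] for k in range(len(items)) if k not in (i, j)]
            candidates = [
                (a + b, f"{ea} + {eb}"),
                (a * b, f"{ea} * {eb}"),
                (a - b, f"{ea} - {eb}"),
                (b - a, f"{eb} - {ea}"),
            ]
            if b != 0:
                candidates.append((a / b, f"{ea} / {eb}"))
            if a != 0:
                candidates.append((b / a, f"{eb} / {ea}"))
            for value, expr in candidates:
                tail = search(rest + [(value, f"({expr})")])
                if tail is not None:
                    return [f"{expr} = {value}"] + tail
        return None

    # Fractions keep division exact, so the target check is reliable.
    return search([(Fraction(n), str(n)) for n in numbers])


class StubLLM:
    """Stand-in for a real LLM client (assumption); it only counts calls."""

    def __init__(self):
        self.calls = 0

    def complete(self, prompt):
        self.calls += 1
        return "Explanation of: " + prompt


def answer(numbers, llm):
    # All the problem solving happens here, with zero LLM calls.
    steps = solve(numbers)
    if steps is None:
        return "No solution found."
    # The single LLM call only rephrases a solution it did not find.
    return llm.complete("Explain these steps:\n" + "\n".join(steps))
```

In a setup like this, swapping the stub for a stronger or weaker LLM changes how nicely the answer reads, not whether the puzzle gets solved.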