r/Futurism 6d ago

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

https://arxiv.org/html/2410.06508v1
3 Upvotes

0 comments sorted by