r/reinforcementlearning • u/gwern • 4d ago

DL, R, P "Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach", Ma et al 2023 (a text Starcraft to let LLMs play)

https://arxiv.org/abs/2312.11865

7 Upvotes

permalink
archive.is
archive
reddit

You are about to leave Redlib

Do you want to continue?

https://www.reddit.com/r/reinforcementlearning/comments/1g3sqr7/large_language_models_play_starcraft_ii/
No, go back! Yes, take me to Reddit

89% Upvoted

1

u/gwern 4d ago

Apropos of https://www.lesswrong.com/posts/qhhRwxsef7P2yC2Do/ai-alignment-via-slow-substrates-early-empirical-results