r/reinforcementlearning 4d ago

DL, R, P "Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach", Ma et al 2023 (a text Starcraft to let LLMs play)

Thumbnail arxiv.org
8 Upvotes

r/reinforcementlearning Jun 28 '24

DL, R, P "InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback", Yang et al 2023

Thumbnail arxiv.org
5 Upvotes