Comment by westurner
14 days ago
Task: play tetris
Task: write and optimize a tetris bot
Task: write and safely online optimize a tetris bot with consideration for cost to converge
openai/baselines (7 years ago) was leading on RL and then AlphaZero and Self-Attention Transformer networks.
LLMs are trained with RL, but aren't general purpose game theoretic RL agents?
"Optimizing Tetris Gameplay Using Reinforcement Learning Framework with Adaptive Genetic Algorithms" (2025) https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5906702 .. https://scholar.google.com/scholar?cites=1615762352187216859...
"Outsmarting algorithms: A comparative battle between Reinforcement Learning and heuristics in Atari Tetris" (2025) https://dl.acm.org/doi/10.1016/j.eswa.2025.127251