Turn-Based Offline Reinforcement Learning

This blog post is the result of a research collaboration between the Allegro Machine Learning Research team and the Institute of Mathematics of the Polish Academy of Sciences (IMPAN), Warsaw.

Piotr Miłoś

Piotr is a professor at the Polish Academy of Sciences and a visiting professor at the University of Oxford. He specializes in machine learning and co-leads a research group focused on reinforcement learning. He actively works to develop machine learning research in Poland: he hosts a reinforcement learning seminar, co-organizes a deep reinforcement learning course, and serves as a scientific advisor to the ML in PL association.