Reinforcement Learning: Beyond Optimality
FWF project TAI 590-N (to start January 2022)
Project leader: Ronald Ortner
Department für Mathematik und Informationstechnologie
Lehrstuhl für Informationstechnologie
Montanuniversität Leoben
Franz-Josef-Straße 18
A-8700 Leoben
Tel.: +43 3842 402-1503
Fax: +43 3842 402-1502
E-mail: ronald.ortner(at)unileoben.ac.at
About the project
Reinforcement learning (RL) has been successful in applications, but theory has not been able to guarantee the reliability and robustness of the algorithms used. One reason is that RL theory focuses on optimization, while practical RL problems are task-oriented, so that optimality does not play any role. We aim to restart RL theory by replacing the optimality paradigm with a criterion based on satisficing, which will facilitate the development and analysis of algorithms. If successful, our research project will finally develop RL theory that is useful and applicable to practical RL problems.