Suche

EXPLOITING SIMILARITY INFORMATION IN REINFORCEMENT LEARNING Similarity Models for Multi-Armed Bandits and MDPs Ronald Ortner Lehrstuhl für Informationstechnologie, Montanuniversität Leoben…

42. UCBRev.pdf

UCB REVISITED: IMPROVED REGRET BOUNDS FOR THE STOCHASTIC MULTI-ARMED BANDIT PROBLEM PETER AUER AND RONALD ORTNER A BSTRACT. In the stochastic multi-armed bandit problem we consider a modification of…

43. ContRL.pdf

Online Regret Bounds for Undiscounted Continuous Reinforcement Learning Ronald Ortner∗† Montanuniversitaet Leoben 8700 Leoben, Austria rortner@unileoben.ac.at ∗ † Daniil Ryabko† INRIA Lille-Nord…

44. AdAgg.pdf

Noname manuscript No. (will be inserted by the editor) Adaptive Aggregation for Reinforcement Learning in Average Reward Markov Decision Processes Ronald Ortner the date of receipt and acceptance…

45. SelStateRep.pdf

Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning Odalric-Ambrym Maillard odalricambrym.maillard@gmail.com Montanuniversität Leoben, Franz-Josef-Strasse 18,…

46. CompInf.pdf

Competing with an Infinite Set of Models in Reinforcement Learning Phuong Nguyen Australian National University and NICTA Canberra ACT 0200, AUSTRALIA Odalric-Ambrym Maillard1 Technion, Faculty of…

47. MarkovBandits.pdf

Regret Bounds for Restless Markov Bandits Ronald Ortner∗, Daniil Ryabko∗∗, Peter Auer∗, Rémi Munos∗∗ Abstract We consider the restless Markov bandit problem, in which the state of each arm evolves…

48. ApproxStateRep.pdf

Selecting Near-Optimal Approximate State Representations in Reinforcement Learning Ronald Ortner1 , Odalric-Ambrym Maillard2 , and Daniil Ryabko3 1 Montanuniversitaet Leoben, Austria 2 The Technion,…

49. ContRL-KD.pdf

Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning K.Lakshmanan Montanuniversität Leoben, Franz-Josef-Strasse 18, 8700 Leoben, AUSTRIA LKSHMNAN . K @ GMAIL . COM Ronald…

50. VarRL.pdf

Variational Regret Bounds for Reinforcement Learning Ronald Ortner Pratik Gajane Peter Auer rortner@unileoben.ac.at pratik.gajane@unileoben.ac.at auer@unileoben.ac.at Lehrstuhl für…

Suche

© Montanuniversität Leoben

QUICKLINKS

SERVICES FÜR

TOOLS

© Montanuniversität Leoben

Wir verwenden Cookies

Suche

QUICKLINKS

SERVICES FÜR

TOOLS

© Montanuniversität Leoben