189 Treffer:
41. UCBRev.pdf  
UCB REVISITED: IMPROVED REGRET BOUNDS FOR THE STOCHASTIC MULTI-ARMED BANDIT PROBLEM PETER AUER AND RONALD ORTNER A BSTRACT. In the stochastic multi-armed bandit problem we consider a modification of…  
42. ContRL.pdf  
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning Ronald Ortner∗† Montanuniversitaet Leoben 8700 Leoben, Austria rortner@unileoben.ac.at ∗ † Daniil Ryabko† INRIA Lille-Nord…  
43. AdAgg.pdf  
Noname manuscript No. (will be inserted by the editor) Adaptive Aggregation for Reinforcement Learning in Average Reward Markov Decision Processes Ronald Ortner the date of receipt and acceptance…  
44. SelStateRep.pdf  
Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning Odalric-Ambrym Maillard odalricambrym.maillard@gmail.com Montanuniversität Leoben, Franz-Josef-Strasse 18,…  
45. CompInf.pdf  
Competing with an Infinite Set of Models in Reinforcement Learning Phuong Nguyen Australian National University and NICTA Canberra ACT 0200, AUSTRALIA Odalric-Ambrym Maillard1 Technion, Faculty of…  
46. MarkovBandits.pdf  
Regret Bounds for Restless Markov Bandits Ronald Ortner∗, Daniil Ryabko∗∗, Peter Auer∗, Rémi Munos∗∗ Abstract We consider the restless Markov bandit problem, in which the state of each arm evolves…  
47. ApproxStateRep.pdf  
Selecting Near-Optimal Approximate State Representations in Reinforcement Learning Ronald Ortner1 , Odalric-Ambrym Maillard2 , and Daniil Ryabko3 1 Montanuniversitaet Leoben, Austria 2 The Technion,…  
48. ContRL-KD.pdf  
Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning K.Lakshmanan Montanuniversität Leoben, Franz-Josef-Strasse 18, 8700 Leoben, AUSTRIA LKSHMNAN . K @ GMAIL . COM Ronald…  
49. VarRL.pdf  
Variational Regret Bounds for Reinforcement Learning Ronald Ortner Pratik Gajane Peter Auer rortner@unileoben.ac.at pratik.gajane@unileoben.ac.at auer@unileoben.ac.at Lehrstuhl für…  
50. it1-vo-13.pdf  
Heute • Organisatorisches: Umstellung Studienplan • Evaluierung • Nachbesprechung Abschlusstest • Einführung Rekursion 30.01.2020 IT I - VO 13 1 Organisatorisches • Neuer Studienplan ab WS…  
Suchergebnisse 41 bis 50 von 189