Tentative Programme


09:00-09:15    Welcome
09:15-10:00    "End-to-end Hierarchical Reinforcement Learning" by Herke van Hoof
10:00-10:15    "A fast hybrid reinforcement learning framework with human corrective feedback" by Carlos Celemin
10:15-10:30    Flash talks (Posters 1-7)
10:30-11:00    Coffee break (and posters)

11:00-11:45    Wouter Koolen
11:45-12:00    "Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies" by Pieter Libin
12:00-12:15    Flash talks (Posters 8-14)
12:15-13:30    Lunch break (and posters)

13:30-14:15    "Learning from demonstrations" by Tim Salimans
14:15-14:30    "Attention Solves Your TSP, Approximately" by Wouter Kool
14:30-14:45    "Online influence maximization with local observations" by Julia Olkhovskaya
14:45-15:30    Coffee break (and posters)

15:30-15:45    "TDRL Emotions" by Joost Broekens
15:45-16:00    "Stochastic Activation Actor-Critic Methods" by Wendy Shang
16:00-16:15    "Reinforcement Learning in Spiking Neural Networks" by Sander Bohte
16:15-16:30    "Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems" by Eugenio Bargiacchi
16:30-17:30    Discussion groups

17:30-18:30    Drinks

19:00-         Dinner in town (optional, at own expense)




Posters

1. "Simultaneous Action Learning and Grounding through Reinforcement and Cross-Situational Learning" by Oliver Roesler
2. "Learning System-Efficient Equilibria in Route Choice Using Tolls" by Gabriel de Oliveira Ramos
3. "Learning controllers for drones and mobile robots" by Javier Alonso Mora
4. "Stable, Practical and On-line Bootstrapped Conservative Policy Iteration (accepted at EWRL)" by Denis Steckelmacher
5. "Can we use brain-based feedback to identify the semantic concept on your mind using RL?" by Karen Dijkstra
6. "Intra-day Bidding Strategies for Storage Devices Using Deep Reinforcement Learning" by Ioannis Boukas
7. "Coordinating Human and Agent Behavior in Collective-Risk Scenarios" by Elias Fernández Domingos
8. "From Algorithmic Black Boxes to Adaptive White Boxes: Declarative Decision-Theoretic Ethical Programs as Codes of Ethics" by Martijn van Otterlo
9. "Interactive Reinforcement learning to reduce the total solution space of an assembly task" by Joris De Winter
10. "Achieving scalable model-free demand response in charging an electric vehicle fleet with reinforcement learning" by Chris Develder
11. "Large-scale vehicle routing (uses no RL)" by Michal Cap
12. "Solution horizons in non-stationary MDPs" by Grigory Neustroev
13. "Safe Reinforcement Learning in Factored MDPs" by Thiago Dias Simao
14. "Temporal Representation Learning" by Thomas Moerland