An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand
mode - rail, mode - subway/metro, place - europe, place - urban, ridership - demand, operations - scheduling, infrastructure - rolling stock
Metro train scheduling, Stochastic transit demand, Actor-critic architecture, Deep reinforcement learning, Multi-objective optimization
This paper presents a novel actor-critic deep reinforcement learning approach for metro train scheduling with circulation of limited rolling stock. The scheduling problem is modeled as a Markov decision process driven by stochastic passenger demand. As in most dynamic optimization problems, the complexity of the scheduling process grows exponentially with the amount of states, decisions, and uncertainties involved. This study aims to address this ‘curses of dimensionality’ issue by adopting an actor-critic deep reinforcement learning solution framework. The framework simplifies the evaluation and searching process for potential optimal solutions by parameterizing the original state and decision spaces with the use of artificial neural networks. A deep deterministic policy gradient algorithm is developed for training the artificial neural networks via simulated system transitions before the actor-critic agent can be applied for online schedule control. The proposed approach is tested with a real-world scenario configured with data collected from the Victoria Line of London Underground, UK. Experiment results illustrate the advantages of the proposed method over a range of established meta-heuristics in terms of computing time, system efficiency, and robustness under different stochastic environments. This study innovates urban transit operations with state-of-the-art computer science and dynamic optimization techniques.
Permission to publish the abstract has been given by Elsevier, copyright remains with them.
Ying, C.S., Chow, A.H.F., & Chin, K.S. (2020). An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand. Transportation Research Part B: Methodological, Vol. 140, pp. 210-235.