World Transit Research

Deep reinforcement learning for transit signal priority in a connected environment

Meng Long
Xiexin Zou
Yue Zhou
Edward Chung

Document Type

Journal Article

Publication Date

2022

Subject Area

mode - bus, infrastructure - bus/tram priority, infrastructure - traffic signals, operations - performance, planning - methods

Keywords

Transit signal priority, Traffic signal control, Reinforcement learning, Invalid action masking, Connected environment

Abstract

Transit signal priority (TSP) is an effective measure to reduce traffic congestion and improve bus efficiency in metropolises. The connected vehicle technology and reinforcement learning (RL) algorithms can respectively provide more detailed and accurate information and more robust algorithms to traffic signal control systems, to develop smarter TSP strategies. This paper proposes an extended Dueling Double Deep Q-learning with invalid action masking (eD3QNI) algorithm for TSP strategy in a connected environment. The algorithm considers multiple conflicting bus priority requests and the constraints on the traffic light and phase skipping rule, aiming to improve the person delay of buses. Its performance is evaluated by simulation for a single intersection with two traffic demands and random arrivals, schedule deviations, occupancies of buses. Results demonstrate that eD3QNI produces lower average person delay and schedule delay than fixed-time signal, active TSP strategies, and other common RL methods. It also shows that the invalid action masking (IAM) method is superior to the usual variable decision points (VDP) method in terms of high convergence speed, effective performance improvement, and application of domain knowledge on the RL algorithm. The penetration rates of connected buses do not affect the converging speed of the proposed method, and an environment with a higher penetration rate will show better performance. Moreover, under the proposed method, different specific reward functions can be incorporated as desired to realize different operational goals for the TSP strategies.

Rights

Permission to publish the abstract has been given by Elsevier, copyright remains with them.

Comments

Transportation Research Part C Home Page:

http://www.sciencedirect.com/science/journal/0968090X

Recommended Citation

Long, M., Zou, X., Zhou, Y., & Chung, E. (2022). Deep reinforcement learning for transit signal priority in a connected environment. Transportation Research Part C: Emerging Technologies, Vol. 142, 103814.

Link to Full Text

COinS

World Transit Research

Deep reinforcement learning for transit signal priority in a connected environment

Document Type

Publication Date

Subject Area

Keywords

Abstract

Rights

Comments

Recommended Citation

Search

Links

Browse

Author Corner

World Transit Research

Deep reinforcement learning for transit signal priority in a connected environment

Authors

Document Type

Publication Date

Subject Area

Keywords

Abstract

Rights

Comments

Recommended Citation

Share

Search

Newsletter

Links

Browse

Author Corner