TransformerLight: A Novel Sequence Modeling Based Traffic Signaling Mechanism via Gated Transformer
File version
Version of Record (VoR)
Author(s)
Li, M
Shen, J
Lü, L
Du, B
Zhang, K
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
Long Beach, United States
Abstract
Traffic signal control (TSC) is still one of the most significant and challenging research problems in the transportation field. Reinforcement learning (RL) has achieved great success in TSC but suffers from critically high learning costs in practical applications due to the excessive trial-and-error learning process. Offline RL is a promising method to reduce learning costs whereas the data distribution shift issue is still up in the air. To this end, in this paper, we formulate TSC as a sequence modeling problem with a sequence of Markov decision process described by states, actions, and rewards from the traffic environment. A novel framework, namely TransformerLight, is introduced, which does not aim to fit into value functions by averaging all possible returns, but produces the best possible actions using a gated Transformer. Additionally, the learning process of TransformerLight is much more stable by replacing the residual connections with gated transformer blocks due to a dynamic system perspective. Through numerical experiments on offline datasets, we demonstrate that the TransformerLight model: (1) can build a high-performance adaptive TSC model without dynamic programming; (2) achieves a new state-of-the-art compared to most published offline RL methods so far; and (3) shows a more stable learning process than offline RL and recent Transformer-based methods. The relevant dataset and code are available at Github.
Journal Title
Conference Title
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2023 Owner/Author. This work is licensed under a Creative Commons Attribution International 4.0 License.
Item Access Status
Note
Access the data
Related item(s)
Subject
Innovation management
Infrastructure engineering and asset management
Persistent link to this record
Citation
Wu, Q; Li, M; Shen, J; Lü, L; Du, B; Zhang, K, TransformerLight: A Novel Sequence Modeling Based Traffic Signaling Mechanism via Gated Transformer, KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023, pp. 2639-2647