[1] Government website: State Council. Shanghai: Road Traffic Achieves Smart Governance. 2019-04-03. https://www.gov.cn/xinwen/2019-04/03/content_5379482.htm (accessed 2024-02-15).
[2] Webster, F.V. (1958) Traffic Signal Settings.
[3] Vincent, R.A. and Peirce, J.R. (1988) "MOVA": Traffic Responsive, Self-Optimising Signal Control for Isolated Intersections.
[4] Sims, A.G. (1979) The Sydney Coordinated Adaptive Traffic System. Engineering Foundation Conference on Research Directions in Computer Control of Urban Traffic Systems, Pacific Grove, 11-16 February 1979, 12-27.
[5] Kaelbling, L.P., Littman, M.L. and Moore, A.W. (1996) Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, 4, 237-285.
[6] Wei, H., Zheng, G., Gayah, V., et al. (2021) Recent Advances in Reinforcement Learning for Traffic Signal Control: A Survey of Models and Evaluation. ACM SIGKDD Explorations Newsletter, 22, 12-18.
[7] Li, L., Lv, Y. and Wang, F.Y. (2016) Traffic Signal Timing via Deep Reinforcement Learning. IEEE/CAA Journal of Automatica Sinica, 3, 247-254.
[8] Luo, J., Li, X. and Zheng, Y. (2020) Researches on Intelligent Traffic Signal Control Based on Deep Reinforcement Learning. 2020 IEEE 16th International Conference on Mobility, Sensing and Networking (MSN), Tokyo, 17-19 December 2020, 729-734.
[9] Wang, S., Xie, X., Huang, K., et al. (2019) Deep Reinforcement Learning-Based Traffic Signal Control Using High-Resolution Event-Based Data. Entropy, 21, Article No. 744.
[10] Buşoniu, L., Babuška, R. and De Schutter, B. (2010) Multi-Agent Reinforcement Learning: An Overview. In: Srinivasan, D. and Jain, L.C., Eds., Innovations in Multi-Agent Systems and Applications - 1, Springer, Berlin, 183-221.
[11] Haddad, T.A., Hedjazi, D. and Aouag, S. (2022) A Deep Reinforcement Learning-Based Cooperative Approach for Multi-Intersection Traffic Signal Control. Engineering Applications of Artificial Intelligence, 114, Article ID: 105019.
[12] Chu, T., Wang, J., Codecà, L., et al. (2020) Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control. IEEE Transactions on Intelligent Transportation Systems, 21, 1086-1095.
[13] Wu, T., Zhou, P., Liu, K., et al. (2020) Multi-Agent Deep Reinforcement Learning for Urban Traffic Light Control in Vehicular Networks. IEEE Transactions on Vehicular Technology, 69, 8243-8256.
[14] Wang, X., Ke, L., Qiao, Z., et al. (2020) Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning. IEEE Transactions on Cybernetics, 51, 174-187.
[15] Garivier, A. and Moulines, E. (2011) On Upper-Confidence Bound Policies for Switching Bandit Problems. International Conference on Algorithmic Learning Theory, Espoo, 5-7 October 2011, 174-188.
[16] Yang, Y., Luo, R., Li, M., et al. (2018) Mean Field Multi-Agent Reinforcement Learning. International Conference on Machine Learning, PMLR, Stockholm, 10-15 July 2018, 5571-5580.
[17] Hu, T., Hu, Z., Lu, Z., et al. (2023) Dynamic Traffic Signal Control Using Mean Field Multi-Agent Reinforcement Learning in Large Scale Road-Networks. IET Intelligent Transport Systems, 17, 1715-1728.
[18] Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, 4-9 December 2017, 5998-6008.
[19] Pérolat, J., Strub, F., Piot, B., et al. (2017) Learning Nash Equilibrium for General-Sum Markov Games from Batch Data. Artificial Intelligence and Statistics, PMLR, Fort Lauderdale, 20-22 April 2017, 232-241.
[20] Lillicrap, T.P., Hunt, J.J., Pritzel, A., et al. (2015) Continuous Control with Deep Reinforcement Learning.
[21] Schulman, J., Wolski, F., Dhariwal, P., et al. (2017) Proximal Policy Optimization Algorithms.
[22] Prabuchandran, K.J., An, H.K. and Bhatnagar, S. (2014) Multi-Agent Reinforcement Learning for Traffic Signal Control. 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), Qingdao, 8-11 October 2014, 2529-2534.