TY - GEN
T1 - Developing a Merge Lane Change Decision Policy for Autonomous Vehicles by Deep Reinforcement Learning
AU - Fan, Bingyi
AU - Zhou, Yuhan
AU - Mahmassani, Hani
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/9/19
Y1 - 2021/9/19
N2 - With autonomous vehicles (AVs) being actively developed, it becomes possible to optimize vehicle control policies and traffic management tools in a mixed vehicular environment. For individual AV control, acceleration and lane change are the two elementary driving behaviors that need to be coordinated to minimize disturbance to the entire traffic dynamics. In this paper, a joint decision policy of acceleration and lane change actions for AVs on a merging ramp is proposed and trained in a mixed autonomy traffic, using the technique of deep reinforcement learning. Our method is able to train policies that have limited impact on highway traffic while maintaining a relatively high merge throughput. We experimented with two reward functions, designed for the AV's selfish benefits and for the network traffic's social benefits. This paper then examines the emergent behaviors exhibited by the trained policies and their impacts on the main highway traffic at different density levels.
AB - With autonomous vehicles (AVs) being actively developed, it becomes possible to optimize vehicle control policies and traffic management tools in a mixed vehicular environment. For individual AV control, acceleration and lane change are the two elementary driving behaviors that need to be coordinated to minimize disturbance to the entire traffic dynamics. In this paper, a joint decision policy of acceleration and lane change actions for AVs on a merging ramp is proposed and trained in a mixed autonomy traffic, using the technique of deep reinforcement learning. Our method is able to train policies that have limited impact on highway traffic while maintaining a relatively high merge throughput. We experimented with two reward functions, designed for the AV's selfish benefits and for the network traffic's social benefits. This paper then examines the emergent behaviors exhibited by the trained policies and their impacts on the main highway traffic at different density levels.
UR - http://www.scopus.com/inward/record.url?scp=85118448336&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118448336&partnerID=8YFLogxK
U2 - 10.1109/ITSC48978.2021.9564533
DO - 10.1109/ITSC48978.2021.9564533
M3 - Conference contribution
AN - SCOPUS:85118448336
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 963
EP - 968
BT - 2021 IEEE International Intelligent Transportation Systems Conference, ITSC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Intelligent Transportation Systems Conference, ITSC 2021
Y2 - 19 September 2021 through 22 September 2021
ER -