<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://beta.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

A Q-Learning-Based Approximate Solving Algorithm for Vehicular Route Game

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 23 Sep 2022Publisher:MDPI AGJournal:Sustainability, volume 14, page 12,033 (eissn: 2071-1050,

Authors: Le Zhang; Lijing Lyu; Shanshui Zheng; Li Ding; Lang Xu;

doi: 10.3390/su141912033

A Q-Learning-Based Approximate Solving Algorithm for Vehicular Route Game

- Summary
- Subjects
- Metrics

Abstract

Route game is recognized as an effective method to alleviate Braess’ paradox, which generates a new traffic congestion since numerous vehicles obey the same guidance from the selfish route guidance (such as Google Maps). The conventional route games have symmetry since vehicles’ payoffs depend only on the selected route distribution but not who chose, which leads to the precise Nash equilibrium being able to be solved by constructing a special potential function. However, with the arrival of smart cities, the real-time of route schemes is more of a concerned of engineers than the absolute optimality in real traffic. It is not an easy task to re-construct the new potential functions of the route games due to the dynamic traffic conditions. In this paper, compared with the hard-solvable potential function-based precise method, a matched Q-learning algorithm is designed to generate the approximate Nash equilibrium of the classic route game for real-time traffic. An experimental study shows that the Nash equilibrium coefficients generated by the Q-learning-based approximate solving algorithm all converge to 1.00, and still have the required convergence in the different traffic parameters.

Related Organizations

South China University of Technology
China (People's Republic of)
North China University of Technology
China (People's Republic of)
South China University of Technology
China (People's Republic of)
North China University of Technology
China (People's Republic of)
Shanghai Maritime University
China (People's Republic of)

View all View all

Keywords

Environmental effects of industries and plants, TJ807-830, Braess’ paradox, approximate Nash equilibrium, TD194-195, Renewable energy sources, traffic congestion; Braess’ paradox; route game; Q-learning; approximate Nash equilibrium, Environmental sciences, route game, Q-learning, traffic congestion, GE1-350

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average