
You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://beta.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
A Q-Learning-Based Approximate Solving Algorithm for Vehicular Route Game

doi: 10.3390/su141912033
Route game is recognized as an effective method to alleviate Braess’ paradox, which generates a new traffic congestion since numerous vehicles obey the same guidance from the selfish route guidance (such as Google Maps). The conventional route games have symmetry since vehicles’ payoffs depend only on the selected route distribution but not who chose, which leads to the precise Nash equilibrium being able to be solved by constructing a special potential function. However, with the arrival of smart cities, the real-time of route schemes is more of a concerned of engineers than the absolute optimality in real traffic. It is not an easy task to re-construct the new potential functions of the route games due to the dynamic traffic conditions. In this paper, compared with the hard-solvable potential function-based precise method, a matched Q-learning algorithm is designed to generate the approximate Nash equilibrium of the classic route game for real-time traffic. An experimental study shows that the Nash equilibrium coefficients generated by the Q-learning-based approximate solving algorithm all converge to 1.00, and still have the required convergence in the different traffic parameters.
- South China University of Technology China (People's Republic of)
- North China University of Technology China (People's Republic of)
- South China University of Technology China (People's Republic of)
- North China University of Technology China (People's Republic of)
- Shanghai Maritime University China (People's Republic of)
Environmental effects of industries and plants, TJ807-830, Braess’ paradox, approximate Nash equilibrium, TD194-195, Renewable energy sources, traffic congestion; Braess’ paradox; route game; Q-learning; approximate Nash equilibrium, Environmental sciences, route game, Q-learning, traffic congestion, GE1-350
Environmental effects of industries and plants, TJ807-830, Braess’ paradox, approximate Nash equilibrium, TD194-195, Renewable energy sources, traffic congestion; Braess’ paradox; route game; Q-learning; approximate Nash equilibrium, Environmental sciences, route game, Q-learning, traffic congestion, GE1-350
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).2 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average
