<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://beta.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization

descriptionPublicationkeyboard_double_arrow_right Article , Journal 01 Aug 2022 Netherlands Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Intelligent Transportation Systems, volume 23, pages 11,562-11,574 (issn: 1524-9050, eissn: 1558-0016,

Authors: Lingbin Ning; Min Zhou; Zhuopu Hou; Rob M.P. Goverde; Fei-Yue Wang; Hairong Dong;

doi: 10.1109/tits.2021.3105380

Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization

- Summary
- Subjects
- Metrics

Abstract

This paper proposes a novel train trajectory optimization approach for high-speed railways. We restrict our attention to single train operation scenarios with different scheduled/rescheduled running times aiming at generating optimal train recommended trajectories in real time, which can ensure punctuality and energy efficiency of train operation. A learning-based approach deep deterministic policy gradient (DDPG) is designed to generate optimal train trajectories based on the offline training from the interaction between the agent and the trajectory simulation environment. An allocating running time and selecting operation modes (ARTSOM) algorithm is proposed to improve train punctuality and give a series of discrete operation modes (full traction, cruising, coasting, full braking), and thus to produce a feasible training set for DDPG, which can speed up the training process. Numerical experiments show that an optimized speed profile can be generated by DDPG within seconds on a realistic railway line. In addition, the results demonstrate the generalization ability of trained DDPG in solving TTO problems with different running times and line conditions.

Country

Netherlands

Related Organizations

Delft University of Technology
Netherlands
Beijing Jiaotong University
China (People's Republic of)
Chinese Academy of Sciences
China (People's Republic of)
Chinese Academy of Sciences
China (People's Republic of)
Beijing Jiaotong University
China (People's Republic of)

Keywords

train trajectory optimization, deep deterministic policy gradient, 380, High-speed railway, energy efficiency

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	7
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

Top 10%

Average

Top 10%

Green

bronze

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering