<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://beta.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Design and Improvement of SD3-Based Energy Management Strategy for a Hybrid Electric Urban Bus

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 13 Aug 2022Publisher:MDPI AGJournal:Energies, volume 15, page 5,878 (eissn: 1996-1073,

Authors: Kunyu Wang; Rong Yang; Yongjian Zhou; Wei Huang; Song Zhang;

doi: 10.3390/en15165878

Design and Improvement of SD3-Based Energy Management Strategy for a Hybrid Electric Urban Bus

- Summary
- Subjects
- Metrics

Abstract

With the rapid development of machine learning, deep reinforcement learning (DRL) algorithms have recently been widely used for energy management in hybrid electric urban buses (HEUBs). However, the current DRL-based strategies suffer from insufficient constraint capability, slow learning speed, and unstable convergence. In this study, a state-of-the-art continuous control DRL algorithm, softmax deep double deterministic policy gradients (SD3), is used to develop the energy management system of a power-split HEUB. In particular, an action masking (AM) technique that does not alter the SD3′s underlying principles is proposed to prevent the SD3-based strategy from outputting invalid actions that violate the system’s physical constraints. Additionally, the transfer learning (TL) method of the SD3-based strategy is explored to avoid repetitive training of neural networks in different driving cycles. The results demonstrate that the learning performance and learning stability of SD3 are unaffected by AM and that SD3 with AM achieves control performance that is highly comparable to dynamic planning for both the CHTC-B and WVUCITY driving cycles. Aside from that, TL contributes to the rapid development of SD3. TL can speed up SD3’s convergence by at least 67.61% without significantly affecting fuel economy.

Related Organizations

Guangxi University
China (People's Republic of)
Guangxi University
China (People's Republic of)

Keywords

Technology, hybrid electric urban bus; energy management strategy; deep reinforcement learning; action masking, deep reinforcement learning, hybrid electric urban bus, T, energy management strategy, action masking

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average