
Duck Curve Aware Dynamic Pricing and Battery Scheduling Strategy Using Reinforcement Learning

Watari D., Taniguchi I., Onoye T. Duck Curve Aware Dynamic Pricing and Battery Scheduling Strategy Using Reinforcement Learning. IEEE Transactions on Smart Grid (2023); https://doi.org/10.1109/TSG.2023.3288355.

The duck curve is becoming a global problem in energy technology due to the rapid increase in solar power adoption and the rise of prosumers. To address this issue, a resource aggregator (RA) has emerged to provide flexible solutions through aggregating the prosumers and demand response such as dynamic pricing. This paper proposes an optimal strategy for the RA that dispatches dynamic pricing to the prosumers and leverages the battery system at both RA and prosumer levels. The proposed method is based on a model-free deep reinforcement learning (DRL) algorithm to optimize each prosumer’s retail prices and schedule usage of the RA’s battery power station. An objective reward function is used to maximize the RA’s profit, minimize the prosumer’s cost, and maximize the improvement of the duck curve. The performance of the proposed DRL-based strategy was demonstrated by simulation experiments using actual wholesale price, demand, and PV generation data. The results show that the proposed strategy can improve the standard deviation and peak-to-average ratio of net load by up to 57.1% and 23%, respectively.
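The two reported metrics, standard deviation and peak-to-average ratio (PAR) of net load, can be illustrated with a minimal sketch. It assumes the usual definitions (net load as demand minus PV generation, PAR as peak divided by mean, improvement as relative reduction); the function names and the sample profiles are hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def net_load(demand, pv):
    """Net load per time step: demand minus PV generation (assumed definition)."""
    return [d - g for d, g in zip(demand, pv)]


def peak_to_average_ratio(load):
    """Peak-to-average ratio: maximum load divided by mean load."""
    return max(load) / mean(load)


def improvement(before, after):
    """Relative reduction of a metric, in percent."""
    return (before - after) / before * 100.0


if __name__ == "__main__":
    # Hypothetical six-step profiles in kW, for illustration only.
    demand = [5.0, 5.0, 6.0, 6.0, 8.0, 9.0]
    pv = [0.0, 2.0, 4.0, 3.0, 1.0, 0.0]
    # Hypothetical net load after a pricing and battery strategy flattens it.
    flattened = [4.5, 4.0, 3.5, 4.0, 6.0, 6.0]

    baseline = net_load(demand, pv)
    print("std improvement (%):",
          improvement(pstdev(baseline), pstdev(flattened)))
    print("PAR improvement (%):",
          improvement(peak_to_average_ratio(baseline),
                      peak_to_average_ratio(flattened)))
```

A flatter net load lowers both metrics, which is why they serve as proxies for how much the duck curve has been mitigated.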
- Osaka University Japan
Deep reinforcement learning, Demand response, Dynamic pricing, 330, State of charge, Duck curve, Vehicle dynamics, Costs, Batteries, Heuristic algorithms, Battery scheduling, Pricing, Prosumer, Power generation
Citations: 8. Popularity: Average. Influence: Average. Impulse: Top 10%.
