
Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice

Electric water heaters have the ability to store energy in their water buffer without impacting the comfort of the end user. This feature makes them a prime candidate for residential demand response. However, the stochastic and nonlinear dynamics of electric water heaters make it challenging to harness their flexibility. Driven by this challenge, this paper formulates the underlying sequential decision-making problem as a Markov decision process and uses techniques from reinforcement learning. Specifically, we apply an auto-encoder network to find a compact feature representation of the sensor measurements, which helps to mitigate the curse of dimensionality. A well-known batch reinforcement learning technique, fitted Q-iteration, is used to find a control policy given this feature representation. In a simulation-based experiment using an electric water heater with 50 temperature sensors, the proposed method was able to achieve good policies much faster than when using the full state information. In a lab experiment, we apply fitted Q-iteration to an electric water heater with eight temperature sensors; further reducing the state vector did not improve the results of fitted Q-iteration. The results of the lab experiment, spanning 40 days, indicate that compared to a thermostat controller, the presented approach was able to reduce the total cost of energy consumption of the electric water heater by 15%.
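As a rough illustration of the approach described in the abstract, the sketch below combines a shallow auto-encoder (here an sklearn MLPRegressor trained to reconstruct the temperature-sensor readings, with its hidden layer used as the compact feature vector) with fitted Q-iteration using extremely randomized trees as the Q-function regressor. All function names, hyper-parameters, the discount factor, and the binary on/off action set are assumptions made for this example; they are not taken from the paper.

```python
"""Illustrative sketch: auto-encoder state compression + fitted Q-iteration.

All names, dimensions, and hyper-parameters below are assumptions made for
the sake of the example, not the authors' implementation.
"""
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor
from sklearn.neural_network import MLPRegressor

GAMMA = 0.95                      # discount factor (assumed)
ACTIONS = np.array([0.0, 1.0])    # heating element off / on (assumed)


def train_encoder(sensor_data, n_features=8):
    """Train a shallow auto-encoder on raw temperature-sensor vectors and
    return a function mapping a sensor vector to its compact embedding."""
    ae = MLPRegressor(hidden_layer_sizes=(n_features,), activation="tanh",
                      max_iter=2000)
    ae.fit(sensor_data, sensor_data)          # learn to reconstruct the input
    w, b = ae.coefs_[0], ae.intercepts_[0]    # first (encoding) layer weights

    def encode(x):
        return np.tanh(x @ w + b)             # hidden-layer activations
    return encode


def fitted_q_iteration(batch, n_iterations=50):
    """batch: list of transitions (state, action, cost, next_state), where
    states are compact feature vectors (e.g. the encoder output augmented
    with exogenous information such as the electricity price and time)."""
    states = np.array([s for s, a, c, s2 in batch])
    actions = np.array([[a] for s, a, c, s2 in batch])
    costs = np.array([c for s, a, c, s2 in batch])
    next_states = np.array([s2 for s, a, c, s2 in batch])

    X = np.hstack([states, actions])          # regression inputs (s, a)
    q_model = None
    for _ in range(n_iterations):
        if q_model is None:
            targets = costs                   # Q_1(s, a) = immediate cost
        else:
            # Q_{N+1}(s, a) = c + gamma * min_a' Q_N(s', a')  (cost minimisation)
            q_next = np.column_stack([
                q_model.predict(np.hstack([next_states,
                                           np.full((len(costs), 1), a)]))
                for a in ACTIONS])
            targets = costs + GAMMA * q_next.min(axis=1)
        q_model = ExtraTreesRegressor(n_estimators=50).fit(X, targets)
    return q_model


def greedy_action(q_model, state):
    """Resulting control policy: pick the action with the lowest predicted cost-to-go."""
    q_values = [q_model.predict(np.hstack([state, [a]])[None, :])[0]
                for a in ACTIONS]
    return ACTIONS[int(np.argmin(q_values))]
```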
Submitted to IEEE Transactions on Smart Grid
- KU Leuven, Belgium
- Uppsala University, Sweden
- Delft University of Technology, Netherlands
- United States Department of Energy, United States
FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (cs.LG)
Citations: 124 | Popularity: Top 1% | Influence: Top 1% | Impulse: Top 1%
