
You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://beta.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
On-Line Building Energy Optimization Using Deep Reinforcement Learning

arXiv: 1707.05878
Unprecedented high volumes of data are becoming available with the growth of the advanced metering infrastructure. These are expected to benefit planning and operation of the future power system, and to help the customers transition from a passive to an active role. In this paper, we explore for the first time in the smart grid context the benefits of using Deep Reinforcement Learning, a hybrid type of methods that combines Reinforcement Learning with Deep Learning, to perform on-line optimization of schedules for building energy management systems. The learning procedure was explored using two methods, Deep Q-learning and Deep Policy Gradient, both of them being extended to perform multiple actions simultaneously. The proposed approach was validated on the large-scale Pecan Street Inc. database. This highly-dimensional database includes information about photovoltaic power generation, electric vehicles as well as buildings appliances. Moreover, these on-line energy scheduling strategies could be used to provide real-time feedback to consumers to encourage more efficient use of electricity.
- Eindhoven University of Technology Netherlands
- University of Derby United Kingdom
- University of Derby United Kingdom
- The University of Texas at Austin United States
FOS: Computer and information sciences, Learning (artificial intelligence), Computer Science - Machine Learning, 004 Data processing & computer science, Strategic Optimization., Internet of Things, Information visualisation, Machine Learning (cs.LG), Automation, 000472577500018, Centre for Distributed Computing, Networking and Security, Buildings, Smart Grid, smart grid, Mathematics - Optimization and Control, Deep reinforcement learning, User experience, Demand Response, deep neural networks, Health, demand response, Computer Science(all), Optimization, General Computer Science, Deep Reinforcement Learning, Computer Science - Artificial Intelligence, QA75 Electronic computers. Computer science, Information science, Machine learning, FOS: Mathematics, Software systems, Deep Neural Networks, Smart mobility, Sensors, deep neural network, Smart grids, Centre for Algorithms, Visualisation and Evolving Systems, Minimization, AI and Technologies, Energy consumption, strategic optimization, Artificial Intelligence (cs.AI), Optimization and Control (math.OC), eHealth, Networks, Smart cities
FOS: Computer and information sciences, Learning (artificial intelligence), Computer Science - Machine Learning, 004 Data processing & computer science, Strategic Optimization., Internet of Things, Information visualisation, Machine Learning (cs.LG), Automation, 000472577500018, Centre for Distributed Computing, Networking and Security, Buildings, Smart Grid, smart grid, Mathematics - Optimization and Control, Deep reinforcement learning, User experience, Demand Response, deep neural networks, Health, demand response, Computer Science(all), Optimization, General Computer Science, Deep Reinforcement Learning, Computer Science - Artificial Intelligence, QA75 Electronic computers. Computer science, Information science, Machine learning, FOS: Mathematics, Software systems, Deep Neural Networks, Smart mobility, Sensors, deep neural network, Smart grids, Centre for Algorithms, Visualisation and Evolving Systems, Minimization, AI and Technologies, Energy consumption, strategic optimization, Artificial Intelligence (cs.AI), Optimization and Control (math.OC), eHealth, Networks, Smart cities
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).347 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 0.1% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 1% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 0.1%
