
Text as big data: Develop codes of practice for rigorous computational text analysis in energy social science

Augmenting traditional social science methods with computational analysis is crucial if we are to exploit the vast digital archives of text data that have become available over the past two decades. In this journal, Benites-Lazaro et al. (2018) showcase this by applying topic modeling and other computational methods to an actor-specific examination of changes in policy discourse on ethanol in Brazil, and they point out methodological promises and challenges. However, their contribution also highlights the need to establish codes of practice for computational text analysis. In this perspective, we discuss five areas for improvement when treating text as big data, in light of guiding principles from computational research – transparency, reproducibility and validation – to facilitate rigorous research practice: (1) full transparency about data collection and corpus construction, (2) comprehensive method descriptions that enable reproducibility by other researchers, (3) application of rigorous model validation procedures, (4) interpretation of results based on the primary text and a clear research design and (5) critical discussion and contextualization of main findings. We conclude that the energy social science community needs to develop codes of practice to build on the promising research within the field of computational text analysis, and we suggest first steps in this direction.
- Potsdam-Institut für Klimafolgenforschung (Potsdam Institute for Climate Impact Research) Germany
- Leibniz Association Germany
- University of Leeds United Kingdom
- Potsdam-Institut für Klimafolgenforschung (Potsdam Institute for Climate Impact Research) Germany
- Mercator Research Institute on Global Commons and Climate Change Germany
Computational Linguistics, Communication, Environmental Studies, Linguistics, Social and Behavioral Sciences, 004
- Citations: 22
- Popularity: Top 10%
- Influence: Top 10%
- Impulse: Top 10%
