
Evaluating observed versus predicted forest biomass: R-squared, index of agreement or maximal information coefficient?

The accurate prediction of forest above-ground biomass is key to implementing climate change mitigation policies, such as reducing emissions from deforestation and forest degradation. In this context, the coefficient of determination ($${R^2}$$) is widely used to evaluate the proportion of variance in the dependent variable explained by a model. However, the validity of $${R^2}$$ for comparing observed versus predicted values has been challenged in the presence of bias, for instance in remote sensing predictions of forest biomass. We tested two proposed alternatives: the index of agreement ($$d$$) and the maximal information coefficient ($$MIC$$). Our results show that $$d$$ yields systematically higher values than $${R^2}$$, and may easily lead one to regard as reliable models that include an unrealistic number of predictors. Results seemed better for $$MIC$$, although $$MIC$$ favoured local clustering of predictions, whether or not they corresponded to the observations. Moreover, $${R^2}$$ was more sensitive to the use of cross-validation than $$d$$ or $$MIC$$, and more robust against overfitted models. Therefore, we discourage the use of statistical measures alternative to $${R^2}$$ for evaluating model predictions versus observed values, at least in the context of assessing the reliability of modelled biomass predictions using remote sensing. For those who consider $$d$$ to be conceptually superior to $${R^2}$$, we suggest using its square, $${d^2}$$, which is more analogous to $${R^2}$$ and hence facilitates comparison across studies.
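As an illustration of how $${R^2}$$ and $$d$$ can diverge on the same observed-versus-predicted data, the following minimal sketch computes both. It assumes $${R^2}$$ is taken as one minus the ratio of the residual to the total sum of squares, and $$d$$ as Willmott's original index of agreement; the abstract does not state the exact formulations used in the study, so both definitions are assumptions here. The biomass values are made up for illustration only, and $$MIC$$ is left out because it requires a dedicated estimation library.

```python
from statistics import mean


def r_squared(observed, predicted):
    """R^2 as 1 - SSE/SST, measured against the 1:1 line (assumed definition)."""
    o_bar = mean(observed)
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    sst = sum((o - o_bar) ** 2 for o in observed)
    return 1.0 - sse / sst


def index_of_agreement(observed, predicted):
    """Willmott's index of agreement d (assumed definition)."""
    o_bar = mean(observed)
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    # "Potential error": deviations of predictions and observations
    # from the observed mean, summed in absolute value before squaring.
    potential = sum(
        (abs(p - o_bar) + abs(o - o_bar)) ** 2
        for o, p in zip(observed, predicted)
    )
    return 1.0 - sse / potential


# Hypothetical above-ground biomass values (Mg/ha), for illustration only.
observed = [50.0, 120.0, 200.0, 310.0, 420.0]
predicted = [80.0, 150.0, 220.0, 300.0, 380.0]

r2 = r_squared(observed, predicted)
d = index_of_agreement(observed, predicted)

print(f"R^2 = {r2:.4f}")
print(f"d   = {d:.4f}")
print(f"d^2 = {d ** 2:.4f}")
```

Under these two definitions the denominator of $$d$$ is never smaller than the total sum of squares used by $${R^2}$$, so $$d$$ cannot fall below $${R^2}$$ on the same data, which is in line with the systematically higher values reported above.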
- Goddard Space Flight Center United States
- Universidade de São Paulo Brazil
- University of Cambridge United Kingdom
- Bangor University United Kingdom
- BioScience Laboratories (United States) United States
QE1-996.5, overfitting, biomass, model assessment, Geology, GC1-1581, Oceanography, lidar
Citations: 25; popularity: Top 10%; influence: Top 10%; impulse: Top 10%
