Critical Difference Diagram (all). Average ranking of training sets (lower is better). Training sets connected by a line are not statistically significantly different from one another.
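To make the caption concrete, here is a minimal sketch of how the quantities behind such a diagram are often computed. It assumes the Nemenyi post-hoc test that commonly accompanies critical difference diagrams. The source does not state which test the authors used, and the scores, column order, and function names below are hypothetical and for illustration only.

```python
import math

# Nemenyi critical values q_alpha at alpha = 0.05, indexed by the number of
# compared methods k (values as tabulated in Demsar, 2006).
Q_ALPHA_05 = {2: 1.960, 3: 2.343, 4: 2.569, 5: 2.728, 6: 2.850,
              7: 2.949, 8: 3.031, 9: 3.102, 10: 3.164}


def rank_row(scores):
    """Rank one row of scores: highest score gets rank 1, ties share the mean rank."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of entries tied with the entry at position i.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for pos in range(i, j + 1):
            ranks[order[pos]] = mean_rank
        i = j + 1
    return ranks


def average_ranks(score_table):
    """Average the per-row ranks of each column (one column per training set)."""
    rows = [rank_row(row) for row in score_table]
    k = len(rows[0])
    return [sum(row[c] for row in rows) / len(rows) for c in range(k)]


def critical_difference(k, n):
    """Nemenyi critical difference for k methods compared over n evaluations."""
    return Q_ALPHA_05[k] * math.sqrt(k * (k + 1) / (6.0 * n))


def connected_pairs(avg_ranks, cd):
    """Pairs whose average ranks differ by less than cd (drawn as connected)."""
    k = len(avg_ranks)
    return [(a, b) for a in range(k) for b in range(a + 1, k)
            if abs(avg_ranks[a] - avg_ranks[b]) < cd]


# Hypothetical scores: 8 evaluations (rows) x 3 training sets (columns).
scores = [
    [0.80, 0.78, 0.60],
    [0.75, 0.76, 0.55],
    [0.90, 0.85, 0.70],
    [0.81, 0.83, 0.65],
    [0.77, 0.74, 0.58],
    [0.69, 0.72, 0.50],
    [0.88, 0.86, 0.71],
    [0.79, 0.84, 0.62],
]
avg = average_ranks(scores)
cd = critical_difference(k=3, n=len(scores))
pairs = connected_pairs(avg, cd)
print(avg, cd, pairs)
```

In this made-up example the first two training sets would be joined by a line, since their average ranks differ by less than the critical difference, while the third would stand apart. In practice an omnibus test such as Friedman's is usually run before any pairwise comparison.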

Source publication
Preprint
In this paper we implement and compare 7 different data augmentation strategies for the task of automatic scoring of children's ability to understand others' thoughts, feelings, and desires (or "mindreading"). We recruit in-domain experts to re-annotate augmented samples and determine to what extent each strategy preserves the original rating. We a...