@brusilovsky

Blackbox, Five Years On: An Evaluation of a Large-scale Programming Data Collection Project

, , , and . Proceedings of the 2018 ACM Conference on International Computing Education Research, page 196--204. New York, NY, USA, ACM, (2018)
DOI: 10.1145/3230977.3230991

Abstract

The Blackbox project has been collecting programming activity data from users of BlueJ (a novice-targeted Java development environment) for nearly five years. The resulting dataset of more than two terabytes of data has been made available to interested researchers from the outset. In this paper, we assess the impact of the Blackbox project: we perform a mapping study to assess eighteen publications which have made use of the Blackbox data, and we report on the advantages and difficulties experienced by researchers working with this data, collected via a survey. We find that Blackbox has enabled pieces of research which otherwise would not have been possible, but there remain technical challenges in the analysis. Some of these -- but not all -- relate to the scale of the data. We provide suggestions for the future use of Blackbox, and reflections on the role of such data collection projects in programming research.

Description

Blackbox, Five Years On

Links and resources

Tags