Conservationists increasingly use unstructured observational data, such as citizen science records or ranger patrol observations, to guide decision making. These datasets are often large and relatively cheap to collect, and they have enormous potential. However, the resulting data are generally “messy,” and their use can incur considerable costs, some of which are hidden. We present an overview of the opportunities and limitations associated with messy data by explaining how the preferences, skills, and incentives of data collectors affect the quality of the information the data contain and the investment required to unlock their potential. Drawing widely from across the sciences, we break down elements of the observation process to highlight likely sources of bias and error while emphasizing the importance of cross-disciplinary collaboration. We propose a framework for appraising messy data to guide those engaging with these types of dataset and to help make such data work for conservation and broader sustainability applications.
- Citizen science
- Crowd sensing
- Observation process
- Unstructured observational data
- Volunteer data