Taking a 'Big Data' approach to data quality in a citizen science project

Steve Kelling*, Daniel Fink, Frank A. La Sorte, Alison Johnston, Nicholas E. Bruns, Wesley M. Hochachka

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citizen science projects can gather sufficient quantities of data, but data collected from volunteers are inherently noisy and heterogeneous. Here we describe a 'Big Data' approach to improve the data quality in eBird, a global citizen science project that gathers bird observations. First, eBird's data submission design ensures that all data meet high standards of completeness and accuracy. Second, we take a 'sensor calibration' approach to measure individual variation in eBird participant's ability to detect and identify birds. Third, we use species distribution models to fill in data gaps. Finally, we provide examples of novel analyses exploring population-level patterns in bird distributions.

Original languageEnglish
Pages (from-to)S601-S611
Number of pages11
JournalAmbio
Volume44
DOIs
Publication statusPublished - Nov 2015

Keywords

  • Biodiversity monitoring
  • Citizen science
  • eBird
  • Data quality
  • Species distribution models
  • DATA-INTENSIVE SCIENCE
  • CONSERVATION
  • MODELS
  • EBIRD
  • NETWORK
  • TOOL

Fingerprint

Dive into the research topics of 'Taking a 'Big Data' approach to data quality in a citizen science project'. Together they form a unique fingerprint.

Cite this