Download - QA/QC for ecological data: tips & cheat codes

Transcript
Page 1: QA/QC for ecological data: tips & cheat codes

dead data tell no tales

@cjlortie

Page 2: QA/QC for ecological data: tips & cheat codes

you will have to reuse your data

planning promotes reproducibility

Page 3: QA/QC for ecological data: tips & cheat codes

https://dmptool.org to begin your game/journey

try a data management planning tool

Michener & Jones 2012

Page 4: QA/QC for ecological data: tips & cheat codes

there is no perfect experiment

Ruxton & Colegrave 2016

Page 5: QA/QC for ecological data: tips & cheat codes

there are no perfect data

data vary in class and structure

Page 6: QA/QC for ecological data: tips & cheat codes

QA/QC

Cai & Zhu 2015

Page 7: QA/QC for ecological data: tips & cheat codes

no one set of criterianeed fit all ecological data

but practical principlescan be used as a guide

Pipino et al. 2002

QA/QC

Page 8: QA/QC for ecological data: tips & cheat codes

a practical guide to QA/QC for ecological data

increasingly adopt #rstats & #tidyverse workflows

Page 9: QA/QC for ecological data: tips & cheat codes

Tip #1. Pilot data & meta-data

build tidy data & do data by design

rnorm(n = 10, mean = 39.74, sd = 25.09)

Page 10: QA/QC for ecological data: tips & cheat codes

Tip #2. Use social coding for QA/QC

(at least) two-player mode

Page 11: QA/QC for ecological data: tips & cheat codes

Tip #3. Check #rstats for data tools

there is a package for that (at least two)i.e. like ‘cheat codes’ to get you there sooner

Page 12: QA/QC for ecological data: tips & cheat codes

Maia et al. 2013

pavo

Page 13: QA/QC for ecological data: tips & cheat codes

biogeo for occurrence data

Robertson et al. 2016

Page 14: QA/QC for ecological data: tips & cheat codes

codyn for community dynamic metrics with taxize to check names

codyn::check_multispp(), check_names(), check_sppvar()taxize::gnr_resolve()

Page 15: QA/QC for ecological data: tips & cheat codes

use R Markdown + GitHub for versioned reviews & data cleaning

Tip #4. Version & annotate your data cleaning

Page 16: QA/QC for ecological data: tips & cheat codes

Tip #5. Check classes of vectors/variables

str(), unique(), nrow(), tibble()

Page 17: QA/QC for ecological data: tips & cheat codes

Tip #6. Decide what is a true zero

Martin et al. 2005

is.na(), data[!is.na(data$x), ]

Page 18: QA/QC for ecological data: tips & cheat codes

Tip #7. Pre-print your data

publish sooner

Page 19: QA/QC for ecological data: tips & cheat codes

the reproducibility crisis in science needs to end. today.

avoid a ‘game-over’ effect before the reuse even begins.

Page 20: QA/QC for ecological data: tips & cheat codes

better data. better reproducibility.

nom nom