Package: CoordinateCleaner 3.0.1

Alexander Zizka

CoordinateCleaner: Automated Cleaning of Occurrence Records from Biological Collections

Automated flagging of common spatial and temporal errors in biological and paleontological collection data, for the use in conservation, ecology and paleontology. Includes automated tests to easily flag (and exclude) records assigned to country or province centroid, the open ocean, the headquarters of the Global Biodiversity Information Facility, urban areas or the location of biodiversity institutions (museums, zoos, botanical gardens, universities). Furthermore identifies per species outlier coordinates, zero coordinates, identical latitude/longitude and invalid coordinates. Also implements an algorithm to identify data sets with a significant proportion of rounded coordinates. Especially suited for large data sets. The reference for the methodology is: Zizka et al. (2019) <doi:10.1111/2041-210X.13152>.

Authors:Alexander Zizka [aut, cre], Daniele Silvestro [ctb], Tobias Andermann [ctb], Josue Azevedo [ctb], Camila Duarte Ritter [ctb], Daniel Edler [ctb], Harith Farooq [ctb], Andrei Herdean [ctb], Maria Ariza [ctb], Ruud Scharn [ctb], Sten Svanteson [ctb], Niklas Wengstrom [ctb], Vera Zizka [ctb], Alexandre Antonelli [ctb], Bruno Vilela [ctb], Irene Steves [rev], Francisco Rodriguez-Sanchez [rev]

CoordinateCleaner_3.0.1.tar.gz
CoordinateCleaner_3.0.1.zip(r-4.7)CoordinateCleaner_3.0.1.zip(r-4.6)CoordinateCleaner_3.0.1.zip(r-4.5)
CoordinateCleaner_3.0.1.tgz(r-4.6-any)CoordinateCleaner_3.0.1.tgz(r-4.5-any)
CoordinateCleaner_3.0.1.tar.gz(r-4.7-any)CoordinateCleaner_3.0.1.tar.gz(r-4.6-any)
CoordinateCleaner_3.0.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION |NEWS
card.svg |card.png
CoordinateCleaner/json (API)

# Install 'CoordinateCleaner' in R:
install.packages('CoordinateCleaner', repos = c('https://packages.ropensci.org', 'https://cloud.r-project.org'))

Reviews:rOpenSci Software Review #210

Bug tracker:https://github.com/ropensci/coordinatecleaner/issues

Pkgdown/docs site:https://ropensci.github.io

Datasets:
  • aohi - Artificial Hotspot Occurrence Inventory
  • buffland - Global Coastlines buffered by 1 degree
  • buffsea - Global Coastlines buffered by -1 degree
  • countryref - Country Centroids and Country Capitals
  • institutions - Global Locations of Biodiversity Institutions
  • pbdb_example - Example data from the Paleobiologydatabase

On CRAN:

Conda:

11.43 score 87 stars 4 packages 574 scripts 2.0k downloads 10 mentions 25 exports 60 dependencies

Last updated from:9195fb6415 (on master). Checks:10 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK241
pkgdown docsOK207
source / vignettesOK240
linux-release-x86_64OK242
macos-release-arm64OK176
macos-oldrel-arm64OK140
windows-develOK206
windows-releaseOK227
windows-oldrelOK176
wasm-releaseOK181

Exports:cc_aohicc_capcc_cencc_councc_duplcc_equcc_gbifcc_instcc_iucncc_outlcc_seacc_urbcc_valcc_zerocd_ddmmcd_roundcf_agecf_equalcf_outlcf_rangeclean_coordinatesclean_datasetclean_fossilsis.spatialvalidwrite_pyrate

Dependencies:askpassclassclassIntclicpp11crulcurldata.tableDBIdplyre1071farvergenericsgeosphereggplot2gluegtablehttpcodehttrisobandjsonliteKernSmoothlabelinglazyevallifecyclemagrittrMASSmimeoaiopensslpillarpkgconfigplyrproxyR6RColorBrewerRcpprgbifrlangrnaturalearths2S7scalessfstringistringrsysterratibbletidyselecttriebeardunitsurltoolsutf8vctrsviridisLitewhiskerwithrwkxml2

Cleaning GBIF data for the use in biogeography
Background | Identifying erroneous coordinates with CoordinateCleaner | Install CoordinateCleaner | Set up libraries and data | Visualize the data on a map | Use CoordinateCleaner to automatically flag problematic records | Option A) Using the clean_coordinates wrapper function | Option B) Using the magrittr pipe (%>%) | Temporal outliers | Improving data quality using GBIF meta-data | Improving data quality using external information | Flag records based on fixed longitude and latitude | Flag records based on species natural ranges | Identifying problematic data sets | Identify dataset with ddmm to dd.dd conversion error | Test for rasterized sampling | References

Last update: 2023-10-24
Started: 2018-07-16

Comparison of CoordinateCleaner to other tools

Last update: 2023-09-30
Started: 2020-10-09

Cleaning fossil data for the use in biogeography and palaeontology
Background | Install CoordinateCleaner | Load required libraries | Load test dataset | Visualize the records on a map | CoordinateCleaner | Spatial issues | Temporal issues | Improving data quality using meta-data | Basic taxonomy | Spatial coordinates | Time | Conclusions

Last update: 2023-08-31
Started: 2018-06-05

Using customized gazetteers

Last update: 2023-08-31
Started: 2020-08-18

Readme and manuals

Help Manual

Help pageTopics
Artificial Hotspot Occurrence Inventoryaohi
Global Coastlines buffered by 1 degreebuffland
Global Coastlines buffered by -1 degreebuffsea
Identify Coordinates in Artificial Hotspot Occurrence Inventorycc_aohi
Identify Coordinates in Vicinity of Country Capitals.cc_cap
Identify Coordinates in Vicinity of Country and Province Centroidscc_cen
Identify Coordinates Outside their Reported Countrycc_coun
Identify Duplicated Recordscc_dupl
Identify Records with Identical lat/loncc_equ
Identify Records Assigned to GBIF Headquarterscc_gbif
Identify Records in the Vicinity of Biodiversity Institutionscc_inst
Identify Records Outside Natural Rangescc_iucn
Identify Geographic Outliers in Species Distributionscc_outl
Identify Non-terrestrial Coordinatescc_sea
Identify Records Inside Urban Areascc_urb
Identify Invalid lat/lon Coordinatescc_val
Identify Zero Coordinatescc_zero
Identify Datasets with a Degree Conversion Errorcd_ddmm
Identify Datasets with Rasterized Coordinatescd_round
Identify Fossils with Outlier Agecf_age
Identify Fossils with equal min and max agecf_equal
Identify Outlier Records in Space and Timecf_outl
Identify Fossils with Extreme Age Rangescf_range
Geographic Cleaning of Coordinates from Biologic Collectionsclean_coordinates summary.spatialvalid
Coordinate Cleaning using Dataset Propertiesclean_dataset
Geographic and Temporal Cleaning of Records from Fossil Collectionsclean_fossils
Country Centroids and Country Capitalscountryref
Global Locations of Biodiversity Institutionsinstitutions
Check spatialvalid objectis.spatialvalid
Example data from the Paleobiologydatabasepbdb_example
Plot Method for Class Spatialvalidplot.spatialvalid
Create Input Files for PyRatewrite_pyrate