chopin
automatically distributes geospatial data
computation over multiple threads.chopin
workflowpar_pad_*
functions to par_grid
,
or running par_hierarchy
, or par_multirasters
functions at once.temp_dir <- tempdir(check = TRUE)
url_nccnty <-
paste0(
"https://raw.githubusercontent.com/",
"ropensci/chopin/refs/heads/main/",
"tests/testdata/nc_hierarchy.gpkg"
)
url_ncelev <-
paste0(
"https://raw.githubusercontent.com/",
"ropensci/chopin/refs/heads/main/",
"tests/testdata/nc_srtm15_otm.tif"
)
nccnty_path <- file.path(temp_dir, "nc_hierarchy.gpkg")
ncelev_path <- file.path(temp_dir, "nc_srtm15_otm.tif")
# download data
download.file(url_nccnty, nccnty_path, mode = "wb", quiet = TRUE)
download.file(url_ncelev, ncelev_path, mode = "wb", quiet = TRUE)
nccnty <- terra::vect(nccnty_path)
ncelev <- terra::rast(ncelev_path)
ncgrid <- par_pad_grid(ncsamp, mode = "grid", nx = 4L, ny = 2L, padding = 10000)
plot(ncgrid$original)
par_*
functions operate on
future
backends, users should define the future plan before
running the functions. multicore
plan supports
terra
objects which may lead to faster computation, but it
is not supported in Windows. An alternative is
future.mirai
’s mirai_multisession
plan, which
is supported in many platforms and generally faster than plain future
multisession plan.workers
argument should be defined with an integer
value to specify the number of threads to be used.extract_at
runs on the grid
polygons.## Reading layer `county' from data source `/tmp/Rtmpe2LDGA/nc_hierarchy.gpkg' using driver `GPKG'
## Simple feature collection with 100 features and 1 field
## Geometry type: POLYGON
## Dimension: XY
## Bounding box: xmin: 1054155 ymin: 1341756 xmax: 1838923 ymax: 1690176
## Projected CRS: NAD83 / Conus Albers
## Reading layer `tracts' from data source `/tmp/Rtmpe2LDGA/nc_hierarchy.gpkg' using driver `GPKG'
## Simple feature collection with 2672 features and 1 field
## Geometry type: MULTIPOLYGON
## Dimension: XY
## Bounding box: xmin: 1054155 ymin: 1341756 xmax: 1838923 ymax: 1690176
## Projected CRS: NAD83 / Conus Albers
px <-
par_hierarchy(
# from here the par_hierarchy-specific arguments
regions = nctrct,
regions_id = "GEOID",
length_left = 5,
pad = 10000,
pad_y = FALSE,
.debug = TRUE,
# from here are the dispatched function definition
# for parallel workers
fun_dist = extract_at,
# below should follow the arguments of the dispatched function
x = ncelev,
y = sf::st_as_sf(ncsamp),
id = "pid",
radius = 1e4,
func = "mean"
)
dim(px)
## [1] 10000 2
## pid mean
## 1 5 8.0452728
## 2 48 12.6830435
## 3 61 0.4792324
## 4 108 -4.9482679
## 5 146 17.3203983
## 6 287 -1.1155452
## pid mean
## 9995 9254 4.179764
## 9996 9374 -3.915423
## 9997 9455 6.095058
## 9998 9678 4.071713
## 9999 9688 5.385324
## 10000 9843 -1.530116
ncelev <- terra::rast(ncelev_path)
tdir <- tempdir(check = TRUE)
terra::writeRaster(ncelev, file.path(tdir, "test1.tif"), overwrite = TRUE)
terra::writeRaster(ncelev, file.path(tdir, "test2.tif"), overwrite = TRUE)
terra::writeRaster(ncelev, file.path(tdir, "test3.tif"), overwrite = TRUE)
terra::writeRaster(ncelev, file.path(tdir, "test4.tif"), overwrite = TRUE)
terra::writeRaster(ncelev, file.path(tdir, "test5.tif"), overwrite = TRUE)
rasts <- list.files(tdir, pattern = "tif$", full.names = TRUE)
pm <-
par_multirasters(
filenames = rasts,
fun_dist = extract_at,
x = NA,
y = sf::st_as_sf(ncsamp)[1:500, ],
id = "pid",
radius = 1e4,
func = "mean",
.debug = TRUE
)
dim(pm)
## [1] 3000 2
## mean base_raster
## 1 1202.806152 /tmp/Rtmpe2LDGA/nc_srtm15_otm.tif
## 2 301.077057 /tmp/Rtmpe2LDGA/nc_srtm15_otm.tif
## 3 39.768700 /tmp/Rtmpe2LDGA/nc_srtm15_otm.tif
## 4 813.464050 /tmp/Rtmpe2LDGA/nc_srtm15_otm.tif
## 5 8.045273 /tmp/Rtmpe2LDGA/nc_srtm15_otm.tif
## 6 220.443192 /tmp/Rtmpe2LDGA/nc_srtm15_otm.tif
## mean base_raster
## 2995 285.62518 /tmp/Rtmpe2LDGA/test5.tif
## 2996 950.98322 /tmp/Rtmpe2LDGA/test5.tif
## 2997 150.44771 /tmp/Rtmpe2LDGA/test5.tif
## 2998 265.12933 /tmp/Rtmpe2LDGA/test5.tif
## 2999 22.61705 /tmp/Rtmpe2LDGA/test5.tif
## 3000 86.87681 /tmp/Rtmpe2LDGA/test5.tif
chopin
works best with two-dimensional
(planar) geometries. Users should disable
s2
spherical geometry mode in sf
by setting
sf::sf_use_s2(FALSE)
. Running any chopin
functions at spherical or three-dimensional (e.g., including M/Z
dimensions) geometries may produce incorrect or unexpected results.