gbif_download()
now uses minioclient
as a backend, offering dramatically (100x+) better performance, especially on multi-core machines with high bandwidth network connections.
gbif_local()
now defaults to duckdb backend, and utilizes duckdbfs to streamline the interface. The latest performance of duckdb
is substantially better than alternatives.
Breaking changes
gbif_conn()
is deprecatedgbif_version()
). Works with local and remote sources, can also report all available versions.gbif_local()
to return a remote table instead of a connection; paralleling the use of gbif_remote()
gbif_conn()
(and thus gbif_local()
) gain the ability to use arrow as a backend to duckdb, and this is now the default. This improves performance and avoids crashes when all columns are requested.gbif_download()
and gbif_remote()
)gbif_download()
now automatically detects versions, downloads parquet files to a path that parallels the remote path (using release-specific subdirectories), and allows bucket to be configured.to_duckdb=TRUE
by default in gbif_remote()
, creating a consistent lazy-table interface with support for windowed functionsgbif_conn()
(and gbif_local()
) now automatically detect the path of most recent GBIF version in gbif_dir()
. No more need to to manually set path for occurrence.parquet/
subfolder.