Package: git2rdata 0.5.0

Thierry Onkelinx

git2rdata: Store and Retrieve Data.frames in a Git Repository

The git2rdata package is an R package for writing and reading dataframes as plain text files. A metadata file stores important information. 1) Storing metadata allows to maintain the classes of variables. By default, git2rdata optimizes the data for file storage. The optimization is most effective on data containing factors. The optimization makes the data less human readable. The user can turn this off when they prefer a human readable format over smaller files. Details on the implementation are available in vignette("plain_text", package = "git2rdata"). 2) Storing metadata also allows smaller row based diffs between two consecutive commits. This is a useful feature when storing data as plain text files under version control. Details on this part of the implementation are available in vignette("version_control", package = "git2rdata"). Although we envisioned git2rdata with a git workflow in mind, you can use it in combination with other version control systems like subversion or mercurial. 3) git2rdata is a useful tool in a reproducible and traceable workflow. vignette("workflow", package = "git2rdata") gives a toy example. 4) vignette("efficiency", package = "git2rdata") provides some insight into the efficiency of file storage, git repository size and speed for writing and reading.

Authors:Thierry Onkelinx [aut, cre], Floris Vanderhaeghe [ctb], Peter Desmet [ctb], Els Lommelen [ctb], Research Institute for Nature and Forest [cph, fnd]

git2rdata_0.5.0.tar.gz
git2rdata_0.5.0.zip(r-4.6)git2rdata_0.5.0.zip(r-4.5)git2rdata_0.5.0.zip(r-4.4)
git2rdata_0.5.0.tgz(r-4.5-any)git2rdata_0.5.0.tgz(r-4.4-any)
git2rdata_0.5.0.tar.gz(r-4.6-any)git2rdata_0.5.0.tar.gz(r-4.5-any)
git2rdata_0.5.0.tgz(r-4.5-emscripten)
git2rdata.pdf |git2rdata.html
git2rdata/json (API)
NEWS

# Install 'git2rdata' in R:
install.packages('git2rdata', repos = c('https://packages.ropensci.org', 'https://cloud.r-project.org'))

Reviews:rOpenSci Software Review #263

Bug tracker:https://github.com/ropensci/git2rdata/issues

Pkgdown site:https://ropensci.github.io

On CRAN:

Conda:

reproducible-researchversion-control

10.07 score 103 stars 4 packages 227 scripts 864 downloads 21 exports 3 dependencies

Last updated from:ad37387fdb (on main). Checks:10 OK. Indexed: yes.

TargetResultTotal timeArtifact
linux-devel-x86_64OK127
pkgdown docsOK180
source / vignettesOK202
linux-release-x86_64OK130
macos-release-arm64OK84
macos-oldrel-arm64OK113
windows-develOK83
windows-releaseOK80
windows-oldrelOK81
wasm-releaseOK105

Exports:commitdata_packagedisplay_metadatais_git2rdatais_git2rmetalist_datametaprune_metapullpushread_vcrecent_commitrelabelrename_variablerepositoryrm_datastatusupdate_metadataupgrade_dataverify_vcwrite_vc

Dependencies:assertthatgit2ryaml

Adding metadata

Rendered frommetadata.Rmdusingknitr::rmarkdownon Aug 27 2025.

Last update: 2024-09-06
Started: 2024-09-06

Efficiency Relative to Storage and Time

Rendered fromefficiency.Rmdusingknitr::rmarkdownon Aug 27 2025.

Last update: 2024-09-06
Started: 2019-02-26

Getting Started Storing Dataframes as Plain Text

Rendered fromplain_text.Rmdusingknitr::rmarkdownon Aug 27 2025.

Last update: 2024-09-06
Started: 2019-02-26

Optimizing Storage for Version Control

Rendered fromversion_control.Rmdusingknitr::rmarkdownon Aug 27 2025.

Last update: 2024-09-06
Started: 2019-02-26

Storing Large Dataframes

Rendered fromsplit_by.Rmdusingknitr::rmarkdownon Aug 27 2025.

Last update: 2022-03-17
Started: 2021-01-13

Suggested Workflow for Storing a Variable Set of Dataframes under Version Control

Rendered fromworkflow.Rmdusingknitr::rmarkdownon Aug 27 2025.

Last update: 2022-03-17
Started: 2019-02-26