CRAN/E | git2rdata

git2rdata

Store and Retrieve Data.frames in a Git Repository

Installation

About

The git2rdata package is an R package for writing and reading dataframes as plain text files. A metadata file stores important information. 1) Storing metadata allows to maintain the classes of variables. By default, git2rdata optimizes the data for file storage. The optimization is most effective on data containing factors. The optimization makes the data less human readable. The user can turn this off when they prefer a human readable format over smaller files. Details on the implementation are available in vignette("plain_text", package = "git2rdata"). 2) Storing metadata also allows smaller row based diffs between two consecutive commits. This is a useful feature when storing data as plain text files under version control. Details on this part of the implementation are available in vignette("version_control", package = "git2rdata"). Although we envisioned git2rdata with a git workflow in mind, you can use it in combination with other version control systems like subversion or mercurial. 3) git2rdata is a useful tool in a reproducible and traceable workflow. vignette("workflow", package = "git2rdata") gives a toy example. 4) vignette("efficiency", package = "git2rdata") provides some insight into the efficiency of file storage, git repository size and speed for writing and reading.

Citation git2rdata citation info
ropensci.github.io/git2rdata/
github.com/ropensci/git2rdata/
Bug report File report

Key Metrics

Version 0.4.0
R ≥ 3.5.0
Published 2022-03-17 770 days ago
Needs compilation? no
License GPL-3
CRAN checks git2rdata results
Language eng

Downloads

Yesterday 13 0%
Last 7 days 52 -38%
Last 30 days 306 -12%
Last 90 days 984 -23%
Last 365 days 4.055 -8%

Maintainer

Maintainer

Thierry Onkelinx

thierry.onkelinx@inbo.be

Authors

Thierry Onkelinx

aut / cre

Floris Vanderhaeghe

ctb

Peter Desmet

ctb

Els Lommelen

ctb

Research Institute for Nature
Forest

cph / fnd

Material

README
NEWS
Reference manual
Package source

Vignettes

Efficiency Relative to Storage and Time
Getting Started Storing Dataframes as Plain Text
Storing Large Dataframes
Optimizing Storage for Version Control
Suggested Workflow for Storing a Variable Set of Dataframes under Version Control

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-devel

x86_64

r-release

x86_64

r-oldrel

x86_64

Old Sources

git2rdata archive

Depends

R ≥ 3.5.0

Imports

assertthat
git2r ≥ 0.23.0
methods
yaml

Suggests

ggplot2
knitr
microbenchmark
rmarkdown
spelling
testthat