Interacting with local and remote data repositories using the stashR package

Sandrah P. Eckel, Roger Peng

Research output: Contribution to journalArticle

Abstract

The stashR package (a Set of Tools for Administering Shared Repositories) for R implements a basic versioned key-value style database where character string keys are associated with data values. Using the S4 classes 'localDB' and 'remoteDB', and associated methods, versioned key-value databases can be either created locally on the user's computer or accessed remotely via the Internet. The stashR package can enhance reproducible research by providing a 'localDB' database format for the caching of computations which can subsequently be stored on the Internet. To reproduce a particular computation, a reader can access the 'remoteDB' database and obtain the associated R objects.

Original languageEnglish (US)
Pages (from-to)247-254
Number of pages8
JournalComputational Statistics
Volume24
Issue number2
DOIs
StatePublished - May 2009

Fingerprint

Repository
Internet
Caching
Strings
Data base
World Wide Web

Keywords

  • Data distribution
  • Database
  • Reproducible research
  • Version control

ASJC Scopus subject areas

  • Statistics and Probability
  • Computational Mathematics
  • Statistics, Probability and Uncertainty

Cite this

Interacting with local and remote data repositories using the stashR package. / Eckel, Sandrah P.; Peng, Roger.

In: Computational Statistics, Vol. 24, No. 2, 05.2009, p. 247-254.

Research output: Contribution to journalArticle

@article{2f25e57a2d874c60b46cf822021ab7a3,
title = "Interacting with local and remote data repositories using the stashR package",
abstract = "The stashR package (a Set of Tools for Administering Shared Repositories) for R implements a basic versioned key-value style database where character string keys are associated with data values. Using the S4 classes 'localDB' and 'remoteDB', and associated methods, versioned key-value databases can be either created locally on the user's computer or accessed remotely via the Internet. The stashR package can enhance reproducible research by providing a 'localDB' database format for the caching of computations which can subsequently be stored on the Internet. To reproduce a particular computation, a reader can access the 'remoteDB' database and obtain the associated R objects.",
keywords = "Data distribution, Database, Reproducible research, Version control",
author = "Eckel, {Sandrah P.} and Roger Peng",
year = "2009",
month = "5",
doi = "10.1007/s00180-008-0124-x",
language = "English (US)",
volume = "24",
pages = "247--254",
journal = "Computational Statistics",
issn = "0943-4062",
publisher = "Springer Verlag",
number = "2",

}

TY - JOUR

T1 - Interacting with local and remote data repositories using the stashR package

AU - Eckel, Sandrah P.

AU - Peng, Roger

PY - 2009/5

Y1 - 2009/5

N2 - The stashR package (a Set of Tools for Administering Shared Repositories) for R implements a basic versioned key-value style database where character string keys are associated with data values. Using the S4 classes 'localDB' and 'remoteDB', and associated methods, versioned key-value databases can be either created locally on the user's computer or accessed remotely via the Internet. The stashR package can enhance reproducible research by providing a 'localDB' database format for the caching of computations which can subsequently be stored on the Internet. To reproduce a particular computation, a reader can access the 'remoteDB' database and obtain the associated R objects.

AB - The stashR package (a Set of Tools for Administering Shared Repositories) for R implements a basic versioned key-value style database where character string keys are associated with data values. Using the S4 classes 'localDB' and 'remoteDB', and associated methods, versioned key-value databases can be either created locally on the user's computer or accessed remotely via the Internet. The stashR package can enhance reproducible research by providing a 'localDB' database format for the caching of computations which can subsequently be stored on the Internet. To reproduce a particular computation, a reader can access the 'remoteDB' database and obtain the associated R objects.

KW - Data distribution

KW - Database

KW - Reproducible research

KW - Version control

UR - http://www.scopus.com/inward/record.url?scp=67349096293&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67349096293&partnerID=8YFLogxK

U2 - 10.1007/s00180-008-0124-x

DO - 10.1007/s00180-008-0124-x

M3 - Article

AN - SCOPUS:67349096293

VL - 24

SP - 247

EP - 254

JO - Computational Statistics

JF - Computational Statistics

SN - 0943-4062

IS - 2

ER -