How to Share Data for Collaboration

Shannon E. Ellis, Jeffrey T Leek

Research output: Contribution to journalArticle

Abstract

Within the statistics community, a number of guiding principles for sharing data have emerged; however, these principles are not always made clear to collaborators generating the data. To bridge this divide, we have established a set of guidelines for sharing data. In these, we highlight the need to provide raw data to the statistician, the importance of consistent formatting, and the necessity of including all essential experimental information and pre-processing steps carried out to the statistician. With these guidelines we hope to avoid errors and delays in data analysis.

Original languageEnglish (US)
Pages (from-to)53-57
Number of pages5
JournalAmerican Statistician
Volume72
Issue number1
DOIs
StatePublished - Jan 2 2018

Fingerprint

Data Sharing
Information Processing
Preprocessing
Divides
Data analysis
Statistics
Collaboration
Data sharing
Necessity
Community

Keywords

  • Analysis
  • Data sharing
  • Guidelines
  • Statistician
  • Tidy data

ASJC Scopus subject areas

  • Statistics and Probability
  • Mathematics(all)
  • Statistics, Probability and Uncertainty

Cite this

How to Share Data for Collaboration. / Ellis, Shannon E.; Leek, Jeffrey T.

In: American Statistician, Vol. 72, No. 1, 02.01.2018, p. 53-57.

Research output: Contribution to journalArticle

Ellis, Shannon E. ; Leek, Jeffrey T. / How to Share Data for Collaboration. In: American Statistician. 2018 ; Vol. 72, No. 1. pp. 53-57.
@article{2d713f30bf4d4e749476eb7a266fea23,
title = "How to Share Data for Collaboration",
abstract = "Within the statistics community, a number of guiding principles for sharing data have emerged; however, these principles are not always made clear to collaborators generating the data. To bridge this divide, we have established a set of guidelines for sharing data. In these, we highlight the need to provide raw data to the statistician, the importance of consistent formatting, and the necessity of including all essential experimental information and pre-processing steps carried out to the statistician. With these guidelines we hope to avoid errors and delays in data analysis.",
keywords = "Analysis, Data sharing, Guidelines, Statistician, Tidy data",
author = "Ellis, {Shannon E.} and Leek, {Jeffrey T}",
year = "2018",
month = "1",
day = "2",
doi = "10.1080/00031305.2017.1375987",
language = "English (US)",
volume = "72",
pages = "53--57",
journal = "American Statistician",
issn = "0003-1305",
publisher = "American Statistical Association",
number = "1",

}

TY - JOUR

T1 - How to Share Data for Collaboration

AU - Ellis, Shannon E.

AU - Leek, Jeffrey T

PY - 2018/1/2

Y1 - 2018/1/2

N2 - Within the statistics community, a number of guiding principles for sharing data have emerged; however, these principles are not always made clear to collaborators generating the data. To bridge this divide, we have established a set of guidelines for sharing data. In these, we highlight the need to provide raw data to the statistician, the importance of consistent formatting, and the necessity of including all essential experimental information and pre-processing steps carried out to the statistician. With these guidelines we hope to avoid errors and delays in data analysis.

AB - Within the statistics community, a number of guiding principles for sharing data have emerged; however, these principles are not always made clear to collaborators generating the data. To bridge this divide, we have established a set of guidelines for sharing data. In these, we highlight the need to provide raw data to the statistician, the importance of consistent formatting, and the necessity of including all essential experimental information and pre-processing steps carried out to the statistician. With these guidelines we hope to avoid errors and delays in data analysis.

KW - Analysis

KW - Data sharing

KW - Guidelines

KW - Statistician

KW - Tidy data

UR - http://www.scopus.com/inward/record.url?scp=85045912478&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85045912478&partnerID=8YFLogxK

U2 - 10.1080/00031305.2017.1375987

DO - 10.1080/00031305.2017.1375987

M3 - Article

AN - SCOPUS:85045912478

VL - 72

SP - 53

EP - 57

JO - American Statistician

JF - American Statistician

SN - 0003-1305

IS - 1

ER -