A Guide to Teaching Data Science

Stephanie Hicks, Rafael A. Irizarry

Research output: Contribution to journalArticle

Abstract

Demand for data science education is surging and traditional courses offered by statistics departments are not meeting the needs of those seeking training. This has led to a number of opinion pieces advocating for an update to the Statistics curriculum. The unifying recommendation is that computing should play a more prominent role. We strongly agree with this recommendation, but advocate the main priority is to bring applications to the forefront as proposed by Nolan and Speed in 1999. We also argue that the individuals tasked with developing data science courses should not only have statistical training, but also have experience analyzing data with the main objective of solving real-world problems. Here, we share a set of general principles and offer a detailed guide derived from our successful experience developing and teaching a graduate-level, introductory data science course centered entirely on case studies. We argue for the importance of statistical thinking, as defined by Wild and Pfannkuch in 1999 and describe how our approach teaches students three key skills needed to succeed in data science, which we refer to as creating, connecting, and computing. This guide can also be used for statisticians wanting to gain more practical knowledge about data science before embarking on teaching an introductory course. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)382-391
Number of pages10
JournalAmerican Statistician
Volume72
Issue number4
DOIs
StatePublished - Oct 2 2018
Externally publishedYes

Fingerprint

Recommendations
Statistics
Science Education
Computing
Teaching
Update
Experience
Training
Science education
Statistical thinking
Curriculum
Knowledge
Demand
Skills

Keywords

  • Active learning
  • Applied statistics
  • Computing data science
  • Reproducibility
  • Teaching principles

ASJC Scopus subject areas

  • Statistics and Probability
  • Mathematics(all)
  • Statistics, Probability and Uncertainty

Cite this

A Guide to Teaching Data Science. / Hicks, Stephanie; Irizarry, Rafael A.

In: American Statistician, Vol. 72, No. 4, 02.10.2018, p. 382-391.

Research output: Contribution to journalArticle

Hicks, Stephanie ; Irizarry, Rafael A. / A Guide to Teaching Data Science. In: American Statistician. 2018 ; Vol. 72, No. 4. pp. 382-391.
@article{9a7507cbe3b14b6eac1c531f86927277,
title = "A Guide to Teaching Data Science",
abstract = "Demand for data science education is surging and traditional courses offered by statistics departments are not meeting the needs of those seeking training. This has led to a number of opinion pieces advocating for an update to the Statistics curriculum. The unifying recommendation is that computing should play a more prominent role. We strongly agree with this recommendation, but advocate the main priority is to bring applications to the forefront as proposed by Nolan and Speed in 1999. We also argue that the individuals tasked with developing data science courses should not only have statistical training, but also have experience analyzing data with the main objective of solving real-world problems. Here, we share a set of general principles and offer a detailed guide derived from our successful experience developing and teaching a graduate-level, introductory data science course centered entirely on case studies. We argue for the importance of statistical thinking, as defined by Wild and Pfannkuch in 1999 and describe how our approach teaches students three key skills needed to succeed in data science, which we refer to as creating, connecting, and computing. This guide can also be used for statisticians wanting to gain more practical knowledge about data science before embarking on teaching an introductory course. Supplementary materials for this article are available online.",
keywords = "Active learning, Applied statistics, Computing data science, Reproducibility, Teaching principles",
author = "Stephanie Hicks and Irizarry, {Rafael A.}",
year = "2018",
month = "10",
day = "2",
doi = "10.1080/00031305.2017.1356747",
language = "English (US)",
volume = "72",
pages = "382--391",
journal = "American Statistician",
issn = "0003-1305",
publisher = "American Statistical Association",
number = "4",

}

TY - JOUR

T1 - A Guide to Teaching Data Science

AU - Hicks, Stephanie

AU - Irizarry, Rafael A.

PY - 2018/10/2

Y1 - 2018/10/2

N2 - Demand for data science education is surging and traditional courses offered by statistics departments are not meeting the needs of those seeking training. This has led to a number of opinion pieces advocating for an update to the Statistics curriculum. The unifying recommendation is that computing should play a more prominent role. We strongly agree with this recommendation, but advocate the main priority is to bring applications to the forefront as proposed by Nolan and Speed in 1999. We also argue that the individuals tasked with developing data science courses should not only have statistical training, but also have experience analyzing data with the main objective of solving real-world problems. Here, we share a set of general principles and offer a detailed guide derived from our successful experience developing and teaching a graduate-level, introductory data science course centered entirely on case studies. We argue for the importance of statistical thinking, as defined by Wild and Pfannkuch in 1999 and describe how our approach teaches students three key skills needed to succeed in data science, which we refer to as creating, connecting, and computing. This guide can also be used for statisticians wanting to gain more practical knowledge about data science before embarking on teaching an introductory course. Supplementary materials for this article are available online.

AB - Demand for data science education is surging and traditional courses offered by statistics departments are not meeting the needs of those seeking training. This has led to a number of opinion pieces advocating for an update to the Statistics curriculum. The unifying recommendation is that computing should play a more prominent role. We strongly agree with this recommendation, but advocate the main priority is to bring applications to the forefront as proposed by Nolan and Speed in 1999. We also argue that the individuals tasked with developing data science courses should not only have statistical training, but also have experience analyzing data with the main objective of solving real-world problems. Here, we share a set of general principles and offer a detailed guide derived from our successful experience developing and teaching a graduate-level, introductory data science course centered entirely on case studies. We argue for the importance of statistical thinking, as defined by Wild and Pfannkuch in 1999 and describe how our approach teaches students three key skills needed to succeed in data science, which we refer to as creating, connecting, and computing. This guide can also be used for statisticians wanting to gain more practical knowledge about data science before embarking on teaching an introductory course. Supplementary materials for this article are available online.

KW - Active learning

KW - Applied statistics

KW - Computing data science

KW - Reproducibility

KW - Teaching principles

UR - http://www.scopus.com/inward/record.url?scp=85038850549&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85038850549&partnerID=8YFLogxK

U2 - 10.1080/00031305.2017.1356747

DO - 10.1080/00031305.2017.1356747

M3 - Article

VL - 72

SP - 382

EP - 391

JO - American Statistician

JF - American Statistician

SN - 0003-1305

IS - 4

ER -