The Democratization of Data Science Education

Sean Kross, Roger D. Peng, Brian S. Caffo, Ira Gooding, Jeffrey T. Leek

Research output: Contribution to journalArticle

Abstract

Over the last three decades, data have become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis has struggled to keep up. In April 2014, we launched a program of nine courses, the Johns Hopkins Data Science Specialization, which has now had more than 4 million enrollments over the past five years. Here, the program is described and compared to standard data science curricula as they were organized in 2014 and 2015. We show that novel pedagogical and administrative decisions introduced in our program are now standard in online data science programs. The impact of the Data Science Specialization on data science education in the U.S. is also discussed. Finally, we conclude with some thoughts about the future of data science education in a data democratized world.

Original languageEnglish (US)
JournalAmerican Statistician
DOIs
StateAccepted/In press - Jan 1 2019

Fingerprint

Science Education
Specialization
Science education
Democratization
Data analysis
Machine Learning
Statistics

Keywords

  • Applications and case studies
  • Education
  • Statistical computing

ASJC Scopus subject areas

  • Statistics and Probability
  • Mathematics(all)
  • Statistics, Probability and Uncertainty

Cite this

The Democratization of Data Science Education. / Kross, Sean; Peng, Roger D.; Caffo, Brian S.; Gooding, Ira; Leek, Jeffrey T.

In: American Statistician, 01.01.2019.

Research output: Contribution to journalArticle

@article{ff7bd6f25e1443cd82f5eeac6265d041,
title = "The Democratization of Data Science Education",
abstract = "Over the last three decades, data have become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis has struggled to keep up. In April 2014, we launched a program of nine courses, the Johns Hopkins Data Science Specialization, which has now had more than 4 million enrollments over the past five years. Here, the program is described and compared to standard data science curricula as they were organized in 2014 and 2015. We show that novel pedagogical and administrative decisions introduced in our program are now standard in online data science programs. The impact of the Data Science Specialization on data science education in the U.S. is also discussed. Finally, we conclude with some thoughts about the future of data science education in a data democratized world.",
keywords = "Applications and case studies, Education, Statistical computing",
author = "Sean Kross and Peng, {Roger D.} and Caffo, {Brian S.} and Ira Gooding and Leek, {Jeffrey T.}",
year = "2019",
month = "1",
day = "1",
doi = "10.1080/00031305.2019.1668849",
language = "English (US)",
journal = "American Statistician",
issn = "0003-1305",
publisher = "American Statistical Association",

}

TY - JOUR

T1 - The Democratization of Data Science Education

AU - Kross, Sean

AU - Peng, Roger D.

AU - Caffo, Brian S.

AU - Gooding, Ira

AU - Leek, Jeffrey T.

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Over the last three decades, data have become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis has struggled to keep up. In April 2014, we launched a program of nine courses, the Johns Hopkins Data Science Specialization, which has now had more than 4 million enrollments over the past five years. Here, the program is described and compared to standard data science curricula as they were organized in 2014 and 2015. We show that novel pedagogical and administrative decisions introduced in our program are now standard in online data science programs. The impact of the Data Science Specialization on data science education in the U.S. is also discussed. Finally, we conclude with some thoughts about the future of data science education in a data democratized world.

AB - Over the last three decades, data have become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis has struggled to keep up. In April 2014, we launched a program of nine courses, the Johns Hopkins Data Science Specialization, which has now had more than 4 million enrollments over the past five years. Here, the program is described and compared to standard data science curricula as they were organized in 2014 and 2015. We show that novel pedagogical and administrative decisions introduced in our program are now standard in online data science programs. The impact of the Data Science Specialization on data science education in the U.S. is also discussed. Finally, we conclude with some thoughts about the future of data science education in a data democratized world.

KW - Applications and case studies

KW - Education

KW - Statistical computing

UR - http://www.scopus.com/inward/record.url?scp=85074567757&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85074567757&partnerID=8YFLogxK

U2 - 10.1080/00031305.2019.1668849

DO - 10.1080/00031305.2019.1668849

M3 - Article

AN - SCOPUS:85074567757

JO - American Statistician

JF - American Statistician

SN - 0003-1305

ER -