Standardized Representation of Clinical Study Data Dictionaries with CIMI Archetypes

Deepak K. Sharma, Harold R. Solbrig, Eric Prud'hommeaux, Jyotishman Pathak, Guoqian Jiang

Research output: Contribution to journalArticlepeer-review

Abstract

Researchers commonly use a tabular format to describe and represent clinical study data. The lack of standardization of data dictionary's metadata elements presents challenges for their harmonization for similar studies and impedes interoperability outside the local context. We propose that representing data dictionaries in the form of standardized archetypes can help to overcome this problem. The Archetype Modeling Language (AML) as developed by the Clinical Information Modeling Initiative (CIMI) can serve as a common format for the representation of data dictionary models. We mapped three different data dictionaries (identified from dbGAP, PheKB and TCGA) onto AML archetypes by aligning dictionary variable definitions with the AML archetype elements. The near complete alignment of data dictionaries helped map them into valid AML models that captured all data dictionary model metadata. The outcome of the work would help subject matter experts harmonize data models for quality, semantic interoperability and better downstream data integration.

Original languageEnglish (US)
Pages (from-to)1119-1128
Number of pages10
JournalAMIA ... Annual Symposium proceedings. AMIA Symposium
Volume2016
StatePublished - 2016
Externally publishedYes

ASJC Scopus subject areas

  • Medicine(all)

Fingerprint

Dive into the research topics of 'Standardized Representation of Clinical Study Data Dictionaries with CIMI Archetypes'. Together they form a unique fingerprint.

Cite this