Effectiveness of lexico-syntactic pattern matching for ontology enrichment with clinical documents

Kaihong Liu, W. W. Chapman, G. Savova, C. G. Chute, N. Sioutos, R. S. Crowley

Research output: Contribution to journal › Article › peer-review

Abstract

Objective: To evaluate the effectiveness of a lexico-syntactic pattern (LSP) matching method for ontology enrichment using clinical documents.

Methods: Two domains were separately studied using the same methodology. We used radiology documents to enrich RadLex and pathology documents to enrich National Cancer Institute Thesaurus (NCIT). Several known LSPs were used for semantic knowledge extraction. We first retrieved all sentences that contained LSPs across two large clinical repositories, and examined the frequency of the LSPs. From this set, we randomly sampled LSP instances which were examined by human judges. We used a two-step method to determine the utility of these patterns for enrichment. In the first step, domain experts annotated medically meaningful terms (MMTs) from each sentence within the LSP. In the second step, RadLex and NCIT curators evaluated how many of these MMTs could be added to the resource. To quantify the utility of this LSP method, we defined two evaluation metrics: suggestion rate (SR) and acceptance rate (AR). We used these measures to estimate the yield of concepts and relationships, for each of the two domains.

Results: For NCIT, the concept SR was 24%, and the relationship SR was 65%. The concept AR was 21%, and the relationship AR was 14%. For RadLex, the concept SR was 37%, and the relationship SR was 55%. The concept AR was 11%, and the relationship AR was 44%.

Conclusion: The LSP matching method is an effective method for concept and concept relationship discovery in biomedical domains.
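
The retrieval step described in the Methods (finding sentences that contain an LSP and counting how often each pattern occurs) can be illustrated with a minimal Python sketch. The abstract does not list which patterns were used, so the three patterns below, the naive sentence splitter, the function names, and the sample sentences are all assumptions made for illustration, not the authors' implementation.

```python
import re
from collections import Counter

# Hypothetical, Hearst-style patterns chosen for illustration only;
# the abstract does not state which LSPs the study actually used.
LSP_PATTERNS = {
    "such_as": re.compile(r"\bsuch as\b", re.IGNORECASE),
    "including": re.compile(r"\bincluding\b", re.IGNORECASE),
    "and_other": re.compile(r"\b(?:and|or) other\b", re.IGNORECASE),
}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_lsp_sentences(documents):
    """Return (pattern_name, sentence) pairs for each sentence matching a pattern."""
    hits = []
    for doc in documents:
        for sentence in split_sentences(doc):
            for name, pattern in LSP_PATTERNS.items():
                if pattern.search(sentence):
                    hits.append((name, sentence))
    return hits


def pattern_frequencies(hits):
    """Count how many matching sentences each pattern produced."""
    return Counter(name for name, _ in hits)


# Invented example sentences, not taken from the study's repositories.
documents = [
    "Findings include benign lesions such as hemangioma and cyst. "
    "No fracture is seen.",
    "There are several abnormalities, including atelectasis. "
    "Pneumonia and other infections are unlikely.",
]

hits = find_lsp_sentences(documents)
freq = pattern_frequencies(hits)
```

In a pipeline like the one described, the resulting `hits` would then be randomly sampled and passed to human annotators; that manual step is outside the scope of this sketch.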

Original language: English (US)
Pages (from-to): 397-407
Number of pages: 11
Journal: Methods of Information in Medicine
Volume: 50
Issue number: 5
State: Published - 2011
Externally published: Yes

Keywords

  • Knowledge acquisition
  • Lexico-syntactic pattern
  • Natural language processing
  • Ontology enrichment
  • Ontology learning from text

ASJC Scopus subject areas

  • Health Informatics
  • Advanced and Specialized Nursing
  • Health Information Management
