Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The National Library of Medicine's Unified Medical Language System (UMLS) provides a structured lexicon enabling the application of latent semantic analysis for the classification and retrieval of patient diagnoses. Information matrices of complex number values from UMLS entries are constructed to create principal components via singular value decomposition. Natural language diagnosis entries or inquiries can be projected into the resultant N-dimension concept space, and evaluated by cosine deviation form the compressed concept components. Preliminary evaluations show that the technique is promising. A major advantage is the avoidance of manually constructed semantic network data schemes; semantic properties derive from statistical decomposition.

Original languageEnglish (US)
Title of host publicationProceedings of the Annual Conference on Engineering in Medicine and Biology
PublisherPubl by IEEE
Pages1162-1163
Number of pages2
Volume13
Editionpt 3
ISBN (Print)0780302168
StatePublished - 1991
Externally publishedYes
EventProceedings of the 13th Annual International Conference of the IEEE Engineering in Medicine and Biology Society - Orlando, FL, USA
Duration: Oct 31 1991Nov 3 1991

Other

OtherProceedings of the 13th Annual International Conference of the IEEE Engineering in Medicine and Biology Society
CityOrlando, FL, USA
Period10/31/9111/3/91

Fingerprint

Semantics
Singular value decomposition
Medicine

ASJC Scopus subject areas

  • Bioengineering

Cite this

Chute, C. (1991). Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis. In Proceedings of the Annual Conference on Engineering in Medicine and Biology (pt 3 ed., Vol. 13, pp. 1162-1163). Publ by IEEE.

Classification and retrieval of patient records using natural language : An experimental application of latent semantic analysis. / Chute, Christopher.

Proceedings of the Annual Conference on Engineering in Medicine and Biology. Vol. 13 pt 3. ed. Publ by IEEE, 1991. p. 1162-1163.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chute, C 1991, Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis. in Proceedings of the Annual Conference on Engineering in Medicine and Biology. pt 3 edn, vol. 13, Publ by IEEE, pp. 1162-1163, Proceedings of the 13th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Orlando, FL, USA, 10/31/91.
Chute C. Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis. In Proceedings of the Annual Conference on Engineering in Medicine and Biology. pt 3 ed. Vol. 13. Publ by IEEE. 1991. p. 1162-1163
Chute, Christopher. / Classification and retrieval of patient records using natural language : An experimental application of latent semantic analysis. Proceedings of the Annual Conference on Engineering in Medicine and Biology. Vol. 13 pt 3. ed. Publ by IEEE, 1991. pp. 1162-1163
@inproceedings{042ca09d424f446ea16362509451d3cc,
title = "Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis",
abstract = "The National Library of Medicine's Unified Medical Language System (UMLS) provides a structured lexicon enabling the application of latent semantic analysis for the classification and retrieval of patient diagnoses. Information matrices of complex number values from UMLS entries are constructed to create principal components via singular value decomposition. Natural language diagnosis entries or inquiries can be projected into the resultant N-dimension concept space, and evaluated by cosine deviation form the compressed concept components. Preliminary evaluations show that the technique is promising. A major advantage is the avoidance of manually constructed semantic network data schemes; semantic properties derive from statistical decomposition.",
author = "Christopher Chute",
year = "1991",
language = "English (US)",
isbn = "0780302168",
volume = "13",
pages = "1162--1163",
booktitle = "Proceedings of the Annual Conference on Engineering in Medicine and Biology",
publisher = "Publ by IEEE",
edition = "pt 3",

}

TY - GEN

T1 - Classification and retrieval of patient records using natural language

T2 - An experimental application of latent semantic analysis

AU - Chute, Christopher

PY - 1991

Y1 - 1991

N2 - The National Library of Medicine's Unified Medical Language System (UMLS) provides a structured lexicon enabling the application of latent semantic analysis for the classification and retrieval of patient diagnoses. Information matrices of complex number values from UMLS entries are constructed to create principal components via singular value decomposition. Natural language diagnosis entries or inquiries can be projected into the resultant N-dimension concept space, and evaluated by cosine deviation form the compressed concept components. Preliminary evaluations show that the technique is promising. A major advantage is the avoidance of manually constructed semantic network data schemes; semantic properties derive from statistical decomposition.

AB - The National Library of Medicine's Unified Medical Language System (UMLS) provides a structured lexicon enabling the application of latent semantic analysis for the classification and retrieval of patient diagnoses. Information matrices of complex number values from UMLS entries are constructed to create principal components via singular value decomposition. Natural language diagnosis entries or inquiries can be projected into the resultant N-dimension concept space, and evaluated by cosine deviation form the compressed concept components. Preliminary evaluations show that the technique is promising. A major advantage is the avoidance of manually constructed semantic network data schemes; semantic properties derive from statistical decomposition.

UR - http://www.scopus.com/inward/record.url?scp=0026304961&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0026304961&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0026304961

SN - 0780302168

VL - 13

SP - 1162

EP - 1163

BT - Proceedings of the Annual Conference on Engineering in Medicine and Biology

PB - Publ by IEEE

ER -