TY - JOUR
T1 - Part of speech production in patients with primary progressive aphasia
T2 - An analysis based on natural language processing
AU - Themistocleous, Charalambos
AU - Webster, Kimberly
AU - Afthinos, Alexandros
AU - Tsapkini, Kyrana
N1 - Funding Information:
We are grateful to our participants and referring physicians for their interest in our study. We thank Olivia Hermann and Bronte Ficek for their assistance in our study. This work was supported through grants from the National Institutes of Health (Grant NIH/NIDCD R01 DC014475) and the Science of Learning Institute at Johns Hopkins University to K. T.
Publisher Copyright:
© 2020 American Speech-Language-Hearing Association.
PY - 2021/2
Y1 - 2021/2
N2 - Background: Primary progressive aphasia (PPA) is a neurodegenerative disorder characterized by a progressive decline of language functions. Its symptoms are grouped into three PPA variants: nonfluent PPA, logopenic PPA, and semantic PPA. Grammatical deficiencies differ depending on the PPA variant. Aims: This study aims to determine the differences between PPA variants with respect to part of speech (POS) production and to identify morphological markers that classify PPA variants using machine learning. By fulfilling these aims, the overarching goal is to provide objective measures that can facilitate clinical diagnosis, evaluation, and prognosis. Method and Procedure: Connected speech productions from PPA patients produced in a picture description task were transcribed, and the POS class of each word was estimated using natural language processing, namely, POS tagging. We then implemented a twofold analysis: (a) linear regression to determine how patients with nonfluent PPA, semantic PPA, and logopenic PPA variants differ in their POS productions and (b) a supervised classification analysis based on POS using machine learning models (i.e., random forests, decision trees, and support vector machines) to subtype PPA variants and generate feature importance (FI). Outcome and Results: Using an automated analysis of a short picture description task, this study showed that content versus function words can distinguish patients with nonfluent PPA, semantic PPA, and logopenic PPA variants. Verbs were less important as distinguishing features of patients with different PPA variants than earlier thought. Finally, the study showed that among the most important distinguishing features of PPA variants were elaborative speech elements, such as adjectives and adverbs.
AB - Background: Primary progressive aphasia (PPA) is a neurodegenerative disorder characterized by a progressive decline of language functions. Its symptoms are grouped into three PPA variants: nonfluent PPA, logopenic PPA, and semantic PPA. Grammatical deficiencies differ depending on the PPA variant. Aims: This study aims to determine the differences between PPA variants with respect to part of speech (POS) production and to identify morphological markers that classify PPA variants using machine learning. By fulfilling these aims, the overarching goal is to provide objective measures that can facilitate clinical diagnosis, evaluation, and prognosis. Method and Procedure: Connected speech productions from PPA patients produced in a picture description task were transcribed, and the POS class of each word was estimated using natural language processing, namely, POS tagging. We then implemented a twofold analysis: (a) linear regression to determine how patients with nonfluent PPA, semantic PPA, and logopenic PPA variants differ in their POS productions and (b) a supervised classification analysis based on POS using machine learning models (i.e., random forests, decision trees, and support vector machines) to subtype PPA variants and generate feature importance (FI). Outcome and Results: Using an automated analysis of a short picture description task, this study showed that content versus function words can distinguish patients with nonfluent PPA, semantic PPA, and logopenic PPA variants. Verbs were less important as distinguishing features of patients with different PPA variants than earlier thought. Finally, the study showed that among the most important distinguishing features of PPA variants were elaborative speech elements, such as adjectives and adverbs.
UR - http://www.scopus.com/inward/record.url?scp=85100938420&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100938420&partnerID=8YFLogxK
U2 - 10.1044/2020_AJSLP-19-00114
DO - 10.1044/2020_AJSLP-19-00114
M3 - Article
C2 - 32697669
AN - SCOPUS:85100938420
SN - 1058-0360
VL - 30
SP - 466
EP - 480
JO - American Journal of Speech-Language Pathology
JF - American Journal of Speech-Language Pathology
IS - 1s
ER -