A Review of Automated Speech-Based Interaction for Cognitive Screening
Abstract
:1. Introduction
2. Literature Review
2.1. Research Questions
- RQ1: Which automated speech-based systems for cognitive screening have been studied?
- RQ2: What are the interaction-related characteristics of the studied automated, speech-based systems for cognitive screening?
2.2. Search Strategy
2.3. Inclusion and Exclusion Criteria
- Written in English and accepted and presented in peer-reviewed publications,
- Describing a speech-based system for cognitive screening that implements ASR and, possibly, ASA;
- Describing a speech-based system that screens for cognitive impairment related to dementia; and
- Evaluating the speech-based system for cognitive screening to some degree, either technically or empirically.
2.4. Screening Process and Results
2.5. Data Collection and Analysis
- The source and full reference;
- The description of the automated speech-based system(s) for cognitive screening, focusing on the technical and interaction aspects;
- The cognitive screening tests that were utilised from the automated speech-based system(s) and their administration details.
- Studies presenting automated speech-based systems for cognitive screening (addressing RQ1);
- The interaction aspects of these systems (addressing RQ2).
3. Results
3.1. User Interface
3.2. Modalities
3.3. Speech-Based Communication
- Structured speech-based communication: This type of communication is based on the screener evaluating the screenee on a particular set of predetermined, standardised, and speech-based screening tasks, where specific responses are expected for the screening score to be calculated. Speech-based, structured, and standardised cognitive screening tests, such as the speech-based versions of the MMSE and the MoCA, which ask for specific responses (e.g., what is the year or the season) from the screenee in order to score better, facilitate this type of communication [11,14,19,23].
- Semi-structured speech-based communication: This type of communication is based on the screener evaluating the screenee on a particular set of predetermined, standardised screening tasks where the screenee’s responses are open-ended. Semi-structured and standardised cognitive screening tests, such as verbal fluency tests that, for example, ask the screenee to name as many animals as they can in one minute, facilitate this type of communication [15,18].
- Unstructured speech-based communication: In this case, the screener and the screenee make conversation about various topics, and the evaluation of the screenee is based on linguistic markers and the quality of the dialogue. The user/screenee can converse with a human screener [16,17] or an AI agent [4].
3.4. Screening Content
3.5. Screener
4. Discussion
Study Limitations
5. Conclusions
Funding
Acknowledgments
Conflicts of Interest
References
- Asgari, M.; Kaye, J.; Dodge, H. Predicting mild cognitive impairment from spontaneous spoken utterances. Alzheimer’s Dement. Transl. Res. Clin. Interv. 2017, 3, 219–228. [Google Scholar] [CrossRef]
- Taler, V.; Phillips, N.A. Language performance in Alzheimer’s disease and mild cognitive impairment: A comparative review. J. Clin. Exp. Neuropsychol. 2008, 30, 501–556. [Google Scholar] [CrossRef] [PubMed]
- Murphy, K.J.; Rich, J.B.; Troyer, A.K. Verbal fluency patterns in amnestic mild cognitive impairment are characteristic of Alzheimer’s type dementia. J. Int. Neuropsychol. Soc. 2006, 12, 570–574. [Google Scholar] [CrossRef] [PubMed]
- Tang, F.; Uchendu, I.; Wang, F.; Dodge, H.H.; Zhou, J. Scalable diagnostic screening of mild cognitive impairment using AI dialogue agent. Sci. Rep. 2020, 10, 1–11. [Google Scholar] [CrossRef] [PubMed]
- König, A.; Satt, A.; Sorin, A.; Hoory, R.; Toledo-Ronen, O.; Derreumaux, A.; Manera, V.; Verhey, F.; Aalten, P.; Robert, P.H.; et al. Automatic speech analysis for the assessment of patients with predementia and Alzheimer’s disease. Alzheimer Dement. Diagn. Assess. Dis. Monit. 2015, 1, 112–124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Falagas, M.E.; Pitsouni, E.I.; Malietzis, G.A.; Pappas, G. Comparison of PubMed, Scopus, Web of Science, and Google Scholar: Strengths and weaknesses. FASEB J. 2008, 22, 338–342. [Google Scholar] [CrossRef] [PubMed]
- Jacso, P. As we may search—Comparison of major features of the Web of Science, Scopus, and Google Scholar citation-based and citation-enhanced databases. Curr. Sci. 2005, 89, 1537–1547. [Google Scholar]
- Vom Brocke, J.; Simons, A.; Niehaves, B.; Riemer, K.; Plattfaut, R.; Cleven, A. Reconstructing the giant: On the importance of rigour in documenting the literature search process. In Proceedings of the European Conference on Information Systems, Verona, Italy, 8–10 June 2009; pp. 2206–2217. [Google Scholar]
- Webster, J.; Watson, R.T. Analyzing the past to prepare for the future: Writing a literature review. MIS Q. 2002, 26, 13–23. [Google Scholar]
- Boletsis, C. The new era of virtual reality locomotion: A systematic literature review of techniques and a proposed typology. Multimodal Technol. Interact. 2017, 1, 24. [Google Scholar] [CrossRef] [Green Version]
- Di Nuovo, A.; Varrasi, S.; Lucas, A.; Conti, D.; McNamara, J.; Soranzo, A. Assessment of cognitive skills via human-robot interaction and cloud computing. J. Bionic Eng. 2019, 16, 526–539. [Google Scholar] [CrossRef]
- Hakkani-Tür, D.; Vergyri, D.; Tur, G. Speech-based automated cognitive status assessment. In Proceedings of the Eleventh Annual Conference of the International Speech Communication Association, Chiba, Japan, 26–30 September 2010; pp. 258–261. [Google Scholar]
- López-de Ipiña, K.; Alonso, J.B.; Barroso, N.; Faundez-Zanuy, M.; Ecay, M.; Solé-Casals, J.; Travieso, C.M.; Estanga, A.; Ezeiza, A. New approaches for Alzheimer’s disease diagnosis based on automatic spontaneous speech analysis and emotional temperature. In International Workshop on Ambient Assisted Living; Springer-Verlag: Berlin/Heidelberg, Germany, 2012; pp. 407–414. [Google Scholar]
- Luperto, M.; Romeo, M.; Lunardini, F.; Basilico, N.; Abbate, C.; Jones, R.; Cangelosi, A.; Ferrante, S.; Borghese, N.A. Evaluating the Acceptability of Assistive Robots for Early Detection of Mild Cognitive Impairment. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, Macau, China, 3–8 November 2019; pp. 1–8. [Google Scholar]
- Mirheidari, B.; Blackburn, D.; O’Malley, R.; Walker, T.; Venneri, A.; Reuber, M.; Christensen, H. Computational cognitive assessment: Investigating the use of an intelligent virtual agent for the detection of early signs of dementia. In Proceedings of the ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 2732–2736. [Google Scholar]
- Mirheidari, B.; Blackburn, D.; Reuber, M.; Walker, T.; Christensen, H. Diagnosing people with dementia using automatic conversation analysis. In Proceedings of the Interspeech—ISCA, San Francisco, CA, USA, 8–12 September 2016; pp. 1220–1224. [Google Scholar]
- Mirheidari, B.; Blackburn, D.; Harkness, K.; Walker, T.; Venneri, A.; Reuber, M.; Christensen, H. Toward the automation of diagnostic conversation analysis in patients with memory complaints. J. Alzheimer’s Dis. 2017, 58, 373–387. [Google Scholar] [CrossRef] [Green Version]
- Pakhomov, S.V.; Marino, S.E.; Banks, S.; Bernick, C. Using automatic speech recognition to assess spoken responses to cognitive tests of semantic verbal fluency. Speech Commun. 2015, 75, 14–26. [Google Scholar] [CrossRef] [Green Version]
- Prange, A.; Niemann, M.; Latendorf, A.; Steinert, A.; Sonntag, D. Multimodal speech-based dialogue for the Mini-Mental State Examination. In Proceedings of the Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, Glasgow, UK, 4–9 May 2019; pp. 1–8. [Google Scholar]
- Roark, B.; Hosom, J.P.; Mitchell, M.; Kaye, J.A. Automatically derived spoken language markers for detecting mild cognitive impairment. In Proceedings of the Second International Conference on Technology and Aging—ICTA, Toronto, ON, USA, 9–11 May 2007. [Google Scholar]
- Tóth, L.; Hoffmann, I.; Gosztolya, G.; Vincze, V.; Szatlóczki, G.; Bánréti, Z.; Pákáski, M.; Kálmán, J. A speech recognition-based solution for the automatic detection of mild cognitive impairment from spontaneous speech. Curr. Alzheimer Res. 2018, 15, 130–138. [Google Scholar] [CrossRef] [PubMed]
- Tröger, J.; Linz, N.; König, A.; Robert, P.; Alexandersson, J. Telephone-Based Dementia Screening I: Automated Semantic Verbal Fluency Assessment. In Proceedings of the 12th EAI International Conference on Pervasive Computing Technologies for Healthcare, New York, NY, USA, 21–24 May 2018; pp. 59–66. [Google Scholar]
- Varrasi, S.; Di Nuovo, S.; Conti, D.; Di Nuovo, A. A social robot for cognitive assessment. In Proceedings of the Companion of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, Chicago, IL, USA, 5–8 March 2018; pp. 269–270. [Google Scholar]
- Folstein, M.F.; Folstein, S.E.; McHugh, P.R. “Mini-mental state”: A practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 1975, 12, 189–198. [Google Scholar] [CrossRef]
- Nasreddine, Z.S.; Phillips, N.A.; Bédirian, V.; Charbonneau, S.; Whitehead, V.; Collin, I.; Cummings, J.L.; Chertkow, H. The Montreal Cognitive Assessment, MoCA: A brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 2005, 53, 695–699. [Google Scholar] [CrossRef] [PubMed]
- Waibel, A.; Steusloff, H.; Stiefelhagen, R.; Watson, K. Computers in the human interaction loop. In Computers in the Human Interaction Loop; Springer-Verlag: London, UK, 2009; pp. 3–6. [Google Scholar]
- Burgener, S.C.; Yang, Y.; Gilbert, R.; Marsh-Yant, S. The effects of a multimodal intervention on outcomes of persons with early-stage dementia. Am. J. Alzheimer’s Dis. Other Dementias® 2008, 23, 382–394. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Boletsis, C.; McCallum, S. Smartkuber: A serious game for cognitive health screening of elderly players. Games Health J. 2016, 5, 241–251. [Google Scholar] [CrossRef] [PubMed]
- Tong, T.; Guana, V.; Jovanovic, A.; Tran, F.; Mozafari, G.; Chignell, M.; Stroulia, E. Rapid deployment and evaluation of mobile serious games: A cognitive assessment case study. Procedia Comput. Sci. 2015, 69, 96–103. [Google Scholar] [CrossRef] [Green Version]
- Verhaegh, J.; Fontijn, W.F.; Aarts, E.H.; Resing, W.C. In-game assessment and training of nonverbal cognitive skills using TagTiles. Pers. Ubiquitous Comput. 2013, 17, 1637–1646. [Google Scholar] [CrossRef]
- Tong, T.; Chignell, M.; Tierney, M.C.; Lee, J. A serious game for clinical assessment of cognitive status: Validation study. JMIR Serious Games 2016, 4, e7. [Google Scholar] [CrossRef]
- Tong, T.; Chignell, M.; Lam, P.; Tierney, M.C.; Lee, J. Designing serious games for cognitive assessment of the elderly. In Proceedings of the International Symposium on Human Factors and Ergonomics in Health Care, Chicago, IL, USA, 16–19 March 2014; SAGE Publications Sage CA: Los Angeles, CA, USA, 2014; Volume 3, pp. 28–35. [Google Scholar]
References | User Interface | Modalities | Speech-BasedCommunication | ScreeningContent | Screener | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Voice | Multimodal | Intelligent Virtual Agent | Virtual Assistant | Socially Assistive Robot | Audio Recorder | Audio Recorder via Telephone | Structured | Semi-Structured | Unstructured | As-Is Validated Test | Adjusted Validated Test | Custom Test | AI Agent | Human | |
Di Nuovo et al. (2019) [11] | X | X | X | X | X | ||||||||||
Hakkani-Tür et al. (2010) [12] | X | X | X | X | X | ||||||||||
König et al. (2015) [5] | X | X | X | X | X | ||||||||||
López-de-Ipiña et al. (2012) [13] | X | X | X | X | X | ||||||||||
Luperto t al. (2019) [14] | X | X | X | X | X | ||||||||||
Mirheidari et al. (2019) [15] | X | X | X | X | X | ||||||||||
Mirheidari et al. (2016) [16] | X | X | X | X | X | ||||||||||
Mirheidari et al. (2017) [17] | X | X | X | X | X | ||||||||||
Pakhomov et al. (2015) [18] | X | X | X | X | X | ||||||||||
Prange et al. (2019) [19] | X | X | X | X | X | ||||||||||
Roark et al. (2007) [20] | X | X | X | X | X | ||||||||||
Tang et al. (2020) [4] | X | X | X | X | X | ||||||||||
Tóth et al. (2018) [21] | X | X | X | X | X | ||||||||||
Tröger et al. (2018) [22] | X | X | X | X | X | ||||||||||
Varrasi et al. (2018) [23] | X | X | X | X | X |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Boletsis, C. A Review of Automated Speech-Based Interaction for Cognitive Screening. Multimodal Technol. Interact. 2020, 4, 93. https://doi.org/10.3390/mti4040093
Boletsis C. A Review of Automated Speech-Based Interaction for Cognitive Screening. Multimodal Technologies and Interaction. 2020; 4(4):93. https://doi.org/10.3390/mti4040093
Chicago/Turabian StyleBoletsis, Costas. 2020. "A Review of Automated Speech-Based Interaction for Cognitive Screening" Multimodal Technologies and Interaction 4, no. 4: 93. https://doi.org/10.3390/mti4040093
APA StyleBoletsis, C. (2020). A Review of Automated Speech-Based Interaction for Cognitive Screening. Multimodal Technologies and Interaction, 4(4), 93. https://doi.org/10.3390/mti4040093