Is it possible to identify individuals with mild cognitive impairment and Alzheimer's disease using a 30-minute neuropsychological battery?
É possível identificar indivíduos com comprometimento cognitivo leve e doença de Alzheimer usando uma bateria neuropsicológica de 30 minutos?
Gabriel Coutinho1,2, Ricardo de Oliveira-Souza1,3, Jorge Moll1, Fernanda Tovar-Moll1,2, Paulo Mattos1,2
1 D'Or Institute for Research and Education (IDOR), Rio de Janeiro, RJ, Brazil.
This work was developed at D'Or Institute for Research and Education.
Received: 12/3/2012 – Accepted: 4/4/2013
Background: Distinguishing individuals with dementia from those with mild cognitive impairment (MCI) might demand a neuropsychological evaluation.
Coutinho G, et al. / Rev Psiq Clín. 2013;40(4):139-43
Keywords: Alzheimer's disease, cognitive assessment, dementia, memory, mild cognitive impairment.
Contexto:A diferenciação de pacientes com diagnósticos de demência e de comprometimento cognitivo leve (CCL) pode exigir avaliação neuropsicológica.
Coutinho G, et al. / Rev Psiq Clín. 2013;40(4):139-43
Palavras-chave: Doença de Alzheimer, avaliação cognitiva, demência, memória, comprometimento cognitivo leve.
Many studies of mild cognitive impairment (MCI) in the last decade have sought to elucidate clinical, neuropsychological and neuroimaging aspects1. MCI diagnosis is associated with elevated rates of conversion to AD2. The use of screening tests only, such as the MMSE, is not recommended due to the lack of specificity, sensitivity, or both3. To identify individuals at risk for Alzheimer's disease (AD) or other dementias, appropriate neuropsychological tools to detect MCI are needed. There is no consensus, however, on which tests should be part of such an evaluation. It should also be considered that early treatment with cholinesterase inhibitors may benefit MCI individuals4.
Most studies focus on the amnestic subtype of MCI, single or multiple-domain5, as the best predictor of AD, which is the most prevalent dementia in the elderly6, which associate with significant impact even in initial in mild phase7. It should be noted that there is no consensus on the definition of impairment in non-memory functions, an aspect that may jeopardize the recognition of non-amnestic MCI.
Some studies in Brazil have proposed new neuropsychological batteries and brief screening tools for the diagnosis of both MCI and dementia8-11. Some studies have investigated single tests or existing batteries, such as the Clock Drawing Test (CDT)12, the Mini-Mental State Examination (MMSE) and the Cambridge Mental Disorders of the Elderly Examination (CAMDEX). Diniz et al.9 compared a brief screening and a neuropsychological battery, demonstrating that both could discriminate MCI from dementia (although they failed to discriminate among different types of MCI). Using a different design, Paula et al.13 evaluated a brief screening for a geriatric population that used tasks considered fast and easy to administrate. That study demonstrated the importance of evaluating cognition in elderly populations using batteries with good construction and valid criteria. Ladeira et al.14 showed that a combination of screening tools, such as the MMSE, verbal fluency and CDT, was not useful for identifying cases of AD or MCI.
The studies conducted by both Paula et al.13 and Ladeira et al.14 did not include memory tasks. Diniz et al.9 included a very short memory task that did not evaluate either retention over time or delayed recall. The use of batteries without memory tasks – or without memory tasks evaluating retention over time – has a greater likelihood of failure in identifying cases of amnestic MCI (single domain) where memory deficits are not accompanied by other cognitive impairments.
Comprehensive neuropsychological batteries have the obvious advantage of greater sensitivity to brain injury or dysfunction of any etiology, but at the expense of time and cost. Shorter batteries might also prove useful for research purposes and, to some extent, for clinical practice as well, particularly where a full neuropsychological examination is not immediately available or feasible. The need for consensus on a diagnostic battery for MCI motivated the current study. We aimed to develop a brief neuropsychological battery (lasting up to 30 minutes) to discriminate among normal aging, MCI and dementia. Based on the literature review, we hypothesized that a 30-minute battery would be suitable to discriminate the three groups with good sensitivity and specificity.
One hundred and thirty-one elderly patients (37 controls, 41 MCI and 53 possible or probable AD) were consecutively referred for neuropsychological evaluation in a private clinic. All patients were referred by their physicians because of memory complaints to discriminate among normal aging, MCI or dementia. The demographic profile of the patients is shown in table 1. This study was approved by the Ethics Committee of D'Or Institute for Research and Education; being a retrospective survey of clinical files, patients did not sign an informed consent, in accordance to Brazilian regulations.
Categorical ("golden standard") diagnoses. Diagnoses were made by consensus by a trained, board-certified neuropsychologist (GC) and a board-certified psychiatrist (PM), considering DSM-IV criteria, neuroimaging (MRI), clinical data and the neuropsychological results from the full battery. AD diagnoses were made based on NINCDS-ADRDA criteria15. MCI diagnoses were made according to the definitions proposed by Winblad et al.16, but restricted to amnestic subtype, since we aimed to study individuals at higher risks of developing AD: memory complaints; not demented but also not normal for age; cognitive decline; essentially normal functional activities of daily living. The classification of MCI was determined by a consensus by the aforementioned researchers (PM and GC) based on clinical data and neuropsychological results from the full battery. After clinical data were collected (present complaints, collateral reports, premorbid history, family history, medication), all individuals underwent a comprehensive evaluation that included the following test battery: Logical Memory from WMS-III, the Brazilian version of RAVLT17-18, Family Pictures, Digit Span, Spatial Span, CDT, MMSE, Vocabulary from WAIS-III, Matrix Reasoning from WAIS-III, and verbal fluency, both semantic (animals and fruits) and letter (F, A, and S). Participants were also evaluated with the Geriatric Depression Scale (GDS)19 and the Memory Assessment Complaints Questionnaire (MAC-Q)20. Daily activities were investigated with Pfeffer Functional Activities Questionnaire21. The questionnaire was filled out by a close relative or caregiver.
Brief neuropsychological battery. The results for all patients were blindly coded and evaluated a second time with a subset of the tests used in the full battery. The results of performance on the brief battery were used to make a categorical diagnosis of "normal", "MCI", or "dementia" based on the judgment of the authors who determined categorical diagnoses. In this step, the clinicians only considered neuropsychological results. The diagnoses generated by a brief battery were then compared to the golden standard diagnoses, as explained above. The tests in the brief battery were chosen based on the following criteria: a) a short time should be required to administer the tasks; b) there should be extensive neuropsychological data in the literature about the test structure and its interpretation;
c) there should be normative data for each test – it must be highlighted that some of the used tasks do not have available Brazilian normative groups; d) there should be a memory task that evaluate both immediate and delayed recall, but other cognitive domains should be covered as well; and e) the tests should be either the ones used in the original MCI studies or structurally equivalent to them. The following tests complied with these criteria: the verbal memory task from the original MCI studies (i.e., Logical Memory from WMS22), the semantic (animals) fluency test, the auditory working memory test (Digit Span, comprising measures of the phonological loop and central executive functions), constructional praxis and executive functioning (CDT), and the MMSE. Tests of brief and full battery are listed in table 2.
Agreement rates between the full (comprehensive) and brief batteries were calculated with the kappa statistics with quadratic weighting by considering three categorical diagnoses (dementia, MCI, and normal). Two analyses were then performed, both comparing the brief and the comprehensive batteries, but considering only two groups for each analysis (described below). These analyses used ROC curves to establish the sensitivity and specificity of the brief battery in discriminating (i) normal individuals from a group of demented and MCI patients (i.e., "normal" versus "pathologic"; (ii) individuals with dementia from individuals without dementia, the latter group comprising normal and MCI subjects; and (iii) normal individuals from MCI group. We also compared the performance of the three
groups on all tasks of the full battery. Because most measures did not have a normal distribution, comparisons were performed using the Kruskal-Wallis test, and contrasts were performed using the Mann-Whitney with the post-hoc Bonferroni test. Comparisons of age were made with ANOVA, and the Kruskal-Wallis test was used for comparisons of educational level. The gender distribution was calculated with Pearson's chi-square test (χ²).
The three groups did not differ in age. The normal group had more years of education than the AD group (p = 0.003), but the normal and MCI groups did not differ in terms of education. Gender distribution did not differ across groups. The AD group comprised 53 individuals, whereas the MCI and control groups comprised 41 and 37 individuals, respectively. The results are summarized in table 1.
All neuropsychological tests showed significant differences between the normal and AD groups. When comparing the normal and MCI groups, memory tests (RAVLT A1, RAVLT A5, Family Pictures and Logical Memory [both immediate and delayed recall]), MEEM and semantic verbal fluency (a animals and fruits) reached significance. When comparing the MCI and AD groups, only the forward Digit Span did not reach significance. The results of the comparisons are summarized in table 3.
We also assessed the combination of the tests used to compose the brief battery. The agreement between the full battery and the brief battery was considered substantial (kappa = 0.805) according to the reliability patterns proposed by Shrout and Fleiss23. When the MCI and AD cases were sorted in the same group, the brief battery had good sensitivity (78.4%) and specificity (89.4%) for discriminating patients (MCI plus AD) from normal cases [area under the curve (auc) = 0.84]. The positive and negative predictive values were 74.4% and 91.3%, respectively. When the MCI and normal cases were sorted in the same group, the brief battery had good sensitivity (83.3%) and specificity (90.6%) for discriminating non-demented (MCI plus normal) from AD patients (auc = 0.87). The positive and negative predictive values were 92.9% and 78.7%, respectively. We performed an additional analysis, this time comparing only MCI and normal groups – excluding all individuals that received a diagnosis of AD from either battery –; the comparisons were made considering 28 MCI and 37 normal individuals. In this case, both sensitivity and specificity decreased (64.28% and 78.37%, respectively; auc = 0.71). The positive and negative predictive values were 69.2% and 74.4%, respectively.
Normal individuals had more years of education in comparison to the other groups, a result previously identified in other studies6. The groups did not differ in terms of age (Table 1) or gender distribution The latter two findings are not in accordance with the findings of other studies, which highlight age and gender (female) as two of the most important risk factors for developing AD24. This discrepancy may be secondary to a referral bias because we evaluated a sample of patients referred to a private unit for neuropsychological evaluation of memory problems, which occur in a wider age range of individuals in clinical practice, from adulthood to old age.
Most of the studies on MCI have considered only amnestic subtypes, including both single and multiple domains25. The study conducted by Diniz et al.9 found that amnestic subtypes of MCI were the most common in their sample, corresponding to approximately two-thirds of MCI cases. One possible explanation is that AD is the most common type of dementia; thus, samples comprising individuals with cognitive decline without dementia are expected to include primarily amnestic subtypes of MCI. For this reason, we developed a brief battery that used an episodic memory task allowing us to assess immediate and delayed recall. Therefore, our MCI individuals are amnestic (both single and multiple domains).
The performances in the three groups showed discrepancies in most of the measures, except for the Digit Span and the CDT. Despite its failure to discriminate normal individuals from MCI individuals, the CDT discriminated MCI from AD. This result could be explained by the fact that CDT is a task that aims to evaluate executive functions and constructional praxis, which are not expected to be impaired in the amnestic forms of MCI but are often impaired in AD12. In addition, the CDT is often used to support an AD diagnosis in the literature. We found it to be important even though it does not help to discriminate normal individuals from those with MCI.
Digit Span forward performances did not differ among the three groups, indicating that there were no differences in the phonological loop. In fact, Digit Span does not process semantic information26 and is generally intact at the preclinical and early stages of AD27. Performances on the Digit Span backward (the second step of the Digit Span, in which digits are repeated in the reverse order) were similar in comparisons of normal and MCI individuals, whereas individuals with AD were significantly impaired compared to the non-demented groups. The Digit Span test is a measure of central executive function26; it is expected that amnestic type (single domain) of MCI do not involve difficulties with this function.
All of the other measures of the brief battery (i.e., Logical Memory I and II, verbal fluency (animals), and MMSE) were significantly different between normal and MCI patients as well as between MCI and AD patients.
We also performed a post-hoc analysis of the tests used to compose the brief battery. The agreement between full and brief batteries had a kappa of 0.805. This finding suggests that, at least for samples including individuals with cognitive decline marked by memory deficits, the brief battery might be applied. When grouping MCI and AD cases in the same group and comparing the sensitivity and specificity of the brief battery in discriminating those groups, we found good sensitivity and specificity (78.38% and 89.36%, respectively). The AUC was 0.839, suggesting that this brief battery might also be useful in discriminating normal and pathological aging. When grouping MCI and normal individuals in the same group and comparing their performance to that of AD individuals, we found good sensitivity and specificity (83.33% and 90.57%, respectively), suggesting that this battery might be useful to discriminate individuals with AD from normal individuals.
Contrarily to our expectations, both sensitivity and specificity of brief battery significantly decreased (64.28% and 78.37%, respectively) when we compared only groups of MCI and normal individuals. Additionally, positive and negative predictive values were 69.2% and 74.4%, respectively, which indicates that this brief battery might not be recommended to discriminate MCI and normal individuals.
Controversially to our results, another study conducted in Brazil8 showed that brief batteries could accurately discriminate MCI from normal and demented individuals, although not sensitively enough to establish subtypes of MCI (e.g., amnestic, non-amnestic or multiple domain). Our findings suggest that, at least in a sample that mirror clinical everyday clinical practice (i.e., clinical-control individuals, MCI and individuals with dementia), the use of this brief battery should be restricted to the purpose of determining less specific groups, such as: 1) individuals with no decline versus individuals with cognitive decline (normal versus MCI plus AD); or 2) individuals without AD (MCI plus normal individuals) from individuals with AD.
It should be stressed that the proposed brief battery alone does not provide an MCI diagnosis per se, which requires clinical evaluation by the physician. The use of a brief battery with both immediate and delayed recall might not be recommend for clinicians with the aim of discriminating MCI from normal aging. Furthermore, it should be noted that for the purposes of an individual evaluation, there is a paucity of adequate normative data in our country, which is well known for the enormous heterogeneity of the population in different regions. For this reason, we selected tests with normative data, extensive data, or both because future studies may provide specific norms for the proposed brief battery.
Despite good agreement rates between the full battery and the brief battery ((kappa = 0.805), several diagnoses using our gold standard changed when using the brief battery, which is critical from the perspective of individual cases. For example, 13 MCI subjects were incorrectly classified as having dementia, whereas 5 individuals with dementia were reclassified as MCI.
The brief battery demonstrated good sensitivity and specificity for two specific purposes: 1) the discrimination of individuals without significant cognitive decline versus individuals with significant cognitive decline (normal versus MCI plus AD); and 2) the discrimination of AD from non-AD groups (MCI plus normal individuals).
This study should be understood in light of some limitations. Our dementia group could be heterogeneous regarding the severity of the cognitive deficit because we used categorical diagnoses but not severity scales. Because our study was focused in individuals with memory complaints (following other studies comprising amnestic MCI), we do not know whether our brief battery might be helpful in diagnosing non-amnestic MCI or non-Alzheimer dementias. For the same reason, the value of our brief battery in the ancillary diagnosis of dementias associated with behavioral changes or psychopathological symptoms remains to be demonstrated. Moreover, despite the good statistical agreement between both batteries, its use should be restricted to the aforementioned conditions.
We would like to thank Raphael Suwwan for his contribution with statistical analyses.
Órgão Oficial do Departamento e Instituto de Psiquiatria Faculdade de Medicina - Universidade de São Paulo