This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Executive function (EF) is an umbrella term that incorporates a collection of interrelated processes responsible for purposeful, goal-directed behaviour [1]. It encompasses various processes (i.e., cognitive flexibility, working memory, and response inhibition), which play a key role in regulating goal-oriented behaviour and can support function in children [2, 3]. EFs traditionally have been assessed through standardized psychometric measurements [3]. Standardized assessments provide health-related professionals a clinical picture of a person’s ability to perform activities necessary to develop a comprehensive assessment of intervention effectiveness, comparing clinical groups, and outcome monitoring [4, 5]. Standardization entails a rigorous process of examining the psychometric properties (i.e., validity and reliability) of assessment tools [4]. However, the use of standardized assessments in allied health professions has been low due to issues related to resources (i.e., time, financial, and limited clinician’s knowledge) [6, 7].
Traditionally, EF assessments have been carried out using standardized laboratory-based measurements within a controlled environment [3, 8]. Although neuropsychological measures provide good indicators of fundamental cognitive and executive components, neuropsychological tests’ performance is often not predictive of real-world complex task performance and functional ability [9–11]. Daily life performance and the executive abilities that support it often require multitasking and the generation and implementation of adaptive strategies to accommodate novel environments and perform tasks in the real world [9, 11, 12]. EF measures that are used for school-aged children were originally designed and validated for adult populations. When these measures are administered, they should be scaled down towards a version applicable for children and take into consideration that children have different skills, the level of complexity of the tasks, and the developmental context. Therefore, measures with definite norms and better performance should be considered [13]. Furthermore, research on EF assessments has also been through tests that measure singular processes, instead of from a pluralistic perspective reflecting EF’s fundamental construct. Assessment should contain an array of EFs necessary for complex life tasks encompassing real-world contexts [8, 9, 11].
Different allied health professionals have been reported to be responsible for the assessment and intervention related to EF. OTs are part of that allied health team concerned with determining cognitive abilities needed in everyday task performance to perform various activities [14]. These cognitive abilities fall within the EF domains, whose main outcome reflects performance in daily activities and how it contributes to functional independence [8, 11]. Critical to the occupational therapy process is a thorough and comprehensive evaluation of a child’s EF that may influence their abilities to participate in childhood occupations [8]. This is enabled by using adequate measures of EF assessment tools that consider their occupations and contexts. Given the importance of EFs in children’s daily activities, preference should be on performance-based assessments within naturalistic contexts [8, 15]. While there are several available EF tools for children, it is crucial that ecologically valid and performance-based assessments be used in occupational therapy. However, there is no existing systematic review that produces evidence on the measurement properties of EF tools used in children in occupational therapy. Therefore, it is imperative to review the extant evidence base that supports performance-based EF instruments used by OTs for children and examine their measurement properties.
2. Objectives
This systematic review is aimed at identifying performance-based EF instruments used by OTs for children and evaluating their measurement properties.
3. Methods
This systematic review was registered with PROSPERO (CRD42020172107) and written based on the recommendation of PRISMA [16].
3.1. Search Strategy and Selection Criteria
Possible articles for inclusion were searched using the following databases: PubMed, Scopus, CINAHL, EBSCO, MEDLINE, and Google Scholar. In the likelihood that some relevant articles might be missed, we also performed hand searching through known occupational therapy journals, using an initially preestablished and tested search strategy (Supplementary File 1). Two review authors searched the source systems until March 2021. No time filter was applied. A three-level selection process was used. Any disagreements between the review authors were sorted through consensus discussion or a third review author. Articles were included if they met the following criteria: reported on a performance-based EF assessment used in children up to 12 years old; developed, used, or tested by an OT in the study; the instrument measures several EF processes; must report their result of at least one measurement property conducted within the study; published in a peer-reviewed journal; and must have an English version if written in a different language.
3.2. Evaluation of Methodological Quality and Measurement Properties
In assessing the methodological quality, design, and reporting of the included studies, we used the COSMIN criteria (Supplementary File 2) to evaluate measurement properties’ risk of bias within studies. The assessment of methodological quality was accomplished by two independent review authors, with a consensus or a third author being consulted when a conflict in rating arose.
3.3. Data Extraction and Synthesis
The psychometric properties of the reported EF tools in the reviewed articles were extracted using the data extraction form from the Joanna Briggs Institute, which outlines the specific constructs assessed, country/language/culture, mode of administration, setting/context, participant characteristics, results (measurement properties), and authors’ comments. Information on measurement properties was based on the COSMIN taxonomy [17]. Two review authors extracted the data, with a third review author mitigating differences.
The pooled summaries of the reported EF instruments are presented in a summary table and further discussed using a narrative synthesis. To provide the best synthesis measure (Supplementary File 3), we used the levels of evidence for the overall quality of the measurement property, previously usedby Dobson et al. [18]), adapted from Terwee et al.[19]). The summarized evidence in this review was evaluated in its certainty using the Grading of Recommendations Assessment, Development and Evaluation (GRADE; Supplementary File 4). While the GRADE assessment is mainly subjective, we used the rating recommendations suggested for each criterion component to identify the quality of evidence and strength of recommendation.
4. Results
4.1. Study Selection
The comprehensive search resulted in 1,337 articles across all databases and sources. After the first screening level, 91 articles were filtered for duplicates, which resulted in 68 articles eligible for title and abstract screening. Full-text article screening was performed on 20 studies, with 12 articles excluded for reasons of not completely meeting the review criteria. Only eight articles were included in the final review using narrative synthesis and analysis of measurement properties. A summary of the study selection procedures is outlined in Figure 1.
[figure omitted; refer to PDF]4.2. Study Characteristics
The eight included articles reported on five performance-based EF assessment tools: (1) Behavioural Assessment of the Dysexecutive Syndrome for Children (BADS-C) [20], (2) Children’s Cooking Task (CCT) [21–23], (3) Children’s Kitchen Task Assessment (CKTA) [24], (4) Do-Eat [25, 26], and (5) PETA [27]. In combination, the tools were tested among typically developing children and children with conditions between the ages of 5-12 years old (
Table 1
Summary of included studies.
Author/s | Year | Instrument | Age | Population characteristics | Country of development/testing and language | Type of assessment | Executive function assessed | Tasks | Scoring |
Engel-Yeger et al. | 2009 | Behavioural Assessment of the Dysexecutive Syndrome for Children | 8-15 years | Israel (Hebrew) | Performance-based assessment and a 20-item questionnaire for caregivers (Dysexecutive Questionnaire for Children, DEX–C) | Inflexibility, perseverance, novel problem solving, impulsivity, planning, and the ability to utilise feedback to moderate behaviour | Six subtests: | Each subtest will have its own scoring guideline. However, generally, scores are derived from the number of tasks completed correctly and rules broken or errors | |
Chevignard et al. | 2009 | Children’s cooking task | 9-14 years | France (French) | Performance-based assessment | EFs assessed were not explicitly mentioned. However, the execution error assessed by the tool is related to volition, planning, goal direction, or task monitoring | The task entails preparing a chocolate cake and a fruit cocktail while following a recipe with the necessary ingredients and utensils on a table. | Scoring is based on a classification and quantification of errors and the qualitative analysis of the task. | |
Chevignard et al. | 2010 | Children’s cooking task | 8-20 years | n =46 (TBI =25, TD =21) | Australia (English) | ||||
Fogel et al. | 2020 | Children’s cooking task | 10-14 years | n =81 (EFD =41, TD =40) | Israel (Hebrew) | ||||
Rocke et al. | 2008 | Children’s Kitchen Task Assessment | 8-12 years | USA (English) | Performance-based assessment | Initiation, organization, planning and sequencing, judgment and safety, and completion | The child is asked to make a playdough. | Scoring is based on the cues given. The type of cue given is scored from 0 to 5: | |
Josman et al. | 2010 | Do-eat | 5-6.5 years | Israel (Hebrew) | Performance-based assessment | Attention, initiation, sequencing, transition from one activity to another, spatial and temporal organization, inhibition, problem solving, and remembering instructions | The child is asked to perform three tasks: | Test scores range from 1 (unsatisfactory performance) to 5 (very good performance). | |
Rosenblum et al. | 2015 | Do-eat | 6-9 years | Israel (Hebrew) | Performance-based assessment | ||||
Downes et al. | 2018 | Preschool executive | 3-6 years | UK (English) | Performance-based assessment | Working memory, distractibility, organization, emotional control | The task involves using an “ingredients” box with preprepared materials, a recipe book, a timer, and cueing/scoring sheets. The child follows a picture recipe book step-by-step, using the supplied materials, to make the final picture. | Scoring is based on a classification and quantification of errors and the qualitative analysis of the task. |
Note: ADHD: attention-deficit hyperactivity disorder; DCD: developmental coordination disorder; EF: executive function; EFD: executive function deficits; OT: occupational therapist; TD: typically developing; TBI: traumatic brain injury.
Table 2
Summary of EF tools’ measurement properties.
Instrument | Author | Year | COSMIN adequacy of measurement properties | ||||||
Content validity | Structural validity | Internal consistency | Reliability | Construct validity | Cross-cultural validity | Criterion validity | |||
BADS-C | Engel-Yeger et al. | 2009 | Age: significant differences exist between the different age groups between three age groups on the following: playing card test ( | Underwent forward (Hebrew) and backward (English) translations by a bilingual clinician. | |||||
CCT | Chevignard et al. | 2009 | Interrater reliability | Group differences: significant differences exist only between the total number of errors ( | No significant correlation was found between the total number of errors in the cooking task and the scores on the different neuropsychological tests or behavioural questionnaires (i.e., RCF, WCST, TMT-B, Tower of London, six-part test, RBMT, BRIEF, DEX-C). | ||||
Chevignard et al. | 2010 | Cronbach’s | Test-retest reliability | Age: the total number of errors in the CCT significantly decreased with age in the control group ( | The CCT was translated. | Overall, performance in the CCT was significantly correlated ( | |||
Fogel et al. | 2020 | Group: significant differences were found between the groups in the CCT assessment scores ( | The CCT was translated into Hebrew through a process of forward and backward translations. Content validity was pilot tested on a group of five children and a focus group of seven OTs. | A medium positive correlation was found only between the BRIEF-SR subscales plan/-organization ( | |||||
CKTA | Rocke et al. | 2008 | Cronbach’s | Interrater reliability: | Performance significantly improved as age increased (ns). Can discriminate between high- and low-scoring participants when compared to the BRIEF (inhibition: | ||||
Do-eat | Josman et al. | 2010 | Content and face validity: validated by five expert consultants and five experienced pediatric occupational therapists. | Cronbach’s | Interrater reliability: | Construct validity for the Do-Eat was assessed by gauging the tool’s ability to distinguish between the groups of children with and without DCD and found significant differences in executive functions ( | The EF task was not specifically correlated to any EF assessment. | ||
Rosenblum et al. | 2015 | Cronbach’s | Significant group differences were found in the EF scores ( | Significant correlations were found in the ADHD group between the EF Do-Eat score for “preparing chocolate milk” and BRIEF BRI ( | |||||
PETA | Downes et al. | 2018 | Interrater reliability: | Age: performance significantly increased with age in line with the rapid development of executive skills reported during this period ( | The PETA TS was compared with the BRIEF-P GEC. A significant association was observed between the PETA TS and the BRIEF-P GEC ( |
Note: EF: executive function; BADS-C: Behavioural Assessment of the Dysexecutive Syndrome for Children; CKTA: Children’s Kitchen Task Assessment; DEX-C: Dysexecutive Syndrome for Children; PETA: Preschool Executive Task Assessment; TS: total summary score; TC: total number of cues; ICC: intraclass correlation; GEC: general executive composite; BRIEF: Behavior Rating Index of Executive Function; BRIEF-SR: Behavior Rating Index of Executive Function-Self-Report; BRIEF-P: Behavior Rating Index of Executive Function-Preschool; BRI: Behavioural Regulation Index; MI: metacognition index; RBMT: Rivermead Behavioural Memory Test; RCF: Rey-Osterrieth Complex Figure; D-KEFS: Delis–Kaplan Executive Function System; WCST: Wisconsin Card Sorting Test; WISC-IV: Wechsler Intelligence Scale for Children-IV; TMT-B: Trail Making Test Part B; DCD: developmental coordination disorder; ADHD: attention-deficit hyperactivity disorder. Empty cells: no evidence found.
Table 3
Summary of adequacy of EF tools’ measurement properties.
Instrument | Author | Year | COSMIN adequacy of measurement properties | ||||||
Content validity | Structural validity | Internal consistency | Reliability | Construct validity | Cross-cultural validity | Criterion validity | |||
BADS-C | Engel-Yeger et al. | 2009 | + | ? | |||||
CCT | Chevignard et al. | 2009 | + | − | − | ||||
CCT | Chevignard et al. | 2010 | + | + | − | ? | − | ||
CCT | Fogel et al. | 2020 | + | ? | − | ||||
CKTA | Rocke et al. | 2008 | − | + | − | ||||
Do-Eat | Josman et al. | 2010 | ? | + | + | + | − | ||
Do-Eat | Rosenblum et al. | 2015 | + | + | − | ||||
PETA | Downes et al. | 2018 | + | − | − |
Note: BADS-C: Behavioural Assessment of the Dysexecutive Syndrome for Children; CCT: Children’s Cooking Task; CKTA: Children’s Kitchen Task Assessment; PETA: Preschool Executive Task Assessment. Legends (−, ?, +) are explained in Supplementary File 2. Empty cells: no evidence found.
Table 4
Summary of synthesis of best evidence and GRADE rating on evidence certainty for the reviewed EF tools.
Measure | Synthesis of best evidence | GRADE rating on certainty of evidence | ||||||
Content validity | Structural validity | Internal consistency | Reliability | Construct validity | Cross-cultural validity | Criterion validity | ||
BADS-C | +++ | + | Low | |||||
CCT | + | + | ± | ++ | ± | Low | ||
CKTA | + | + | + | Low | ||||
Do-eat | + | +++ | ++ | +++ | ± | Low | ||
PETA | +++ | ++ | ± | Low |
Note: BADS-C: Behavioural Assessment of the Dysexecutive Syndrome for Children; CCT: Children’s Cooking Task; CKTA: Children’s Kitchen Task Assessment; PETA: Preschool Executive Task Assessment. Legends used for synthesis of best evidence (±, +, ++, +++) are explained in Supplementary File 3, while the GRADE rating (i.e., low) is explained in Supplementary File 4. Empty cells: no evidence found.
4.3. Behavioural Assessment of the Dysexecutive Syndrome for Children (BADS-C)
BADS-C [28] is a standardized assessment battery that examines EF in children and adolescents. It contains five developmentally appropriate measures and one questionnaire, amounting to six administered tests. The defined EF areas examined by the BADS-C include (1) inflexibility, (2) perseverance, (3) novel problem solving, (4) impulsivity, (5) planning, and (6) the ability to use feedback to moderate behaviour. The subtests performed were the following: (1) playing card test, (2) water test, (3) key search test, (4) zoo map test 1, (5) zoo map test 2, and (6) six-part test. Each subtest had its scoring guideline. Generally, scores were derived from the number of tasks completed correctly and any broken rules or errors committed.
We found one article that tested the psychometric property of BADS-C within the occupational therapy field [20] among Hebrew-speaking Israeli children aged 8-15 years. The instrument was translated from its original English version to Hebrew, using forward and backward translation by a bilingual clinician to ensure cross-cultural validity. The construct validity showed significant differences between the different age groups on: playing card test (
Construct validity was noted to have at least 75% of the results following the hypotheses, whereas cross-cultural validity significantly lacked information as multiple group factor analysis and DIF analysis were not performed. The content validity, structural validity, internal consistency, reliability, and criterion validity were not examined. On the other hand, strong construct validity and limited cross-cultural validity were recorded for the BADS-C. Overall, the GRADE rating certainty in the evidence for BADS-C was low due to limitations in study quality, sparse data, and probability of reporting bias.
4.4. Children’s Cooking Task (CCT)
CCT was adapted from the Adult Cooking Task to be suitable for children [21]. CCT is a performance-based assessment that measures a child’s EF while doing an open-ended real-life cooking task. The tasks include preparing a chocolate cake and a fruit cocktail while following a recipe using the necessary ingredients and utensils on a table. Three published articles assessed its measurement properties among typically developing children and children with traumatic brain injury in France [22] and Australia [21] and typically developing children and children with EF disorder in Israel [23]. The published studies did not explicitly state the EF measured; however, the execution error assessed by the tool is related to volition, planning, goal direction, and task monitoring. The scoring is based on a classification and quantification of errors and a qualitative analysis of the task.
Reliability properties were only reported in two articles. Internal consistency was found to be good at Cronbach’s
Validity properties were reported in all three articles; however, these were mainly limited to construct and criterion validity. The original version of CCT was in French and translated into English [21] and Hebrew [23] to fit the cultural contexts tested. Aside from language translation, certain aspects of the items were mildly culturally modified (i.e., utensils and measurement units). Construct validity was tested for age and group differences. The total number of errors in the CCT significantly decreased with age in the control group (
The extant evidence fails to provide a report on the content and structural validity of the CCT. There is limited evidence on its internal consistency, with only one study of fair quality due to the number of samples recruited. The overall rating for reliability was deemed limited, as the two articles that reported reliability focused on different types of reliability. While there are significant differences across age and clinical groups, the current evidence was only found for the total number of errors and not for the other measures found in the CCT. Similarly, criterion validity was inconsistent across studies, where item scores on the CCT did not correlate with the varied comparative tools used. Nevertheless, the CCT was the most frequent tool reviewed, with translations and cultural adaptations. Albeit, the specific results of the cross-cultural validation were incompletely reported or reported elsewhere. Overall, the GRADE rating for certainty in the evidence of CCT was low due to limitations in study quality, imprecise or sparse data, inconsistency, and a possible reporting bias.
4.5. Children’s Kitchen Task Performance
Developed by American OTs, the CKTP is an iteration of the Kitchen Task Assessment (KTA; Baum & Edwards [29]) intended for adults. The CKTP involves assessing the child’s EF as they are engaged in a functional and age-appropriate activity (i.e., making playdough) that simulates a simple cooking task seen in the original KTA. Only one article was found to describe the measurement properties of CKTP [24]. In reviewing the measurement properties of CKTP, [24]) were able to describe its development process and report on its reliability (interrater reliability and internal consistency) and validity (discriminant validity). The administration of CKTP is performed by an OT and is aimed at examining a child’s EF skills in the areas of initiation, organization, planning and sequencing, judgment and safety, and completion. These EF processes are represented in the specific tasks within the instrument. Scoring is based on the level of cues given. The available measurement properties of CKTP were tested among African-American, English-speaking children.
Reliability properties report an internal consistency of Cronbach’s
The CKTP falls short in addressing evidence on content, structural, and criterion validity. While internal consistency is reported, the results do not provide concrete evidence on unidimensionality or positive structural validity, with its Cronbach’s α falling short of the <0.70 thresholds. Construct validity, through its ability to discriminate age and performance indicators (i.e., high vs. low), was not consistent with the intended hypothesis. Nevertheless, the CKTP has shown excellent interrater reliability. Thus, while this review finds limited support on its internal consistency and construct validity in the synthesis of best evidence, there is moderate evidence on its interrater reliability. Overall, the GRADE rating certainty in the evidence for CKTA was low due to limitations in study quality, imprecise or sparse data, and inconsistency.
4.6. Do-Eat
Do-Eat is a set of questionnaire and test that measures a child’s task performance, sensory-motor skills, and EF as they participate in three tasks: (1) make a sandwich, (2) prepare chocolate milk, and (3) fill out a certificate of outstanding performance. This review found two published articles that examined its measurement properties among 5-9-year-old Israeli typically developing children and children with DCD [25] or ADHD [26]. The test is administered by an OT and can measure EF processes of attention, initiation, sequencing, transition from one activity to another, spatial and temporal organization, inhibition, problem solving, and remembering instructions, with a scoring of 1-5 (increasing range).
In both articles, Do-Eat has high internal consistency ranging from 0.877 to 0.890. Interrater reliability was only examined among typically developing children and children with DCD [25] and ranged from 0.92 to 1.00. In the same article, content and face validation by OTs was reported, but no specific statistical data was provided. Both articles reported on construct validity (discriminant and concurrent validity), with conflicting results. Do-Eat is reported to discriminate between typically developing children and children with DCD (
Do-Eat fails to provide evidence on its structural and criterion validity. Synthesis of best evidence considered both articles due to the limited availability of evidence. While high internal consistency is reported, there is still limited evidence for exceptionality in its unidimensionality and positive structural validity. High interrater reliability was only reported in one study. The correlation between Do-Eat’s EF tasks with other established EF assessments is inconsistent. Considering this, the synthesis of best evidence on Do-Eat’s measurement properties suggests limited evidence on its content validity, moderate evidence on its internal consistency and interrater reliability, and conflicting evidence on its structural validity. Overall, the GRADE rating certainty in Do-Eat’s evidence was low due to limitations in study quality, uncertainty indirectness, and possible reporting bias.
4.7. Preschool Executive Task Assessment (PETA)
PETA was developed to measure children’s EF using ecologically valid measures (Burgess et al. [30]). EF functions measured by the tool include working memory, distractibility, organization, and emotional control. The tool’s measures included a scoring system independent from a person’s level of functioning and ability in linguistics and motor skills; it focuses on the process of the child in doing a multistep task that could be appropriately seen in the context of the classroom. It consists of both qualitative and quantitative scoring values that cater to different dimensions of interest. We found one study that fit into our review criteria that tested PETA within a population of 166 typically developing English preschool-aged children from the UK [27].
PETA reports good to excellent reliability (interrater reliability:
Among the reviewed studies, PETA is the most recent, which may contribute to the findings that it still fails to provide salient evidence on its content and structural validity and internal consistency. Its reliability properties were deemed strong. However, its construct validity failed to provide convincing reports of age effects across its different scores. Likewise, there is limited evidence on its criterion validity property as the correlations reported were restricted to only significant findings for one measure. Overall, the GRADE rating certainty in the evidence for PETA was noted to be low due to limitations in inconsistency and sparse and imprecise data.
5. Discussion
The evidence on the use of performance-based EF assessment among children in OT practice is limited. This review has highlighted a few articles discussing only five performance-based EF instruments for children: BADS-C, CCT, CKTP, Do-Eat, and PETA. The measurement properties of these EF instruments are at times lacking or conflicting, which substantiates the low certainty of evidence supporting them.
EFs are sets of higher cognitive processes that enable children the ability to participate in various age-related activities [31, 32]. While there are a number of available EF instruments available, the use of performance-based instruments is a key concept that is important in OT practice. Performance-based assessments can link EF with performance in occupations (Burgess et al. [30]). They provide ecologically valid measures of EF outcomes in an authentic context that requires multitasking and reflects the press of everyday task performances [8, 11]. To an extent, performance-based EF assessments ground themselves in an occupation-based perspective. The results in this review emphasize the need to develop EF instruments that can provide a picture of a child’s participation in age-related occupations.
EF assessments in OT practice have gained attention in recent years [33, 34]. The types of EF instruments used for childhood populations include pen and paper proxy-reported measures (i.e., parent answered), laboratory-based procedures (i.e., computer software), and performance-based assessments. One of the more common traditional pen and paper EF assessment tools found in OT literature is the BRIEF [35], a parent-answered questionnaire that measures EF in individuals aged 5-18 using items formulated to reflect activities in daily life. While it has been suggested to have ecological validity [36], it lacks occupation-based contexts, which is important in OT evaluation. In the more recent years, alternative new forms of EF instruments have been suggested in the form of computerised or virtual reality testing [37]. The Jansari assessment of Executive Functions for Children (JEF-C) is a computerised EF assessment in a nonimmersive gamified virtual environment [15]. While both of these tools may seem ecologically valid, it lacks performance aspect in situ. Thus, the application in a real-life setting of EF is not observed and assessed.
This review included BADS-C, a performance-based measure of EF intended for individuals aged 7-16 years old and contains tasks that measure EF used in specific tasks reflecting daily life activities [28]. BADS-C development was not within the occupational therapy context; however, the referred validation study was conducted concerning occupational therapy [20]. BADS-C has been reported in the occupational therapy literature, despite its grounding on neurophysiological perspectives, reflecting the utility of performance-based EF tools across professions. The reviewed performance-based EF tools reported in this study (while developed, tested, or used in the occupational therapy profession) may have implications on its use across other disciplines whose concern is children’s EF.
This review found low certainty of evidence on the measurement properties of BADS-C, CCT, CKTA, Do-Eat, and PETA as EF instruments for children in OT practice. The available yet limited evidence supporting these tools contribute to the decision suggesting that their true measurement property effects might be markedly different from the estimated effect. The low rating in evidence certainty is due to various limitations commonly in the number of available studies reviewed, study quality, inconsistency of measurement properties, and probability of reporting bias. However, considering the critical need for evaluating EF among children [8] and the determinants of the strength of recommendations (GRADE [38]), the authors conditionally suggest using any of these EF tools. Nevertheless, future researchers will need to address further the adequacy of measurement properties of these EF tools.
Individual differences in EF have been seen in various childhood populations. EF can vary between children from different environments [39]. Developmental trajectories related to EF processes have likewise been suggested [40]. Thus, researchers will need to include contextualization of these EF instruments in different cultures, ages, and clinical populations in the future.
5.1. Limitations
There are several limitations to this review. While we used the COSMIN criteria for adequacy of measurement properties and the Terwee criteria for synthesis for best evidence criteria, there are times when these criteria are difficult to interpret. Some ratings were based on the review authors’ judgment and may have violated the standards, making the replication of this review challenging. The use of these criteria in assessing the methodological rigor of the included studies allowed us to appraise and assess these studies robustly. Second, the COSMIN framework is intended for patient-reported outcome measures, and there have been adjustments in its use for performance-based clinical assessment methods. Nevertheless, the methods reported in this review may provide a basis for future studies of the same nature. An extension of COSMIN to cover performance-based assessments will be a welcome development. Lastly, the limited number of studies reviewed may be from the stringent conceptualization of our inclusion criteria, specifically in deciding that the EF assessment tools must have been relevant to OTs. Our review may likely underestimate the breadth of the evidence on EF assessments for children; however, it provides a niche market for our intended end-users. Future research may need to review other EF assessment methods used as an adjunct to occupational therapy.
6. Conclusions
In the current review, we identified five performance-based EF instruments for children in the practice of OT: BADS-C, CCT, CKTP, Do-Eat, and PETA. These five tools assess a child’s EF in real-life settings and age-appropriate activities within the context of their occupations. However, their adequacy of measurement properties is lacking, which led to low certainty in their evidence. Nevertheless, the authors conditionally suggest their use based on the critical need to measure children’s EF. Future research will need to adequately address and report a complete set of measurement properties tested in different childhood populations across ethnicity, age groups, and clinical conditions.
Authors’ Contributions
All authors contributed equally to the development of this systematic review.
[1] G. Gioia, P. Isquith, S. Guy, "Assessment of executive functions in children with neurological impairment," Psychological and Developmental Assessment: Children with Disabilities and Chronic Conditions, pp. 317-356, 2001.
[2] R. Bull, K. Lee, "Executive functioning and mathematics achievement," Child Development Perspectives, vol. 8 no. 1, pp. 36-41, DOI: 10.1111/cdep.12059, 2014.
[3] M. D. Lezak, Neuropsychological Assessment, 2004.
[4] A. L. Fawcett, Principles of Assessment and Outcome Measurement for Occupational Therapists and Physiotherapists: Theory, Skills and Application, 2013.
[5] M. H. Romli, F. Wan Yunus, L. Mackenzie, "Overview of reviews of standardised occupation-based instruments for use in occupational therapy practice," Australian Cccupational Therapy Journal, vol. 66 no. 4, pp. 428-445, DOI: 10.1111/1440-1630.12572, 2019.
[6] L. J. Burton, S. Tyson, A. McGovern, "Staff perceptions of using outcome measures in stroke rehabilitation," Disability and Rehabilitation, vol. 35 no. 10, pp. 828-834, DOI: 10.3109/09638288.2012.709305, 2013.
[7] J. Wallace, C. Byrne, M. Clarke, "Improving the uptake of systematic reviews: a systematic review of intervention effectiveness and relevance," BMJ Open, vol. 4 no. 10,DOI: 10.1136/bmjopen-2014-005834, 2014.
[8] N. Josman, S. Meyer, "Conceptualisation and use of executive functions in paediatrics: a scoping review of occupational therapy literature," Australian Occupational Therapy Journal, vol. 66 no. 1, pp. 77-90, DOI: 10.1111/1440-1630.12525, 2019.
[9] P. A. Crist, "Framing ecological validity in occupational therapy practice," The Open Journal of Occupational Therapy, vol. 3 no. 3,DOI: 10.15453/2168-6408.1181, 2015.
[10] G. A. Gioia, P. K. Isquith, "Ecological assessment of executive function in traumatic brain injury," Developmental Neuropsychology, vol. 25 no. 1, pp. 135-158, DOI: 10.1207/s15326942dn2501&2_8, 2004.
[11] A. Wallisch, L. M. Little, E. Dean, W. Dunn, "Executive function measures for children: a scoping review of ecological validity," OTJR: Occupation, Participation and Health, vol. 38 no. 1,DOI: 10.1177/1539449217727118, 2018.
[12] D. Manchester, N. Priestley, H. Jackson, "The assessment of executive functions: coming out of the office," Brain Injury, vol. 18 no. 11, pp. 1067-1081, DOI: 10.1080/02699050410001672387, 2004.
[13] P. J. Anderson, N. Reidy, "Assessing executive function in preschoolers," Neuropsychological Review, vol. 22 no. 4, pp. 345-360, DOI: 10.1007/s11065-012-9220-3, 2012.
[14] V. Poulin, N. Korner-Bitensky, D. R. Dawson, "Stroke-specific executive function assessment: a literature review of performance-based tools," Australian Occupational Therapy Journal, vol. 60 no. 1,DOI: 10.1111/1440-1630.12024, 2013.
[15] Y. Gilboa, A. Jansari, B. Kerrouche, E. Uçak, A. Tiberghien, O. Benkhaled, M. Chevignard, "Assessment of executive functions in children and adolescents with acquired brain injury (ABI) using a novel complex multi-tasking computerised task: the Jansari assessment of Executive Functions for Children (JEF-C©)," Neuropsychological Rehabilitation, vol. 29 no. 9, pp. 1359-1382, DOI: 10.1080/09602011.2017.1411819, 2019.
[16] D. Moher, A. Liberati, J. Tetzlaff, D. G. Altman, Prisma Group, "Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement," PLoS Medicine, vol. 6 no. 7, article e1000097,DOI: 10.1371/journal.pmed.1000097, 2009.
[17] L. B. Mokkink, C. B. Terwee, D. L. Patrick, J. Alonso, P. W. Stratford, D. L. Knol, L. M. Bouter, H. C. de Vet, "The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes," Journal of Clinical Epidemiology, vol. 63 no. 7, pp. 737-745, DOI: 10.1016/j.jclinepi.2010.02.006, 2010.
[18] F. Dobson, R. S. Hinman, M. Hall, C. B. Terwee, E. M. Roos, K. L. Bennell, "Measurement properties of performance-based measures to assess physical function in hip and knee osteoarthritis: a systematic review," Osteoarthritis and Cartilage, vol. 20 no. 12, pp. 1548-1562, DOI: 10.1016/j.joca.2012.08.015, 2012.
[19] C. B. Terwee, S. D. Bot, M. R. de Boer, D. A. van der Windt, D. L. Knol, J. Dekker, H. C. de Vet, "Quality criteria were proposed for measurement properties of health status questionnaires," Journal of Clinical Epidemiology, vol. 60 no. 1, pp. 34-42, DOI: 10.1016/j.jclinepi.2006.03.012, 2007.
[20] B. Engel-Yeger, N. Josman, S. Rosenblum, "Behavioural assessment of the dysexecutive syndrome for children (BADS-C): an examination of construct validity," Neuropsychological Rehabilitation, vol. 19 no. 5, pp. 662-676, DOI: 10.1080/09602010802622730, 2009.
[21] M. P. Chevignard, C. Catroppa, J. Galvin, V. Anderson, "Development and evaluation of an ecological task to assess executive functioning post childhood TBI: the children's cooking task," Brain Impairment, vol. 11 no. 2, pp. 125-143, DOI: 10.1375/brim.11.2.125, 2010.
[22] M. P. Chevignard, V. Servant, A. Mariller, G. Abada, P. Pradat-Diehl, A. Laurent-Vannier, "Assessment of executive functioning in children after TBI with a naturalistic open-ended task: a pilot study," Developmental Neurorehabilitation, vol. 12 no. 2, pp. 76-91, DOI: 10.1080/17518420902777019, 2009.
[23] Y. Fogel, S. Rosenblum, R. Hirsh, M. Chevignard, N. Josman, "Daily performance of adolescents with executive function deficits: an empirical study using a complex-cooking task," Occupational therapy international, vol. 2020,DOI: 10.1155/2020/3051809, 2020.
[24] K. Rocke, P. Hays, D. Edwards, C. Berg, "Development of a performance assessment of executive function: the Children’s Kitchen Task Assessment," American Journal of Occupational Therapy, vol. 62 no. 5, pp. 528-537, DOI: 10.5014/ajot.62.5.528, 2008.
[25] N. Josman, A. Goffer, S. Rosenblum, "Development and standardization of a “Do–Eat” activity of daily living performance test for children," American Journal of Occupational Therapy, vol. 64 no. 1, pp. 47-58, DOI: 10.5014/ajot.64.1.47, 2010.
[26] S. Rosenblum, C. Frisch, T. Deutsh-Castel, N. Josman, "Daily functioning profile of children with attention deficit hyperactive disorder: a pilot study using an ecological assessment," Neuropsychological Tehabilitation, vol. 25 no. 3, pp. 402-418, DOI: 10.1080/09602011.2014.940980, 2015.
[27] M. Downes, C. Berg, F. J. Kirkham, L. Kischkel, I. McMurray, M. de Haan, "Task utility and norms for the preschool executive task assessment (PETA)," Child Neuropsychology, vol. 24 no. 6, pp. 784-798, DOI: 10.1080/09297049.2017.1333092, 2018.
[28] H. Emslie, F. C. Wilson, V. Burden, I. Nimmo-Smith, B. A. Wilson, Behavioural Assessment of the Dysexecutive Syndrome for Children (BADS-C), 2003.
[29] C. Baum, D. F. Edwards, "Cognitive performance in senile dementia of the Alzheimer's type: the Kitchen Task Assessment," American Journal of Occupational Therapy, vol. 47 no. 5, pp. 431-436, DOI: 10.5014/ajot.47.5.431, 1993.
[30] P. W. Burgess, N. Alderman, C. Forbes, A. Costello, L. M-A. Coates, D. R. Dawson, S. Channon, "The case for the development and use of “ecologically valid” measures of executive function in experimental and clinical neuropsychology," Journal of the international neuropsychological society, vol. 12 no. 2, pp. 194-209, DOI: 10.1017/s1355617706060310, 2006.
[31] S. Gestsdottir, R. M. Lerner, "Positive development in adolescence: the development and role of intentional self-regulation," Human Development, vol. 51 no. 3, pp. 202-224, DOI: 10.1159/000135757, 2008.
[32] M. K. Nyongesa, D. Ssewanyana, A. M. Mutua, E. Chongwo, G. Scerif, C. R. Newton, A. Abubakar, "Assessing executive function in adolescence: a scoping review of existing measures and their psychometric robustness," Frontiers in Psychology, vol. 10,DOI: 10.3389/fpsyg.2019.00311, 2019.
[33] K. M. Antshel, B. O. Hier, R. A. Barkley, Executive functioning theory and ADHD,DOI: 10.1007/978-1-4614-8106-5_7, 2014.
[34] W. L. Tseng, S. S. F. Gau, "Executive function as a mediator in the link between attention-deficit/hyperactivity disorder and social problems," Journal of Child Psychology and Psychiatry, vol. 54 no. 9, pp. 996-1004, DOI: 10.1111/jcpp.12072, 2013.
[35] G. A. Gioia, P. K. Isquith, S. C. Guy, L. Kenworthy, "Test review behavior rating inventory of executive function," Child Neuropsychology, vol. 6 no. 3, pp. 235-238, DOI: 10.1076/chin.6.3.235.3152, 2000.
[36] M. A. Shimoni, B. Engel-Yeger, E. Tirosh, "Executive dysfunctions among boys with attention deficit hyperactivity disorder (ADHD): performance-based test and parents report," Research in Developmental Disabilities, vol. 33 no. 3, pp. 858-865, DOI: 10.1016/j.ridd.2011.12.014, 2012.
[37] G. Lalonde, M. Henry, A. Drouin-Germain, P. Nolin, M. H. Beauchamp, "Assessment of executive function in adolescence: a comparison of traditional and virtual reality tools," Journal of Neuroscience Methods, vol. 219 no. 1, pp. 76-82, DOI: 10.1016/j.jneumeth.2013.07.005, 2013.
[38] GRADE Working Group, "Grading quality of evidence and strength of recommendations," BMJ, vol. 328 no. 7454,DOI: 10.1136/bmj.328.7454.1490, 2004.
[39] T. W. K. Yung, C. Y. Lai, I. N. Gomez, V. Loh, J. Wang, J. Y. Chan, S. S. Ng, "Parents’ perceptions of children’s executive functions across different cities," Child Neuropsychology, vol. 25 no. 2, pp. 152-161, DOI: 10.1080/09297049.2017.1406075, 2019.
[40] J. R. Pureza, H. A. Gonçalves, L. Branco, R. Grassi-Oliveira, R. P. Fonseca, "Executive functions in late childhood: age differences among groups," Psychology & Neuroscience, vol. 6 no. 1, pp. 79-88, DOI: 10.3922/j.psns.2013.1.12, 2013.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Ivan Neil B. Gomez et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
Introduction. The use of executive function (EF) instruments to assess children’s functional performance is obscured with a lack of consensus on which is most suitable to use within the occupational therapy profession. This review identifies EF instruments used by occupational therapists (OTs) for children and evaluates their measurement properties. Methods. This systematic review was registered in PROSPERO (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details








1 Center for Health Research and Movement Science, University of Santo Tomas, Manila, Philippines; Department of Occupational Therapy, College of Rehabilitation Sciences, University of Santo Tomas, Manila, Philippines
2 The Graduate School, University of Santo Tomas, Manila, Philippines
3 Department of Occupational Therapy, College of Rehabilitation Sciences, University of Santo Tomas, Manila, Philippines