Introduction
In recent years, the study of individuals reaching very advanced ages, particularly centenarians, has gained substantial traction in gerontology and epidemiology, driven by an increasing interest in potential implications for healthcare and society at large [1–3]. This growing interest has led to an increase in research aimed at understanding the unique characteristics, experiences, and health profiles of individuals who have reached 100 years of age [4–6]. Yet, scholars in this field are confronted with many challenges when it comes to collecting accurate and comprehensive data from this growing demographic group.
The challenges are manifold: poor health status and frailty, potential sensorial and cognitive impairments, and increased fatigue while participating in an interview make self-reported data collection a difficult task. Furthermore, previous studies have shown that in Western countries at least one in five of those aged 80 or over lives in an institution such as retirement or nursing homes and that prevalence rises to at least 30% among people living with dementia [7,8]. In addition, the access to institutions of external individuals, such as researchers or interviewers, can be restricted by gatekeepers or caregivers that want to protect the centenarians [9]. For all these reasons the very old belong to the so called “hard-to-reach” population, more precisely, they represent a typical example of the sub-category of “hard-to-interview” population due to their specific characteristics [10].
The inclusion of proxy respondents, individuals who provide information on behalf of centenarians, offers a pragmatic solution to overcome these obstacles. Indeed, proxy respondents can serve as conduits of valuable information when the centenarians themselves are unable to participate fully in interviews.
While involving proxy respondents can indeed facilitate data collection, it introduces a critical concern: data quality. Previous evidence in the field of survey methodology has identified a wide range of indicators of data quality [11]. In this paper, we concentrate on a particularly critical aspect of centenarian research, namely the potential influence of using proxy respondents on data quality, with a specific focus on item nonresponse (INR). INR refers to the phenomenon where respondents omit or fail to answer specific survey questions [12].
Through analysing data collected within the Swiss Centenarian Study (SWISS100), the first nationwide study on centenarians in Switzerland, we seek to shed light on the proxy respondents’ involvement and its implications for data quality. More specifically, we aim to assess to what extent different types of proxy respondents affect INR, as a paramount indicator. In the following sections, we will review the literature on data quality and data collection when investigating oldest-old people, generally defined as individuals aged 85 and older, as outlined by Kydd et al [13]. Then, we will outline our study’s methodology, present our findings, and discuss their implications for research practice. By undertaking this investigation, we aim to respond to the call for more research addressing the impact of proxy respondents on survey validity [14], while also providing guidance for researchers who navigate the complex terrain of surveying the oldest-old.
Background
Item nonresponse as an indicator of data quality
Data of old and oldest populations may be affected by various sources of errors of representation and measurement [15]. Even a well-designed study will produce biased results if the collected information are correlated with the probability of some groups or individuals to participate in the data collection or not. The use of proxy interviews allows researchers to gather information about those who are less likely to be recruited and interviewed, such as old and very old individuals living in institutional settings and those with more health issues [8,16]. However, to provide valid and reliable answers proxy respondents have to successfully search for the most relevant information, retrieve information from memory, compute judgments on specific situations. If respondents do not run through the question-answer-process as thoroughly as possible, they may not provide answers that most precisely reflect their judgments. This “satisficing behaviour” [17] has been frequently used to explain response effects and bias. Specifically, strong satisficing occurs when respondents completely skip the tasks of retrieval or judgement, leading to INR, one of the main indicators of measurement error that negatively affects quality of data [18,19]. INR can be considered a second-level nonresponse, in contrast to unit nonresponse, which occurs when no data at all is obtained for a respondent [20].
The survey methodology literature usually distinguishes three different types of INR, based on the reasons for the lack of information: “don’t know”, ‘refusal’, and ‘no answer’. In the case of ‘don’t know’ a respondent is willing but unable to respond to a question, maybe because of memory problems when asked about past behaviour or the lack of an opinion on a particular issue. Even though the ‘don’t know’ option may not always be interpreted as INR since it may reflect the absence of a real opinion [21], it is mainly interpreted as a form of a satisficing behaviour occurring when respondents have low ability or low motivation to exert the cognitive efforts necessary to answer a question or when questions are particularly difficult [17,22]. When the respondent has the relevant information but is not willing to answer (e.g., because of the sensitive nature of the information) this is a typical example of ‘refusal’. The third type of INR–‘no answer’–can be considered a residual category. It is usually less intentional since it may contain for instance missing values due to the fact that the interviewer or the respondent did not correctly follow the skip patterns of a questionnaire and thereby missed one or several questions [19]. Several scholars in their studies on old and very old populations have operationalized the INR indicator by combining at least two of these three categories, most of the times ‘don’t know’ and ‘refusal’ options [23–26].
Regardless of its operationalization, INR is one of the major concerns when collecting data since it decreases the available sample size for analyses and can lead to the risk of biased results if the missingness does not occur at random [20].
The use of proxy respondents to collect data about the oldest-old
Proxy respondents, often family members, caregivers, or designated surrogates, step in to provide information on behalf of the primary subject [27]. This practice becomes especially prevalent when dealing with very old individuals, such as centenarians, whose potential health, sensorial, and cognitive limitations may negatively affect their ability to participate fully in surveys [28–30]. While the use of proxy respondents for the old and oldest-old is rooted in practicality, its potential implications for data quality are not well understood [31].
Most of the studies investigating the methodological implications of the use of proxies as data sources have focused so far on the analysis of the agreement between proxy and target respondents and the extent to which the characteristics of these proxy respondents may affect data quality [32]. The critical dimension that received more attention is the nature of the relationship between target participants and their proxies, although with mixed results. Recruiting a spouse as a proxy respondent has been associated with more reliable data in some studies [33,34], while others reported better results when the answers have been given by older people’s offspring or other family members [35,36] because partners of very old adults can be themselves old and frail [37]. Furthermore, in a study of old and oldest-old adults’ health, professional proxy provided better data quality compared to spouses and other family proxies [38]. In addition, in a study on functional abilities and continence there was a tendency for more distant relationship to produce better agreement between older women and their proxy respondents [26]. Other studies reported that higher agreement was associated with being male or older than 60 years [39], living with or visiting more often the target participant [29,40], or less subjective burden due to caregivers duties [41].
Surprisingly, despite the key role of INR as an indicator of bias and measurement error, few scholars focused on this topic when analysing the quality of data collected through the proxies of old and oldest-old adults. Hansen and colleagues [23], in their study about quality of life among German oldest-old, reported that INR was low, i.e., about 4%, and mainly due to non-valid answers given by proxy respondents. Similarly, in a study on older women aged from 66 to 100 years living in England, INR was also low–i.e., less than 3%–for all questions asked to proxy respondents, who were mainly children of target respondents [26]. In their discussion of data collection procedures and challenges with the oldest-old, Rodgers and Herzog [42] stressed the importance of the relationship between target population and proxy respondents in determining the level of INR, especially regarding subjective information. The type of proxy respondent emerged as the primary factor influencing the completeness of responses also in the study by Pickle and colleagues [43]. Specifically, when considering sociodemographic characteristics of both target and proxy respondents, it was observed that siblings were more effective in responding to questions related to the subject’s immediate family or events from early life, while spouses and offspring were better suited to describe events from the subject’s adult life.
All the above results refer to target populations composed of older adults, while there is limited evidence on the very old, such as centenarians, even though research on this population often uses proxy ratings instead of self-reports [44–47]. MacDonald and colleagues [48] conducted a study examining the correspondence between a sample of American centenarians and their proxies regarding mental health ratings. They found no significant differences between mean values provided by the different raters. Another study involving 53 couples of German centenarians and their proxies reported a high degree of similarity in some health ratings [49]. However, neither study delved into the investigation of INR and its association with proxy respondents’ characteristics.
In sum, our analysis of the literature reveals a notable gap in the examination of the characteristics of proxy respondents, particularly in the context of centenarian research, and their potential impact on INR. More specifically, despite the recognized importance of the proxy-target relationship, the exploration of how different types of relationships may affect INR when collecting data on the very old remains relatively unexplored across various disciplines, such as survey methodology, epidemiology, and gerontology. The target respondent-proxy relationship, characterized by its depth, familiarity, and emotional bonds, has the potential to significantly influence the completeness of responses. In order to advance our understanding in this domain, our study placed a central focus on this critical variable. First, we described the characteristics of centenarians who participated in the phone survey within the SWISS100 study, to evaluate to what extent those who answered directly differed from those who were replaced by a proxy respondent. This allowed us to assess whether the exclusion of centenarians who could not participate directly in the data collection would lead to a potential bias of representativeness in the sample, particularly the under-representation of certain categories, such as the frailest individuals or those living in nursing homes. Then, we focused on the association between INR and the type of proxy respondents, accounting for some potential confounding factors, to assess the independent role of the relationship between centenarians and their proxies in influencing the probability of INR. Through this analysis, we could shed light on which proxies would be best to engage in future studies on the very old to collect more complete and higher quality data even in the presence of hard-to-reach and hard-to-interview groups.
Data and methods
The study sample
We based this study on data from SWISS100 study. It is a study aimed at identifying specific characteristics, challenges, and needs of Swiss centenarians, adopting a multidisciplinary perspective, since it investigates biological, health, psychiatric, psychological, and social aspects of the life at this age. SWISS100 involves centenarians as the target population and, for each of them, a proxy respondent, for instance, a family member or a person who has a solid knowledge of the target participant, such as friends or formal caregivers. The study started in 2020 but, due to the Covid-19 outbreak, the data collection, conducted through face-to-face interviews with centenarians and proxies, as well as a paper-and-pencil self-completed questionnaire, was delayed. Waiting for the situation to improve from December 15, 2020, to June 2022, we conducted an exploratory cross-sectional study with telephone interviews. The study protocol was approved by the Swiss Ethics Committees on research involving humans (ID 2020–02063) and was conducted in accordance with the Declaration of Helsinki and with the principles and procedures for integrity in scientific research involving human beings.
The study sample of the phone survey included both centenarians, i.e., older adults aged 100 years or older, living in Switzerland, and proxy respondents. Centenarians were contacted according to a pre-established random order within each Swiss canton, based on a list provided by the Federal Statistical Office. Prior to initiating telephone contact, an invitation letter was sent to all potential centenarian participants in the study. Then, centenarians were contacted via telephone, and those who did not speak one of the Swiss main national languages, namely French, German, or Italian, were excluded. Additionally, individuals who were unable to participate directly in the interview due to severe cognitive impairment, and lacked an available proxy respondent, were also excluded. Specifically, if the telephone contact with the centenarian was not feasible due to issues such as hearing or cognitive problems, trained interviewers inquired whether the person on the phone could answer questions about the centenarian. If not, they requested an alternative contact who might be able to provide information about the centenarian, without specifying the relationship, given the exploratory nature of the telephone study. In the case of institutionalized centenarians unable to participate directly in the interview, the interviewers required from the staff the contact information of a close family member. If no family members were available, it was allowed to recruit as a proxy respondent either a non-kin person or a staff member, both of whom should have a good knowledge of the centenarian. Recruited proxies provided information not only about centenarians who were unable to participate directly in the interview but also about themselves. An information sheet containing all the study characteristics and the rights of each participant was sent after scheduling the date of the interview. Each phone interview, lasting about 60 minutes, began with the acquisition of verbal informed consent and was recorded after the participant’s approval. At the conclusion of each interview, interviewers assessed the respondents’ cooperation and the need for additional clarifications. In total, our data collection encompassed 169 centenarians residing in 22 cantons, out of 26 Swiss cantons. Specifically, our sample included 94 centenarians and 75 proxy respondents who stepped in for those centenarians unable to participate directly.
Variables
Our dependent variable pertains to INR for each proxy respondent during the interview. To address the issue of a small sample size, we aggregated non-valid responses, including ’don’t know’ and ’refusal,’ when calculating the total number of such responses for each proxy respondent, as previously done in survey methodology research [50]. These responses were collected across a set of 14 questions about the centenarian, covering various aspects such as sociodemographic factors (i.e., age, gender, marital status, education, and living situation), self-reported health, a short version of the Geriatric Depression Scale (5-GDS) consisting in 5 items with yes/no answers [51], and 3 items from the Valuation of Life (VOL) scale (i.e., whether the centenarian has religious beliefs, has a strong will to live, and experiences the life as meaningful), measured on a 5-point scale ranging from 0 (not at all) to 4 (very much) [52]. First, we calculated the proportion of INR. Then, owing to the highly skewed distribution of INR, we converted the total INR count for each respondent into a binary variable format. In this format, ’0’ indicated the absence of any INR, while ’1’ represented the presence of at least one non-valid answer in the responses, consistent with the approach employed by previous scholars in their studies on INR. [25,53].
The independent variable pertains to the relationship between the centenarian and the proxy respondent. The original variable provided by SWISS100 data distinguished between sons, daughters, stepchildren, grandchildren, nieces, nephews, spouses, other relatives, friends, acquaintances, formal caregivers, nurses, and nursing home staff members. Given the limited sample size, we reconfigured these categories into a new three-category variable: ‘children’ (son, daughter, stepchild), ’other relatives’ (spouse, grandchild, nephew, niece), and ’non-relatives’ (friend, acquaintance, formal caregiver, nurse, nursing home staff member). The rationale for this choice extends beyond the scarcity within specific categories (e.g., only one spouse among the recruited proxies) to encompass a deliberate focus on the offspring of centenarians. Research suggests that centenarians often rely on their children for primary support in their daily lives [54]. This could make children an accessible and willing group for recruitment as proxy respondents. As for the other proxies, we have distinguished between those related to the centenarian and those who aren’t. As mentioned earlier, this separation potentially leads to variations in the quality and accuracy of the collected data.
The control variables belong to four domains: characteristics of proxy respondents, characteristics of centenarians, proxy respondents’ behaviour during the interview, and data collection features. Characteristics of proxy respondents include gender (0 = female, 1 = male), age at interview, marital status (0 = unmarried, divorced, 1 = married, 2 = widow, widower), education (0 = less than high school, 1 = high school or more), and self-reported health (SRH) (0 = poor, fair, 1 = good, 2 = very good, excellent). Characteristics of centenarians are age at interview, gender, living conditions (0 = home, 1 = nursing home), and health reported by their proxies (0 = poor, fair, 1 = good, 2 = very good, excellent). Proxy respondent’s behaviour during the interview, an essential factor influencing satisficing and respondent ability to provide accurate responses [17,55], was assessed by interviewers at the interview’s conclusion. Specifically, cooperation, originally rated on a 5-point Likert scale (’very good’ to ’very poor’), was recoded as follows to account for valid answers: 0 = very good, 1 = good or fair. Also request for clarifications, evaluated using a 6-item scale (’never’ to ’always’), was recoded as follows: 0 = never, 1 = at least once or twice. Data collection features include year of interview (0 = 2020–2021, 1 = 2022) as a proxy of the variations in the stringency measures implemented to contain the COVID-19 pandemic, and the language-speaking region where the data was collected (0 = French, 1 = German, 2 = Italian).
Data analysis
To start, we analysed the characteristics of our centenarian sample, looking at age, sex, marital status, education, living situation, health, and depression. Descriptive statistics–i.e., percentages for categorical variables and means with standard deviations for continuous variables–were used to summarize the data. We compared centenarians who provided self-reports to those for whom proxy reports were obtained using appropriate statistical tests. For categorical variables, Pearson’s chi-squared or Fisher’s exact test was used, depending on the sample size. For continuous variables, the Mann-Whitney U test, a non-parametric alternative to the t-test, was employed to account for non-normal distributions.
Next, we assessed the prevalence of INR and its association with the proxy-centenarian relationship using the Kruskal-Wallis test (a non-parametric alternative to the analysis of variance) for the proportion of unanswered questions, and Fisher’s exact test for the binary INR variable. Subsequently, to evaluate the independent association between the relationship of centenarians and proxy respondents with INR, we employed a Firth’s logistic regression model. This method was selected due to its suitability for the specific characteristics of our sample, which includes a small sample size and potential issues of data separation. Firth’s logistic regression is an adaptation of standard logistic regression designed to handle situations where traditional methods might struggle, particularly in the presence of small samples or rare events. In such cases, standard logistic regression can yield biased parameter estimates or even fail to converge, especially when a predictor variable perfectly separates the outcome categories–i.e., when the predictor perfectly predicts the outcome. To mitigate these issues, Firth’s logistic regression applies a penalized likelihood estimation technique. This adjustment reduces the bias in the parameter estimates by penalizing the likelihood, making the estimates more robust in the context of small or imbalanced samples. This is particularly important in our study, where the sample size may limit the reliability of traditional logistic regression results. Moreover, the use of Firth’s method ensures that the model can accommodate a variety of independent variables, whether they are continuous, categorical, or a combination of both, without the risk of inflated or biased estimates. This leads to more reliable and interpretable results, which are less likely to be unduly influenced by outliers or the rarity of certain events within the sample [56,57].
We initially ran an unadjusted model, followed by four models adjusting for different sets of control variables: a) characteristics of proxy respondents, b) characteristics of centenarians, c) proxy respondents’ behaviour during the interview, and d) data collection features. Additionally, we conducted checks for multicollinearity among covariates in each model by testing variance inflation factors and tolerance criteria. These checks revealed no issues of collinearity (see S1 Table for complete values). We performed all the analysis using Stata 16.0 [58].
Results
Differences in centenarians’ characteristics based on self-vs proxy reports
We analysed differences in characteristics of centenarians who participated directly in the interview and those for whom we obtained proxy respondent reports (Table 1). There were significant differences in the living situation, χ2 (1, N = 169) = 8.5, p = .003, and there were trends in the distribution of gender across the two groups, χ2 (1, N = 169) = 5.7, p = .017. More specifically, in the centenarian self-reports group there were significantly more men (31.9%) and individuals living in private homes (45.7%) compared to those for whom we collected information through proxy respondents (16.0% and 24.0%, respectively). No significant differences were found regarding other characteristics. Although we found no significant differences for subjective health, χ2(2, N = 165) = 0.652, p = 0.652, and depression scores, t(163) = 1.5, p = 0.091,, their numeric values suggested a trend towards poorer evaluations of health and higher depression levels when the information was provided by a proxy respondent replacing the centenarian (SRH: 45.4% vs. 42.2%, 5-GDS: M = 1.8, SD = 1.6 vs. M = 1.3, SD = 1.4).
[Figure omitted. See PDF.]
Characteristics of proxy respondents by relationship with centenarians
Table 2 presents key characteristics of proxy respondents categorized by their relationship with the centenarians. The largest group comprised the children of centenarians (n = 51), followed by non-relatives (n = 14) and other relatives (n = 10). It is noteworthy that, among centenarians for whom the proxy was not a child, only one of them had no living children. Unfortunately, it is not possible to determine why the children of those centenarians were not recruited in place of the other relatives or non-relatives. The Kruskal-Wallis test indicated a statistically significant difference in age among the three groups, χ2 (2, N = 74) = 6.2, p = .045, with children of centenarians (M = 69.8, SD = 7.7) and other relatives (M = 69.8, SD = 10.8) being older on average compared to non-relatives (M = 56.9, SD = 16.5). No statistically significant differences were observed in sex, marital status, education, or subjective health of the proxies. It could however be the case that the children who did not participate in the study had by themselves health issues, given that they are also of advanced age. We could not test this assumption due to lack of information on these non-participants.
[Figure omitted. See PDF.]
Item non-response rates across proxy types
With regards to the total amount of INR across the 14 questions we considered, the overall rate of unanswered questions among all proxy respondents in the sample was 4% (Table 3). The Kruskal-Wallis test, χ2 (2, N = 75) = 8.5, p = .014, showed that there was a statistically significant difference in the average rate of INR between the three types of proxy respondents. Specifically, children of centenarians (M = 3.4, SD 8.1) and other relatives (M = 2.9, SD = 4.9) had lower rate of INR than non-relatives (M = 7.1, SD = 6.9)–see also S1 Fig. S2 Table provides the absolute and relative distribution of INR for each variable considered. While the INR among the socio-demographic questions concerns only information related to education, most of the INR is due to subjective questions, specifically the VOL items asking whether the centenarian has religious beliefs, a strong will to live, and experiences meaning in life, and even more the GDS depressive symptom items. Table 3 also presents the derived INR variable and shows that the share of proxy respondents who did not answer all questions, i.e., having at least one INR, was 29.3%. Furthermore, there was a notable pattern regarding INR among proxy respondents and their relationship with centenarians: there was a significant association between the type of proxy respondent and INR, as revealed by the Fisher’s exact test (p = .004). Specifically, non-relatives serving as proxies for centenarians showed a much higher likelihood of INR, with 64.3% of cases having at least one unanswered item. This percentage is considerably higher compared to children (19.6%) and other relatives (30.0%) of the centenarians.
[Figure omitted. See PDF.]
The association between item non-response and the type of proxy respondents
The results of the Firth’s logistic models (Table 4) demonstrated a significant pattern in the association between INR and the type of proxy respondents, even when controlling for potential confounding factors. Although odds ratios generally showed a positive association across all models, when comparing the odds ratios for other relatives engaged as proxy respondents to the reference category (i.e. being children of the centenarian), they did not reach statistical significance without or with control variables (no control [M0]: OR = 1.84, 95% CI [0.44–7.77]; proxy characteristics as controls [M1]: OR = 2.94, 95% CI [0.55–15.70]; centenarian’s characteristics as controls [M2]: OR = 1.97, 95% CI [0.43–8.91]; proxy behaviour during interview as control [M3]: OR = 2.26, 95% CI [0.50–10.18]; data collection features [M4]: OR = 1.98, 95% CI [0.43–9.15]). In contrast, despite the wide confidence intervals, the presence of non-relatives as proxy respondent consistently exhibited a statistically significant positive association with INR across models (M0: OR = 6.8, 95% CI [1.96–23.82]; M1: OR = 12.62, 95% CI [1.86–85.47]; M2 (OR = 6.63, 95% CI [1.70–25.85]; M3: OR = 6.10, 95% CI [1.22–30.14]; M4: OR = 6.25, 95% CI [1.67–23.35]). Thus, even controlling for potential confounding factors, the association between being a non-relative proxy and INR remained generally consistent with the unadjusted model suggesting a robust and persistent association between this type of proxy respondent and INR. Complete Firth’s regression models are available in S3 Table and S2 Fig. provides graphical representation of the predicted probabilities of INR across the five models.
[Figure omitted. See PDF.]
Discussion
Our study is situated within the ongoing debate about data quality when investigating hard-to-reach and hard-to-interview populations, with centenarians representing a particularly challenging group in this context. We contribute to the existing literature by shedding light on the understudied association between INR as an essential indicator of data quality and the characteristics of proxy respondents.
First, our results indicate a significant difference in living situations between centenarians interviewed directly and those interviewed through proxy respondents, with centenarians living private setting were more likely to provide self-reports, whereas centenarians in nursing home settings were more likely to be replaced by a proxy respondent. This finding may indeed suggest a potential sample representativeness bias if we had excluded those unable to participate directly in the study, a concern previously highlighted by scholars who emphasized the importance of including oldest-old individuals residing in long-term care facilities such as nursing homes [8,9]. Relying on proxy respondents enables the inclusion of centenarians who might have been too frail or cognitively too limited to participate directly in the interviews. While not statistically significant, our results showed a trend in line with expectations that centenarians who would have been excluded without the opportunity to participate through a proxy respondent tended to exhibit poorer physical and mental health. It is important to note, however, that previous research has highlighted the risk of overestimating negative health conditions when relying on proxy respondents for older individuals [34,59]. Nevertheless, the approach employed for the SWISS100 data collection undeniably guaranteed a more inclusive and comprehensive representation of the Swiss centenarian population.
Considering our results on quality of data given the involvement of proxy respondent, two distinct but interrelated aspects warrant attention and further discussion. Firstly, it’s noteworthy that our study reveals a generally low prevalence of INR, which is consistent with findings from other previous research [23,26]. Thus, including proxy respondents is associated with a general increase in important information, and missingness is quite limited when choosing this approach. This may be because due to the study design, we only took into account a subset of questions and that many of these concerned socio-economic conditions, for which answering is relatively easy even for proxies who know little about the centenarian (e.g., gender or place of living). However, if we focus on the rate of respondents who did not provide complete answers, we have almost a third of them with at least one INR. Plausible explanations for this phenomenon could be linked to the nature of the unanswered questions and the interview mode. The SWISS100 questions addressed to proxy respondents that mainly contributed to INR asked for subjective topics, more specifically the evaluation of psychological states, beliefs or attitudes towards life (i.e., depression, religious beliefs, will to life and meaning in life). These topics may have presented a greater challenge for respondents, resulting in a higher occurrence of unanswered questions. This aligns with the notion that respondents are more likely to skip or avoid questions that touch upon subjective or sensitive matters, a phenomenon that has been observed in various survey research involving the general population [60]. Moreover, as indicated by previous findings in the field of survey methodology, telephone interviews may place a greater cognitive burden on respondents than face-to-face interviews, particularly when dealing with subjective topics [22,53].
Secondly, our findings strongly emphasize the significance of the relationship between proxy respondents and the target population, centenarians in our case. In this regard, our results align with scholars who underscore the importance of this factor as a primary determinant of the data quality collected through proxies for old and oldest-old individuals. [42]. We observed that, unlike non-relatives, children acting as proxy respondents consistently provided more comprehensive responses, irrespective of individual characteristics, target population conditions, and study design features. Additionally, although not statistically significant, our results suggest a tendency according to which other relatives may also offer more complete data compared to non-relatives. This implies that they could be a preferred type of proxy respondents, especially in scenarios where, for instance, a centenarian has no children or they are unavailable for recruitment. This result concerning INR contributes to the broader debate on data quality and the use of proxy respondents in the study of old and oldest-old individuals. Specifically, it aligns with that of scholars who, while focusing more on other indicators of data quality such as proxy reliability or agreement with the target respondents, emphasize that being a close relative, especially a child, often leads to the provision of more accurate and complete data [35,36]. In addition, even though previous studies primarily focusing on populations younger than those involved in SWISS100 have shown that age can lead to an increased risk of INR [19,37], our result supports the argument that the effect of age is more negligible when considering the relationship with the target respondent [43]. This is because children, despite being older adults themselves, were more likely to provide complete data compared to younger non-relative proxy respondents.
Limitations
While this study offers valuable insights, there are several limitations that warrant consideration. First, the study’s small sample size, consisting of 75 proxy respondents, may limit the generalizability of the findings to a broader population. This small sample also led to wider confidence intervals, reducing the precision of our estimates and necessitating cautious interpretation of the results [61–63]. Additionally, due to sample size constraints, a fully adjusted model with all control variables could not be estimated, which may have left certain confounding factors unaddressed. Second, our approach to measuring INR involved combining ’don’t know’ and ’refusal’ responses into a single category. While this was necessary to manage the small sample size, it may have obscured important distinctions between these types of non-responses. Separating these categories in future studies could provide more nuanced insights into the factors contributing to INR [19,64].
Third, the use of telephone interviews, while necessary due to the circumstances of the COVID-19 pandemic, may have placed a greater cognitive burden on respondents, particularly when dealing with subjective questions. This mode of data collection could have contributed to the observed INR, especially for questions requiring assessments of psychological and emotional states.
Addressing these limitations in future research will be essential for enhancing the robustness and generalizability of findings related to data quality in studies involving hard-to-reach and hard-to-interview populations like centenarians.
Conclusion
This study is among the few to investigate the methodological implications of using proxy respondents in centenarian research. Our findings underscore that the type of proxy respondent can significantly affect data quality, particularly INR, in surveys of the oldest-old, such as centenarians. We demonstrate that proxies with close familial ties, such as adult children, generally provide more accurate and complete data, which is crucial for achieving representative samples of this hard-to-reach population. Although this may not be true for all types of questions, and the quality of the resulting answers may vary, this strategy can help alleviate biases that might arise due to the exclusion of individuals with physical or cognitive limitations or those residing in nursing homes. However, it is essential to conduct further research to comprehensively assess the impact of proxy responses on data quality, ideally in the context of larger samples, in order to gain a more nuanced insights on the underlying factors. This research should also consider various indicators of data quality, such as rounding, acquiescence, and straightlining, as this will further contribute to expanding our understanding and reducing uncertainty in the results.
For future research involving proxy respondents of the very old, we recommend addressing potential data quality issues, from the study’s design phase to the analysis stage, to contribute to reinforcing the robustness of the findings. We also encourage researchers to continue exploring innovative data collection methods aimed at improving the representation and accuracy of information on the oldest-old population.
Supporting information
S1 Table. Variance inflation factors and tolerance criteria.
https://doi.org/10.1371/journal.pone.0311847.s001
S2 Table. Distribution of unanswered questions across variables.
https://doi.org/10.1371/journal.pone.0311847.s002
S3 Table. Firth’s logistic regression models showing the association of item nonresponse with proxy-centenarian relationship, controlled for potential confounders.
https://doi.org/10.1371/journal.pone.0311847.s003
S1 Fig. Average INR rate by type of proxy respondent.
https://doi.org/10.1371/journal.pone.0311847.s004
(TIF)
S2 Fig. Mean predicted probability of INR by type of proxy in models M0 to M4.
https://doi.org/10.1371/journal.pone.0311847.s005
(TIF)
References
1. 1. Poon LW, Martin P, Johnson MA, Jazwinski SM, Woodard JL, Nelson P. The Georgia centenarian study. Encyclopedia of Gerontology and Population Aging. Springer; 2022. pp. 5037–5043.
2. 2. Samuelsson S-M, Alfredson BB, Hagberg B, Samuelsson G, Nordbeck B, Brun A, et al. The Swedish Centenarian Study: a multidisciplinary study of five consecutive cohorts at the age of 100. The International Journal of Aging and Human Development. 1997;45: 223–253. pmid:9438877
* View Article
* PubMed/NCBI
* Google Scholar
3. 3. Teixeira L, Araújo L, Paúl C, Ribeiro O. Further survival at age 100: Findings from the Oporto Centenarian study. Experimental Gerontology. 2020;133: 110854. pmid:32004634
* View Article
* PubMed/NCBI
* Google Scholar
4. 4. Cho J, Martin P, Poon LW, Study GC. Successful aging and subjective well-being among oldest-old adults. The Gerontologist. 2015;55: 132–143. pmid:25112594
* View Article
* PubMed/NCBI
* Google Scholar
5. 5. Jopp DS, Min-Kyung S. Park , Lehrfeld J, Paggi ME. Physical, cognitive, social and mental health in near-centenarians and centenarians living in New York City: findings from the Fordham Centenarian Study. BMC Geriatr. 2016;16: 1. pmid:26729190
* View Article
* PubMed/NCBI
* Google Scholar
6. 6. Poon LW, Martin P, Bishop A, Cho J, da Rosa G, Deshpande N, et al. Understanding centenarians’ psychosocial dynamics and their contributions to health and quality of life. Current Gerontology and Geriatrics Research. 2010;2010: 1–13. pmid:20936141
* View Article
* PubMed/NCBI
* Google Scholar
7. 7. Bisla J, Calem M, Begum A, Stewart R. Have we forgotten about dementia in care homes? The importance of maintaining survey research in this sector. Age and ageing. Oxford University Press; 2011. pp. 5–6.
8. 8. Kelfve S, Thorslund M, Lennartsson C. Sampling and non-response bias on health-outcomes in surveys of the oldest old. Eur J Ageing. 2013;10: 237–245. pmid:28804299
* View Article
* PubMed/NCBI
* Google Scholar
9. 9. Schanze J-L, Zins S. Undercoverage of the elderly institutionalized population: The risk of biased estimates and the potentials of weighting. 2019; 19.
* View Article
* Google Scholar
10. 10. Tourangeau R. Defining hard-to-survey populations. Hard-to-survey populations. Cambridge, England: Cambridge University Press; 2014. pp. 3–20.
11. 11. Roberts C, Gilbert E, Allum N, Eisner L. Satisficing in surveys: A systematic review of the literature. Public Opinion Quarterly. 2019;83: 598–626.
* View Article
* Google Scholar
12. 12. Dillman DA, Eltinge JL, Groves RM, Little RJA. Survey nonresponse in design, data collection, and analysis. Survey Nonresponse. New York: Wiley; 2002. pp. 3–26.
13. 13. Kydd A, Fleming A, Paoletti I, Hvalič Touzery S. Exploring terms used for the oldest old in the gerontological literature. Journal of aging and social change. 2020;10: 53–73.
* View Article
* Google Scholar
14. 14. Benítez Baena I, Padilla García JL, Ongena Y. Evaluation of the convergence between “self-reporters” and “proxies” in a disability questionnaire by means of behaviour coding method. Qual Quant. 2012;46: 1311–1322.
* View Article
* Google Scholar
15. 15. Groves RM, Lyberg L. Total survey error: Past, present, and future. Public Opinion Quarterly. 2010;74: 849–879.
* View Article
* Google Scholar
16. 16. Kelfve S. Underestimated health Inequalities among older people—A consequence of excluding the most disabled and disadvantaged. Carr D, editor. The Journals of Gerontology: Series B. 2019;74: e125–e134. pmid:28369563
* View Article
* PubMed/NCBI
* Google Scholar
17. 17. Krosnick JA. Response strategies for coping with the cognitive demands of attitude measures in surveys. Applied Cognitive Psychology. 1991;5: 213–236.
* View Article
* Google Scholar
18. 18. Groves RM, Fowler FJ, Couper MP, Lepkowski JM, Singer E, Tourangeau R. Survey methodology. Second edition. Hoboken, NJ: Wiley Publication; 2009.
19. 19. Koch A, Blohm M. Item nonresponse in the European Social Survey. Research & Methods. 2009;18: 45–65.
* View Article
* Google Scholar
20. 20. De Leeuw ED, Hox JJ, Huisman M. Prevention and treatment of item nonresponse. Journal of Official Statistics. 2003;19: 153–176.
* View Article
* Google Scholar
21. 21. Converse P. Attitudes and non-atitudes: Continuation of a dialogue. The quantitative analysis of social problems. Reading, MA: Addison-Wesley; 1970. pp. 168–189.
22. 22. Krosnick JA, Holbrook AL, Berent MK, Carson RT, Michael Hanemann W, Kopp RJ, et al. The impact of" no opinion" response options on data quality: non-attitude reduction or an invitation to satisfice? Public Opinion Quarterly. 2002;66: 371–403.
* View Article
* Google Scholar
23. 23. Hansen S, Kaspar R, Wagner M, Woopen C, Zank S. The NRW80+ study: conceptual background and study groups. Z Gerontol Geriatr. 2021;54: 76–84. pmid:34570267
* View Article
* PubMed/NCBI
* Google Scholar
24. 24. Kutschar P, Weichbold M. Interviewing elderly in nursing homes–Respondent and survey characteristics as predictors of item nonresponse. Publisher: Survey Methods: Insights from the Field. 2019 [cited 22 Sep 2023].
* View Article
* Google Scholar
25. 25. Schanze J-L. Response behavior and quality of survey data: Comparing elderly respondents in institutions and private households. Sociological Methods & Research. 2021; 0049124121995534.
* View Article
* Google Scholar
26. 26. Shaw C, McColl E, Bond S. Functional abilities and continence: The use of proxy respondents in research involving older people. Quality of Life Research. 2001;9: 1117–1126.
* View Article
* Google Scholar
27. 27. Triplett T. Can your spouse accurately report your activities? An examination of proxy reporting. Survey Practice. 2010;3.
* View Article
* Google Scholar
28. 28. Wolinsky FD, Jones MP, Wehby GL. Gathering data from older adults via proxy respondents: research challenges. Journal of Comparative Effectiveness Research. 2012;1: 467–470. pmid:24236462
* View Article
* PubMed/NCBI
* Google Scholar
29. 29. Gruber-Baldini AL, Shardell M, Lloyd KD, Magaziner J. Use of proxies and informants. The epidemiology of aging. Dordrecht: Springer; 2012. pp. 81–90.
30. 30. Rodgers WL, Herzog AR. The consequences of accepting proxy respondents on total survey error for elderly populations. Health survey research methods. Bethesda: National Center for Health Statistics; 1989. pp. 139–146.
31. 31. Wolinsky FD, Ayres L, Jones MP, Lou Y, Wehby GL, Ullrich FA. A pilot study among older adults of the concordance between their self-reports to a health survey and spousal proxy reports on their behalf. BMC Health Serv Res. 2016;16: 485. pmid:27612571
* View Article
* PubMed/NCBI
* Google Scholar
32. 32. Neumann PJ, Araki SS, Gutterman EM. The use of proxy respondents in studies of older adults: Lessons, challenges, and opportunities. Journal of the American Geriatrics Society. 2000;48: 1646–1654. pmid:11129756
* View Article
* PubMed/NCBI
* Google Scholar
33. 33. Magaziner J, Bassett SS, Hebel JR, Gruber-Baldini A. Use of proxies to measure health and functional status in epidemiologic Studies of community-dwelling women aged 65 years and older. American Journal of Epidemiology. 1996;143: 283–292. pmid:8561163
* View Article
* PubMed/NCBI
* Google Scholar
34. 34. Pierre U, Wood-Dauphinee S, Korner-Bitensky N, Gayton D, Hanley J. Proxy use of the canadian SF-36 in rating health status of the disabled elderly. Journal of Clinical Epidemiology. 1998;51: 983–990. pmid:9817116
* View Article
* PubMed/NCBI
* Google Scholar
35. 35. Lopez A, Tinella L, Caffò A, Bosco A. Measuring the reliability of proxy respondents in behavioural assessments: an open question. Aging Clin Exp Res. 2023 [cited 12 Aug 2023]. pmid:37540380
* View Article
* PubMed/NCBI
* Google Scholar
36. 36. Wehby GL, Jones MP, Ullrich F, Lou Y, Wolinsky FD. Does the relationship of the proxy to the target person affect the concordance between survey reports and medicare claims measures of health services use? Health Serv Res. 2016;51: 314–327. pmid:26059195
* View Article
* PubMed/NCBI
* Google Scholar
37. 37. Bouscaren N, Dartois L, Boutron-Ruault M-C, Vercambre M-N. How do self and proxy dependency evaluations agree? Results from a large cohort of older women. Age and Ageing. 2018;47: 619–624. pmid:29733342
* View Article
* PubMed/NCBI
* Google Scholar
38. 38. Ball AE, Russell EM, Seymour DG, Primrose WR, Garratt AM. Problems in using health survey questionnaires in older patients with physical disabilities. Gerontology. 2001;47: 334–340. pmid:11721148
* View Article
* PubMed/NCBI
* Google Scholar
39. 39. Bassett SS, Magaziner J, Hebel JR. Reliability of proxy response on mental health indices for aged, community-dwelling women. 1990; 6. pmid:2317291
* View Article
* PubMed/NCBI
* Google Scholar
40. 40. Hardy SE, Allore H, Studenski SA. Missing data: A special challenge in aging research. Journal of the American Geriatrics Society. 2009;57: 722–729. pmid:19220562
* View Article
* PubMed/NCBI
* Google Scholar
41. 41. Zank S, Frank S. Family and professional caregivers’ ratings of dementia symptoms and activities of daily living of day care patients: Do differences change over time? Aging & Mental Health. 2002;6: 161–165. pmid:12028885
* View Article
* PubMed/NCBI
* Google Scholar
42. 42. Rodgers WL, Herzog AR. Collecting data about the oldest old: Problems and procedures. The oldest old. New York: Oxford University Press; 1992. pp. 135–156.
43. 43. Pickle LW, Brown LM, Blot WJ. Information available from surrogate respondents in case-control interview studies. American Journal of Epidemiology. 1983;118: 99–108. pmid:6869368
* View Article
* PubMed/NCBI
* Google Scholar
44. 44. Rott C, d ‘Heureuse V, Kliegel M, Schönemann P, Becker G. Die Heidelberger Hundertjährigen-Studie: Theoretische und methodische Grundlagen zur sozialwissenschaftlichen Hochaltrigkeitsforschung. Zeitschrift für Gerontologie und Geriatrie. 2001;34: 356–364.
* View Article
* Google Scholar
45. 45. Sachdev PS, Levitan C, Crawford J, Sidhu M, Slavin M, Richmond R, et al. The Sydney Centenarian Study: methodology and profile of centenarians and near-centenarians. International Psychogeriatrics. 2013;25: 993–1005. pmid:23510643
* View Article
* PubMed/NCBI
* Google Scholar
46. 46. Poon LW, Jazwinski M, Green RC, Woodard JL, Martin P, Rodgers WL, et al. Methodological considerations in studying centenarians: lessons learned from the Georgia centenarian studies. Annual review of gerontology and geriatrics. 2007;27: 231–264. pmid:21852888
* View Article
* PubMed/NCBI
* Google Scholar
47. 47. Ribeiro O, Araújo L, Teixeira L, Duarte N, Brandão D, Martin I, et al. Health status, living arrangements, and service use at 100: Findings from the Oporto Centenarian Study. Journal of Aging & Social Policy. 2016;28: 148–164. pmid:27010687
* View Article
* PubMed/NCBI
* Google Scholar
48. 48. MacDonald M, Martin P, Margrett J, Poon† LW. Correspondence of perceptions about centenarians’ mental health. Aging & Mental Health. 2009;13: 827–837. pmid:19888703
* View Article
* PubMed/NCBI
* Google Scholar
49. 49. Schönemann-Gieck P, Rott Ch, Martin M, d’Heureuse V, Kliegel M, Becker G. Übereinstimmungen und Unterschiede in der selbst- und fremdeingeschätzten Gesundheit bei extrem Hochaltrigen. Z Gerontol Geriat. 2003;36: 429–436. pmid:14685732
* View Article
* PubMed/NCBI
* Google Scholar
50. 50. Skafida V, Morrison F, Devaney J. Answer refused: Exploring how item non-response on domestic abuse questions in a social survey affects analysis. Survey Research Methods. 2022;16: 227–240.
* View Article
* Google Scholar
51. 51. Sheikh JI, Yesavage JA. Geriatric Depression Scale (GDS): recent evidence and development of a shorter version. Clinical Gerontologist: The Journal of Aging and Mental Health. 1986.
* View Article
* Google Scholar
52. 52. Lawton MP, Moss M, Hoffman C, Kleban MH, Ruckdeschel K, Winter L. Valuation of life: a concept and a scale. Journal of Aging and Health. 2001;13: 3–31. pmid:11503845
* View Article
* PubMed/NCBI
* Google Scholar
53. 53. Sloan M, Wright D, Barrett K. Data comparability in a mixed mode telephone and face to face survey of persons with disabilities. Citeseer; 2012.
54. 54. Boerner K, Jopp DS, Park M-KS, Rott C. Whom do centenarians rely on for support? Findings from the Second Heidelberg Centenarian Study. Journal of Aging & Social Policy. 2016;28: 165–186. pmid:26959657
* View Article
* PubMed/NCBI
* Google Scholar
55. 55. Loosveldt G. Interaction characteristics of the difficult-to-interview respondent. International Journal of Public Opinion Research. 1997;9: 386–394.
* View Article
* Google Scholar
56. 56. Firth D. Bias reduction of maximum likelihood estimates. Biometrika. 1993;80: 27–38.
* View Article
* Google Scholar
57. 57. Heinze G, Schemper M. A solution to the problem of separation in logistic regression. Statistics in medicine. 2002;21: 2409–2419. pmid:12210625
* View Article
* PubMed/NCBI
* Google Scholar
58. 58. StataCorp. Stata Statistical Software: Release 16. College Station, Texas, USA: StataCorp LLC; 2019.
59. 59. Andresen EM, Vahle VJ, Lollar D. Proxy reliability: Health-related quality of life (HRQoL) measures for people with disability. Qual Life Res. 2001; 11. pmid:11822794
* View Article
* PubMed/NCBI
* Google Scholar
60. 60. Kays K, Gathercoal K, Buhrow W. Does survey format influence self-disclosure on sensitive question items? Computers in Human Behavior. 2012;28: 251–256.
* View Article
* Google Scholar
61. 61. Creswell JW, Creswell JD. Research design: Qualitative, quantitative, and mixed methods approaches. Sage publications; 2017.
62. 62. Vittinghoff E, Glidden DV, Shiboski SC, McCulloch CE. Regression methods in biostatistics: linear, logistic, survival, and repeated measures models. 2006.
* View Article
* Google Scholar
63. 63. Agresti A. Categorical data analysis. John Wiley & Sons; 2012.
64. 64. Groves RM, Couper MP. Nonresponse in household interview surveys. John Wiley & Sons; 2012.
Citation: Zaccaria D, Falciola J, Masotti B, von Gunten A, Herrmann F, Jopp DS, et al. (2025) Data quality in centenarian research: The proxy-centenarian relationship and item nonresponse in the SWISS100 study. PLoS ONE 20(1): e0311847. https://doi.org/10.1371/journal.pone.0311847
About the Authors:
Daniele Zaccaria
Roles: Conceptualization, Formal analysis, Writing – original draft, Writing – review & editing
E-mail: [email protected]
Affiliation: Department of Business Economics, Centre of Competence on Ageing, Health and Social Care, University of Applied Sciences and Arts of Southern Switzerland, Manno, Switzerland
ORICD: https://orcid.org/0000-0002-9715-3796
Justine Falciola
Roles: Data curation, Writing – review & editing
Affiliation: Faculty of Medicine, Department of Rehabilitation and Geriatrics, Geneva University Hospitals and University of Geneva, Geneva, Switzerland
ORICD: https://orcid.org/0000-0002-1135-9321
Barbara Masotti
Roles: Conceptualization, Writing – review & editing
Affiliations: Department of Business Economics, Centre of Competence on Ageing, Health and Social Care, University of Applied Sciences and Arts of Southern Switzerland, Manno, Switzerland, LIVES Swiss Centre of Expertise in Life Course Research, University of Lausanne and University of Geneva, Geneva, Switzerland
Armin von Gunten
Roles: Funding acquisition, Writing – review & editing
Affiliation: Department of Psychiatry, Service of Old Age Psychiatry, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
François Herrmann
Roles: Funding acquisition, Writing – review & editing
Affiliation: Faculty of Medicine, Department of Rehabilitation and Geriatrics, Geneva University Hospitals and University of Geneva, Geneva, Switzerland
Daniela S. Jopp
Roles: Funding acquisition, Writing – review & editing
Affiliations: LIVES Swiss Centre of Expertise in Life Course Research, University of Lausanne and University of Geneva, Geneva, Switzerland, Institute of Psychology, University of Lausanne, Lausanne, Switzerland
Stefano Cavalli
Roles: Conceptualization, Funding acquisition, Writing – review & editing
Affiliation: Department of Business Economics, Centre of Competence on Ageing, Health and Social Care, University of Applied Sciences and Arts of Southern Switzerland, Manno, Switzerland
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
1. Poon LW, Martin P, Johnson MA, Jazwinski SM, Woodard JL, Nelson P. The Georgia centenarian study. Encyclopedia of Gerontology and Population Aging. Springer; 2022. pp. 5037–5043.
2. Samuelsson S-M, Alfredson BB, Hagberg B, Samuelsson G, Nordbeck B, Brun A, et al. The Swedish Centenarian Study: a multidisciplinary study of five consecutive cohorts at the age of 100. The International Journal of Aging and Human Development. 1997;45: 223–253. pmid:9438877
3. Teixeira L, Araújo L, Paúl C, Ribeiro O. Further survival at age 100: Findings from the Oporto Centenarian study. Experimental Gerontology. 2020;133: 110854. pmid:32004634
4. Cho J, Martin P, Poon LW, Study GC. Successful aging and subjective well-being among oldest-old adults. The Gerontologist. 2015;55: 132–143. pmid:25112594
5. Jopp DS, Min-Kyung S. Park , Lehrfeld J, Paggi ME. Physical, cognitive, social and mental health in near-centenarians and centenarians living in New York City: findings from the Fordham Centenarian Study. BMC Geriatr. 2016;16: 1. pmid:26729190
6. Poon LW, Martin P, Bishop A, Cho J, da Rosa G, Deshpande N, et al. Understanding centenarians’ psychosocial dynamics and their contributions to health and quality of life. Current Gerontology and Geriatrics Research. 2010;2010: 1–13. pmid:20936141
7. Bisla J, Calem M, Begum A, Stewart R. Have we forgotten about dementia in care homes? The importance of maintaining survey research in this sector. Age and ageing. Oxford University Press; 2011. pp. 5–6.
8. Kelfve S, Thorslund M, Lennartsson C. Sampling and non-response bias on health-outcomes in surveys of the oldest old. Eur J Ageing. 2013;10: 237–245. pmid:28804299
9. Schanze J-L, Zins S. Undercoverage of the elderly institutionalized population: The risk of biased estimates and the potentials of weighting. 2019; 19.
10. Tourangeau R. Defining hard-to-survey populations. Hard-to-survey populations. Cambridge, England: Cambridge University Press; 2014. pp. 3–20.
11. Roberts C, Gilbert E, Allum N, Eisner L. Satisficing in surveys: A systematic review of the literature. Public Opinion Quarterly. 2019;83: 598–626.
12. Dillman DA, Eltinge JL, Groves RM, Little RJA. Survey nonresponse in design, data collection, and analysis. Survey Nonresponse. New York: Wiley; 2002. pp. 3–26.
13. Kydd A, Fleming A, Paoletti I, Hvalič Touzery S. Exploring terms used for the oldest old in the gerontological literature. Journal of aging and social change. 2020;10: 53–73.
14. Benítez Baena I, Padilla García JL, Ongena Y. Evaluation of the convergence between “self-reporters” and “proxies” in a disability questionnaire by means of behaviour coding method. Qual Quant. 2012;46: 1311–1322.
15. Groves RM, Lyberg L. Total survey error: Past, present, and future. Public Opinion Quarterly. 2010;74: 849–879.
16. Kelfve S. Underestimated health Inequalities among older people—A consequence of excluding the most disabled and disadvantaged. Carr D, editor. The Journals of Gerontology: Series B. 2019;74: e125–e134. pmid:28369563
17. Krosnick JA. Response strategies for coping with the cognitive demands of attitude measures in surveys. Applied Cognitive Psychology. 1991;5: 213–236.
18. Groves RM, Fowler FJ, Couper MP, Lepkowski JM, Singer E, Tourangeau R. Survey methodology. Second edition. Hoboken, NJ: Wiley Publication; 2009.
19. Koch A, Blohm M. Item nonresponse in the European Social Survey. Research & Methods. 2009;18: 45–65.
20. De Leeuw ED, Hox JJ, Huisman M. Prevention and treatment of item nonresponse. Journal of Official Statistics. 2003;19: 153–176.
21. Converse P. Attitudes and non-atitudes: Continuation of a dialogue. The quantitative analysis of social problems. Reading, MA: Addison-Wesley; 1970. pp. 168–189.
22. Krosnick JA, Holbrook AL, Berent MK, Carson RT, Michael Hanemann W, Kopp RJ, et al. The impact of" no opinion" response options on data quality: non-attitude reduction or an invitation to satisfice? Public Opinion Quarterly. 2002;66: 371–403.
23. Hansen S, Kaspar R, Wagner M, Woopen C, Zank S. The NRW80+ study: conceptual background and study groups. Z Gerontol Geriatr. 2021;54: 76–84. pmid:34570267
24. Kutschar P, Weichbold M. Interviewing elderly in nursing homes–Respondent and survey characteristics as predictors of item nonresponse. Publisher: Survey Methods: Insights from the Field. 2019 [cited 22 Sep 2023].
25. Schanze J-L. Response behavior and quality of survey data: Comparing elderly respondents in institutions and private households. Sociological Methods & Research. 2021; 0049124121995534.
26. Shaw C, McColl E, Bond S. Functional abilities and continence: The use of proxy respondents in research involving older people. Quality of Life Research. 2001;9: 1117–1126.
27. Triplett T. Can your spouse accurately report your activities? An examination of proxy reporting. Survey Practice. 2010;3.
28. Wolinsky FD, Jones MP, Wehby GL. Gathering data from older adults via proxy respondents: research challenges. Journal of Comparative Effectiveness Research. 2012;1: 467–470. pmid:24236462
29. Gruber-Baldini AL, Shardell M, Lloyd KD, Magaziner J. Use of proxies and informants. The epidemiology of aging. Dordrecht: Springer; 2012. pp. 81–90.
30. Rodgers WL, Herzog AR. The consequences of accepting proxy respondents on total survey error for elderly populations. Health survey research methods. Bethesda: National Center for Health Statistics; 1989. pp. 139–146.
31. Wolinsky FD, Ayres L, Jones MP, Lou Y, Wehby GL, Ullrich FA. A pilot study among older adults of the concordance between their self-reports to a health survey and spousal proxy reports on their behalf. BMC Health Serv Res. 2016;16: 485. pmid:27612571
32. Neumann PJ, Araki SS, Gutterman EM. The use of proxy respondents in studies of older adults: Lessons, challenges, and opportunities. Journal of the American Geriatrics Society. 2000;48: 1646–1654. pmid:11129756
33. Magaziner J, Bassett SS, Hebel JR, Gruber-Baldini A. Use of proxies to measure health and functional status in epidemiologic Studies of community-dwelling women aged 65 years and older. American Journal of Epidemiology. 1996;143: 283–292. pmid:8561163
34. Pierre U, Wood-Dauphinee S, Korner-Bitensky N, Gayton D, Hanley J. Proxy use of the canadian SF-36 in rating health status of the disabled elderly. Journal of Clinical Epidemiology. 1998;51: 983–990. pmid:9817116
35. Lopez A, Tinella L, Caffò A, Bosco A. Measuring the reliability of proxy respondents in behavioural assessments: an open question. Aging Clin Exp Res. 2023 [cited 12 Aug 2023]. pmid:37540380
36. Wehby GL, Jones MP, Ullrich F, Lou Y, Wolinsky FD. Does the relationship of the proxy to the target person affect the concordance between survey reports and medicare claims measures of health services use? Health Serv Res. 2016;51: 314–327. pmid:26059195
37. Bouscaren N, Dartois L, Boutron-Ruault M-C, Vercambre M-N. How do self and proxy dependency evaluations agree? Results from a large cohort of older women. Age and Ageing. 2018;47: 619–624. pmid:29733342
38. Ball AE, Russell EM, Seymour DG, Primrose WR, Garratt AM. Problems in using health survey questionnaires in older patients with physical disabilities. Gerontology. 2001;47: 334–340. pmid:11721148
39. Bassett SS, Magaziner J, Hebel JR. Reliability of proxy response on mental health indices for aged, community-dwelling women. 1990; 6. pmid:2317291
40. Hardy SE, Allore H, Studenski SA. Missing data: A special challenge in aging research. Journal of the American Geriatrics Society. 2009;57: 722–729. pmid:19220562
41. Zank S, Frank S. Family and professional caregivers’ ratings of dementia symptoms and activities of daily living of day care patients: Do differences change over time? Aging & Mental Health. 2002;6: 161–165. pmid:12028885
42. Rodgers WL, Herzog AR. Collecting data about the oldest old: Problems and procedures. The oldest old. New York: Oxford University Press; 1992. pp. 135–156.
43. Pickle LW, Brown LM, Blot WJ. Information available from surrogate respondents in case-control interview studies. American Journal of Epidemiology. 1983;118: 99–108. pmid:6869368
44. Rott C, d ‘Heureuse V, Kliegel M, Schönemann P, Becker G. Die Heidelberger Hundertjährigen-Studie: Theoretische und methodische Grundlagen zur sozialwissenschaftlichen Hochaltrigkeitsforschung. Zeitschrift für Gerontologie und Geriatrie. 2001;34: 356–364.
45. Sachdev PS, Levitan C, Crawford J, Sidhu M, Slavin M, Richmond R, et al. The Sydney Centenarian Study: methodology and profile of centenarians and near-centenarians. International Psychogeriatrics. 2013;25: 993–1005. pmid:23510643
46. Poon LW, Jazwinski M, Green RC, Woodard JL, Martin P, Rodgers WL, et al. Methodological considerations in studying centenarians: lessons learned from the Georgia centenarian studies. Annual review of gerontology and geriatrics. 2007;27: 231–264. pmid:21852888
47. Ribeiro O, Araújo L, Teixeira L, Duarte N, Brandão D, Martin I, et al. Health status, living arrangements, and service use at 100: Findings from the Oporto Centenarian Study. Journal of Aging & Social Policy. 2016;28: 148–164. pmid:27010687
48. MacDonald M, Martin P, Margrett J, Poon† LW. Correspondence of perceptions about centenarians’ mental health. Aging & Mental Health. 2009;13: 827–837. pmid:19888703
49. Schönemann-Gieck P, Rott Ch, Martin M, d’Heureuse V, Kliegel M, Becker G. Übereinstimmungen und Unterschiede in der selbst- und fremdeingeschätzten Gesundheit bei extrem Hochaltrigen. Z Gerontol Geriat. 2003;36: 429–436. pmid:14685732
50. Skafida V, Morrison F, Devaney J. Answer refused: Exploring how item non-response on domestic abuse questions in a social survey affects analysis. Survey Research Methods. 2022;16: 227–240.
51. Sheikh JI, Yesavage JA. Geriatric Depression Scale (GDS): recent evidence and development of a shorter version. Clinical Gerontologist: The Journal of Aging and Mental Health. 1986.
52. Lawton MP, Moss M, Hoffman C, Kleban MH, Ruckdeschel K, Winter L. Valuation of life: a concept and a scale. Journal of Aging and Health. 2001;13: 3–31. pmid:11503845
53. Sloan M, Wright D, Barrett K. Data comparability in a mixed mode telephone and face to face survey of persons with disabilities. Citeseer; 2012.
54. Boerner K, Jopp DS, Park M-KS, Rott C. Whom do centenarians rely on for support? Findings from the Second Heidelberg Centenarian Study. Journal of Aging & Social Policy. 2016;28: 165–186. pmid:26959657
55. Loosveldt G. Interaction characteristics of the difficult-to-interview respondent. International Journal of Public Opinion Research. 1997;9: 386–394.
56. Firth D. Bias reduction of maximum likelihood estimates. Biometrika. 1993;80: 27–38.
57. Heinze G, Schemper M. A solution to the problem of separation in logistic regression. Statistics in medicine. 2002;21: 2409–2419. pmid:12210625
58. StataCorp. Stata Statistical Software: Release 16. College Station, Texas, USA: StataCorp LLC; 2019.
59. Andresen EM, Vahle VJ, Lollar D. Proxy reliability: Health-related quality of life (HRQoL) measures for people with disability. Qual Life Res. 2001; 11. pmid:11822794
60. Kays K, Gathercoal K, Buhrow W. Does survey format influence self-disclosure on sensitive question items? Computers in Human Behavior. 2012;28: 251–256.
61. Creswell JW, Creswell JD. Research design: Qualitative, quantitative, and mixed methods approaches. Sage publications; 2017.
62. Vittinghoff E, Glidden DV, Shiboski SC, McCulloch CE. Regression methods in biostatistics: linear, logistic, survival, and repeated measures models. 2006.
63. Agresti A. Categorical data analysis. John Wiley & Sons; 2012.
64. Groves RM, Couper MP. Nonresponse in household interview surveys. John Wiley & Sons; 2012.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025 Zaccaria et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background
In recent years, there has been a growing interest in the investigation of very old individuals. However, various challenges arise when collecting data from this age group. Given potential health and cognitive impairments and the difficulty of retrieving accurate self-reported data, involving individuals knowledgeable of the target person as proxy respondents are an invaluable solution. The primary goal of this study is to explore the association between proxy-centenarian relationship and item nonresponse, a pivotal data quality measure.
Data and methods
We used data from the phone study conducted within SWISS100, a study on centenarians in Switzerland, focusing on a subsample of 94 centenarians and 75 proxy respondents. We compared characteristics of centenarians who provided self-reports with those interviewed through proxy respondents using descriptive analyses, and Firth’s regression models to investigate the association of different types of proxies (children, other relatives, and non-relatives) with item nonresponse.
Results
We noted differences between centenarians participating themselves in interviews and those represented by proxies, observing higher proportions of men and private home residents in the self-report group. There was a low prevalence of item nonresponse among proxy respondents. Proxy type significantly impacted item nonresponse, particularly with non-relatives showing notably higher unanswered question rates. A robust association between non-relatives as proxies and increased item nonresponse persisted even when controlling for potential confounding factors.
Discussion and conclusions
This study enhances the understanding of data quality, particularly item nonresponse, and highlights the impact of the proxy-centenarian relationship when studying the oldest-old. The findings stress the importance of carefully choosing proxy respondents, preferably children, to improve data quality and inclusivity for those individuals hard to reach or to interview. Future research should investigate various data quality indicators and rely on larger samples to enhance the representation and methodological rigour in the study of the oldest-old.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer