Responsiveness and minimal important change of seven PROMIS computerized adaptive tests (CAT) in patients with advanced chronic kidney disease

Terwee, Caroline B.; van der Willik, Esmee M.; van Breda, Fenna; van Jaarsveld, Brigit C.; van de Putte, Marlon; Jetten, Isabelle W.; Dekker, Friedo W.; Meuleman, Yvette; van Ittersum, Frans J.

doi:10.1186/s41687-023-00574-y

Research
Open access
Published: 04 April 2023

Responsiveness and minimal important change of seven PROMIS computerized adaptive tests (CAT) in patients with advanced chronic kidney disease

Caroline B. Terwee ORCID: orcid.org/0000-0003-4570-2826^1,2,
Esmee M. van der Willik^1,3,
Fenna van Breda⁴,
Brigit C. van Jaarsveld⁴,
Marlon van de Putte⁴,
Isabelle W. Jetten⁴,
Friedo W. Dekker³,
Yvette Meuleman³ &
…
Frans J. van Ittersum⁴

Journal of Patient-Reported Outcomes volume 7, Article number: 35 (2023) Cite this article

1814 Accesses
1 Altmetric
Metrics details

Abstract

Background

The Patient-Reported Outcomes Measurement Information System (PROMIS®) has the potential to harmonize the measurement of health-related quality of life (HRQL) across medical conditions. We evaluated responsiveness and minimal important change (MIC) of seven Dutch-Flemish PROMIS computerized adaptive tests (CAT) in Dutch patients with advanced chronic kidney disease (CKD).

Methods

CKD patients (eGFR < 30 ml/min.1.73m²) completed at baseline and after 6 months seven PROMIS CATs (assessing physical function, pain interference, fatigue, sleep disturbance, anxiety, depression, and ability to participate in social roles and activities), Short Form Health Survey 12 (SF-12), PROMIS Pain Intensity single item, Dialysis Symptom Index (DSI), and Global Rating Scales (GRS) of change. Responsiveness was assessed by testing predefined hypotheses about expected correlations among measures, area under the ROC Curve, and effect sizes. MIC was determined with predictive modelling.

Results

207 patients were included; 186 (90%) completed the follow-up. Most results were in accordance with expectations (70–91% of hypotheses confirmed), with some exceptions for PROMIS Anxiety and Ability to Participate (60% and 42% of hypotheses confirmed, respectively). For PROMIS Anxiety and Depression correlations with the GRS were too low (0.04 and 0.20, respectively) to calculate a MIC. MIC values, representing minimal important deterioration, ranged from 0.4 to 2.5 T-score points for the other domains.

Conclusion

We found sufficient responsiveness of PROMIS CATs Physical Function, Fatigue, Sleep Disturbance, and Depression. The results for PROMIS CATs Pain Interference were almost sufficient, but some results for Anxiety and Ability to Participate in Social Roles and Activities were not as expected. Reported MIC values should be interpreted with caution because most patients did not change.

Background

In the Netherlands, the Dialysis Symptom Index (DSI) and the generic 12-item Short-Form health survey (SF-12) are routine used in daily clinical care for patients with chronic kidney disease (CKD) [1]. However, the Patient-Reported Outcomes Measurement Information System (PROMIS®) was recently selected as preferred generic instruments for use in daily medical specialty care across conditions by a national working group of representatives of all umbrella organizations involved in Dutch medical specialist care together with PROM experts and patient organizations, under the auspices of the Dutch Ministry of Health, Welfare, and Sport (program “Outcomes Based Healthcare”) [2]. Internationally, a combination of PROMIS Global Health and PROMIS-29 has been recommended as one of three possible PROMs for use in patients with chronic kidney disease (CKD) by a consensus group of stakeholders of the International Consortium of Health Outcomes Measurement (ICHOM) [3].

PROMIS is a generic system of highly efficient, extensively validated patient-reported outcome measures (PROMs) that can be used to measure commonly relevant aspects of health-related quality of life (HRQOL), such as fatigue, anxiety, physical function, and social participation, in people with and without (chronic) medical conditions [4]. PROMIS consists of a collection of item banks. An item bank is a large set of questions that measure one domain (e.g. physical function). Item banks were developed using item response theory (IRT) modelling, and can be administered either as fixed short forms or as a computerized adaptive test (CAT). In a CAT, the computer selects questions from the item bank based on the answers to previous questions. The CAT is adapted to the symptom severity or functional level of the patient, resulting in questions that are likely more relevant to the patient. In addition, on average less questions are required to obtain similar or even more precise measurements compared to fixed PROMs measuring similar domains [5; 6]. Sufficient validity and reliability of PROMIS short forms and CAT was found in U.S. patients with CKD [7,8,9]. In a recent study we also found sufficient construct validity and test-retest reliability of seven PROMIS CATs (assessing physical function, pain interference, fatigue, sleep disturbance, anxiety, depression, and ability to participate in social roles and activities) in Dutch patients with advanced CKD [10]. However, responsiveness of PROMIS has not yet been studied in patients with CKD.

The recommendations for PROMIS in the Netherlands and abroad led to the desire to validate PROMIS in CKD patients and to compare the measurement properties of PROMIS to the SF-12. In a previous study, we found better reliability and smaller measurement error of PROMIS CATs compared to the SF-12, although PROMIS CATs required six to seven items per domain (45 items in total, using a high precision stopping rule of r = 0.95) as compared to 12 items for the SF-12. Seven CATs could be completed in on average 10.2 min as compared to 3.3 min for the SF-12. The aim of the current study was to assess responsiveness and minimal important change (MIC) of seven PROMIS CATs (Physical Function, Pain Interference, Fatigue, Sleep Disturbance, Anxiety, Depression, and Ability to Participate in Social Roles and Activities) in patients with advanced CKD, using 6 months follow-up data of this previous study [10].

Methods

Study design

A longitudinal study was performed, in which, after providing written informed consent, patients with advanced CKD were invited by e-mail to complete the PROMs digitally at the KLIK research platform (www.hetklikt.nu) at 3 time points; at inclusion (i.e. baseline), after 2 weeks (for assessing test-retest reliability, as described in a separate paper [10]) and after 6 months. We used baseline and 6 months measurements for this study. During follow-up patients received care as usual, which could include starting hemodialysis, peritoneal dialysis, or transplantation. The study was designed and reported according to COSMIN guidelines [11; 12]. A sample size of 100 was considered “very good” according to COSMIN [11].

Participants

We included adult patients with advanced CKD with an estimated glomerular filtration rate (eGFR) < 30 ml/min.1.73m², not receiving dialysis treatment. Exclusion criteria were start with kidney replacement therapy (KRT; dialysis or kidney transplantation) planned within 4 weeks after inclusion, rapid deterioration of kidney function (i.e. decrease in eGFR of > 20 ml/min.1.73 m² during the last 6 months before inclusion), not able to complete questionnaires due to cognitive impairment, poor knowledge of the Dutch language, or no informed consent. Patients were recruited between November 2020 and August 2021 by their nephrologist at the outpatient clinics of Amsterdam UMC and “Niercentrum aan de Amstel” in Amstelveen, the Netherlands [10]. The study population represents the population in which PROMs are being used in these centers. Eligible patients received written information by mail and were, if needed, approached by telephone after 2 weeks for further information. Patients without access to an electronic device with internet connection could participate by telephone.

Measures

We collected the following information from the medical records of the participants: age, gender, primary kidney disease according to European Renal Association codes [13], body mass index (BMI, weight (kg)/height (m)²), smoking status, comorbidities (hypertension, diabetes mellitus, cardiovascular disease, lung disease, liver disease and malignancy) as defined by ICHOM [3], eGFR (ml/min/1.73m², calculated with the CKD-EPI equation [14]) at each time point, kidney replacement therapy (KRT) in medical history, start of KRT during follow-up and death during follow-up. Patients reported educational level and ethno-cultural background at baseline.

Participants completed the following PROMs at baseline and at six months follow-up through the KLIK research platform [15], which is a PROM platform connected to the CAT software of the Dutch-Flemish Assessment Center, part of the Dutch-Flemish PROMIS National Center [16]:

Seven Dutch-Flemish PROMIS CATs [17]: v1.2 Physical Function, v1.1 Pain Interference, v1.0 Fatigue, v1.0 Sleep Disturbance, v1.0 Anxiety, v1.0 Depression, and v2.0 Ability to Participate in Social Roles and Activities. All items have five response options (e.g. ranging from ‘never’ to ‘always’ or from ‘not at all’ to ‘very much’). In this study, the CAT stopped when a SE of 2.2 on the T-score metric was reached (comparable to a reliability of approximately 0.95) or when a maximum of 12 items per CAT was administered. We used a lower SE compared to the standard stopping rule (i.e. SE: 3.0)[5] because a higher reliability may be preferable for routine care and by using this setting, the optimal performance of PROMIS CATs could be investigated. PROMIS CAT scores were calculated based on the original US item parameters, as per PROMIS convention, and are expressed as T-scores where a score of 50 represents the average score of the U.S. general population, with a SD of 10. Higher scores indicate more of the construct (e.g. a higher score for Depression means more depressive symptoms, a higher score for Physical Function means more [better] physical functioning). In addition, for comparison with the SF-12 component summary scores, we calculated the PROMIS-29 physical and mental health summary scores [18]. Finally, for descriptive purposes only, we also calculated the PROMIS-Preference (PROPr) score, which provides a preference-based summary score (health utility) for economic evaluations. The PROPs score was calculated according to the prediction model described by DeWitt et al., using preferences from the US population [19].
PROMIS item v1.0 Numerical Rating Scale Pain Intensity 1a, a single item with a 0–10 scale, with higher scores indicating more pain.
12-item Short-Form health survey (SF-12) version 2 [21; 22], a generic PROM assessing the following aspects of HRQOL: physical functioning, role-physical, bodily pain, general health, vitality, social functioning, role-emotional and mental health. To enable comparison with PROMIS domains, we calculated eight domain scores (not part of the official SF-12 scoring). For physical functioning, physical and emotional role functioning, and mental health the two available items were summarized. For bodily pain, vitality, social functioning, and general health single items were used. Additionally, we calculated the overall physical component summary (PCS) score and the mental component summary (MCS) score based on weighted summaries of all items, using the standard SF-12 scoring algorithm. Domain scores were transformed to a score from 0 to 100, while the PCS and PCS scores have a mean of 50, representing the average score of the U.S. general population, with a SD of 10. Higher scores indicate better HRQOL. The SF-12 showed sufficient validity in patients with CKD [22,23,24] and is routinely used in Dutch nephrology care [25].
Dialysis Symptom Index (DSI) [26], a 30-item kidney disease specific PROM to assess physical and emotional symptom burden. Patients report the presence of 30 symptoms (yes/no) during the past week and, if present, the burden of each symptom on a 5-point polytomous response scale ranging from 1 ‘not at all’ to 5 ‘very much’ bothersome. We calculated two sum scores: (1) total number of symptoms present (0–30 symptoms), and (2) total symptom burden score, which is the sum of burden on individual symptoms ranging from 0 (no symptoms) to 150 (all 30 symptoms are present and very much bothersome) [26; 28]. The DSI items ‘feeling tired or lack of energy’, ‘feeling anxious’, ‘trouble falling asleep’ and ‘trouble staying asleep’ (the latter two items are hereafter combined as ‘sleep problems’) were used as comparison items in the responsiveness analyses of the PROMIS CATs since these items intend to measure constructs comparable to the PROMIS CAT domains Fatigue, Anxiety and Sleep Disturbance, respectively. The DSI showed sufficient validity in patients with CKD [26] and is routinely used in Dutch nephrology care [25].
At six months follow-up patients were also asked to rate their perceived change in each of the seven PROMIS domains on a Global Rating Scale (GRS) (e.g. `How did your fatigue change compared to 6 months ago?`). Perceived change was rated on a five-point scale (much worse, a little worse, no change, a little better, much better).

The PROMs (seven PROMIS CATs, SF-12 and DSI) were presented in random order across patients, but with fixed order within patients during follow-up (i.e. an individual patient received the measures in the same order each time but the order differed from patient to patient). The KLIK platform did not allow for any missing values within questionnaires.

Responsiveness

Responsiveness of the PROMIS CATs was determined by comparing changes in PROMIS CAT T-scores to changes in scores of the PROMIS Pain Intensity, SF-12 (MCS, PCS and separate domains), and DSI (items/domains and overall), and to the GRS. On average, we expected that patients with advanced CKD would slightly deteriorate in physical functioning and participation and would not much change in mental functioning over a period of 6 months [29; 30]. Therefore, we expected relatively low correlations. However, we expected that there would be at least some variation in outcomes and that some patients would improve and some patients would deteriorate and that this variation would be sufficient to evaluate the responsiveness of the PROMIS CATs [30]. To support the responsiveness of PROMIS CAT, we hypothesized that the correlations between changes in PROMIS CAT T-scores and changes in comparable domains of the comparator instruments would be at least 0.40 (rather than 0.50 suggested by COSMIN) [31]. Furthermore, we hypothesized that per comparator instrument, the correlations between changes in PROMIS CAT T-scores and changes in the comparator instrument should be the highest for comparable domains (Table 1) [31]. Although the aim of the study was not to validate the PROMIS-29 summary scores, we expected that the correlations between the PROMIS-29 summary scores and the SF-12 component scores would be at least 0.40. We also expected that changes in all PROMIS CAT T-scores would be related to changes in the total number of symptoms and changes in symptom severity, as measured with the DSI. We expected that these correlations would be higher than the correlations with other DSI scores but lower than the correlations with changes in similar domains of the SF-12. We also calculated effect sizes for all PROM scores (defined as mean change divided by baseline standard deviation) and we expected at least similar or slightly higher effect sizes for PROMIS CAT compared to comparable SF-12 domains and DSI items. Finally, we examined the ability of the PROMIS CATs to distinguish between patients who reported to be deteriorated (a little worse or much worse on the GRS) and patients who reported to be not deteriorated (no change, a little better, much better on the GRS). The area under the Receiver Operating Characteristics (ROC) Curve (AUC) was used as a measure of responsiveness. An AUC of at least 0.70 is generally considered sufficient evidence for responsiveness [31]. We considered responsiveness sufficient if at least 75% of the results were in accordance with the hypotheses.

Table 1 Expected and observed correlations between PROMIS CAT change scores and change scores in SF-12 and DSI

Full size table

Minimal important change

MIC was defined as the smallest change in score that patients consider important [32]. Because patients were expected to deteriorate, a minimal important deterioration was calculated instead of a minimal important improvement. A prerequisite for calculating the MIC was a correlation between the PROMIS CAT change score and the GRS of at least 0.30 [33]. The MIC was estimated using predictive modelling, where the MIC was defined as the change score where the post-test probability of belonging to the deteriorated group equals the pre-test probability (i.e. the proportion deteriorated patients) [34]. Terluin et al. showed that the predictive modelling approach is more precise than the commonly used ROC method [34] and that ROC MIC values are biased when the percentage of deteriorated (or improved) patients is not 50% [35]. The predictive modelling approach can correct for this. MIC values were therefore adjusted for the high proportion of deteriorated patients [35], and bootstrapping was used to obtain confidence intervals.

Results

Participants

Almost half of the patients that were approached by their nephrologist provided written informed consent (for details, see [10]). In total, 207 participants completed the baseline measurement and 186 (90%) participants completed the 6 months follow-up. Characteristics of the study population are shown in Table 2. Mean (SD) age was 65.5 (13.8) and 60% were male. Mean (SD) eGRF at baseline was 21.4 (6.7). Mean (SD) eGRF at follow-up was 22.9 (10.5). During follow-up, 12 patients died, six patients started hemodialysis, one patient started peritoneal dialysis, and six patients were transplanted.

Table 2 Characteristics of the study population at baseline (n = 207)

Full size table

Scores and changes in scores of PROMIS CATs and other PROMs at baseline and at 6 month follow-up are presented in Table 3. Patients with advanced CKD had lower physical function (43.6) and higher pain interference (51.9) and fatigue (53.2) than the average population values of 50, while scores for the other domains were closer to 50. Mean changes in scores after 6 months were very small for all PROMs (≤ 1.1 T-score point for PROMIS CATs and < 1 point for SF-12 domains).

Table 3 Mean(SD) PROM (change) scores at baseline and 6 months follow-up and effect sizes

Full size table

Responsiveness

Correlations between changes in PROMIS CAT T-scores and changes in SF-12, changes in DSI scores, and GRS scores are presented in Table 4. Effect sizes are presented in Table 3. Table 5 provides an overview of how many hypotheses were confirmed. For PROMIS CAT Physical Function, Fatigue, Sleep Disturbance, and Depression sufficient responsiveness was found as more than 75% of the results were in accordance with the hypotheses. For PROMIS CAT Pain Interference and Anxiety, 70% and 60% of the results were in accordance with the hypotheses. For PROMIS CAT Ability to Participate in Social Roles and Activities only 42% of the results were in accordance with the hypotheses. As expected, the correlations between the PROMIS-29 summary scores and the SF-12 component scores were higher than 0.40 (0.52 for the physical scores, and 0.44 for the mental scores, respectively).

Table 4 Area under the ROC curve (AUC), representing the ability of PROMIS CATs to distinguish patients who deteriorated from patients who did not deteriorate, and minimal important change (MIC), representing a minimal important deterioration

Full size table

Table 5 Number of responsiveness hypotheses confirmed

Full size table

Minimal important change

Supplementary Table 1 presents changes in PROMIS CAT T-scores across all categories of the GRS. Because the much improved and much worse groups were small, the means were not always monotonically ordered, although for most domains the mean changes were largest in the much improved and much worse groups, as expected. Because the correlations between the PROMIS CAT change scores and the GRS were much lower than 0.30 for Anxiety and Depression, a MIC for these domains was not calculated. The MIC representing minimal important deterioration was − 1.6 T-score points for PROMIS Physical Function, 1.6 for Pain Interference, 0.4 for Fatigue, 1.1 for Sleep Disturbance, and − 2.5 for the Ability to Participate in Social Roles and Activities.

Discussion

The aim of this study was to assess responsiveness and minimal important change (MIC) of seven PROMIS CATs in patients with advanced CKD, measuring physical function, pain interference, fatigue, sleep disturbances, anxiety, depression, and the ability to participate in social roles and activities. On average, we expected that patients with advanced CKD would slightly deteriorate in physical functioning and participation and would not much change in mental functioning over a period of 6 months. This was indeed reflected in the changes in PROMIS scores. The pattern of correlations between change scores supported the responsiveness of the PROMIS CATs for Physical Function, Fatigue, Sleep Disturbance, and Depression, almost for Pain Interference but not for Anxiety and Ability to Participate in Social Roles and Activities. MIC values, representing the minimal important deterioration, ranged from 0.4 to 2.5 T-score points for the PROMIS CATs, except for Anxiety and Depression, for which MIC values could not be estimated.

For PROMIS CAT Ability to Participate in Social Roles and Activities only 42% of the results were in accordance with the predefined hypotheses for responsiveness. This was due to lower than expected correlations between change in PROMIS Ability to Participate and change in SF-12 Physical and Emotional role functioning (0.36 and 0.34, rather than ≥ 0.40) and a higher than expected correlation between change in PROMIS Ability to Participate and change in SF-12 MCS (0.42). We expected this latter correlation to be lower than the correlations with change in SF-12 Role-physical (0.36), Social functioning (0.41), and Role-emotional (0.34). A possible explanation could be that these SF-12 domain scores are based on one or two items only, which makes the correlations difficult to estimate The effect size of the PROMIS Ability to Participate CAT was the highest of all PROMIS CATs, so it may be too strict to argue that this PROMIS CAT is not responsive. For PROMIS Anxiety and PROMIS Depression the correlation with the GRS were also lower than expected (0.04 and 0.20, respectively). We do not have an explanation why some of the correlations were lower than expected. Perhaps response shift (i.e. a change in how patients experience their health because they adapted to their disease) or the fact that many patients did not change, played a role, but chance can also not be ruled out because we calculated many correlations. Also, predefining the magnitude of expected correlations is challenging.

This is the first study examining the responsiveness of PROMIS measures in patients with CKD. The responsiveness of PROMIS measures has been studied in patients with other chronic conditions, such as multiple sclerosis [36], COPD [37], chronic low back pain [38], and rheumatoid arthritis [39]. These studies also reported low changes in PROMIS (and other PROM) scores, because of relatively short follow-up periods, during which most patients did not change. This is an important challenge in studies assessing responsiveness in patients with chronic conditions and a limitation of this study because the aim of a responsiveness study is to detect change and patients with chronic conditions may not change much during the relatively short period of a study [32]. A longer follow-up period may lead to more variation in change scores and subsequently higher correlations between change scores.

Indirect evidence for responsiveness of PROMIS CATs was found in other studies. Sufficient construct validity and test-retest reliability was found in multiple studies in CKD patients [8–10; 41; 42]. In theory, this does not guarantee sufficient responsiveness as responsiveness may be limited due to floor or ceiling effects, but floor and ceiling effects are seldom found for PROMIS CAT, because of the large underlying item banks [42,43,44]. Therefore, these previous studies also support the responsiveness of PROMIS CATs in CKD patients, at least to some extent.

For PROMIS Anxiety and PROMIS Depression the correlation with the GRS were too low to calculate a MIC. This was probably due to the high proportion of patients who reported no change in anxiety (74.6%) or depression (69.7%). The estimated MIC values in this study were relatively low (0.4–2.5 T-score points) and confidence intervals were wide. A MIC value of 0.4 (representing an effect size of 0.04 of the T-score metric) for PROMIS Fatigue might be considered implausible because such a small change may not even be noticeable by patients. As stated above, the study design was not optimal because most patients did not change. Therefore, these MIC values should be interpreted with caution. A recent systematic review of PROMIS MIC values suggested that MIC values of 2–6 T-score points are reasonable to assume for PROMIS measures [45]. However, most studies included in the systematic review estimated minimal important improvement, while we estimated minimal important deterioration. Some studies also found lower MIC values for deterioration than for improvement [47; 48] but others did not find different MIC values [49; 50]. Therefore, it remains important to estimate MIC values separately for improvement and deterioration and studies with longer follow-up are needed to estimate MIC values in patients with chronic conditions.

Using PROMs in clinical practice can support the delivery of person-centered care through shared decision-making and management in CKD patients [50]. PROMIS has been recommended in national and international initiatives [2; 3]. Although the responsiveness of some of the PROMIS CATs was not sufficiently convincing, considering the evidence that we found, in combination with evidence from previous studies on construct validity, test-retest reliability, and a content comparison with the SF-12 [10], as well as the psychometric evidence and widespread implementation of PROMIS in other fields [51], we recommend the use of PROMIS in clinical practice. Reference scores of the PROMIS domains from the Dutch general population are available for all domains included in this study [52,53,54,55,56]. Graphical PROMIS feedback has been developed to facilitate conversations with patients [57]. PROMIS CATs are available in several electronic PROM platforms and some electronic health records (e.g. Epic) and implementation guides and training sources are available on the HealthMeasures website [58]. Administering seven PROMIS CATs takes more time than completing the SF-12 but seven PROMIS CATs can be administered within 10 min on average. An advantage of PROMIS is that each domain can be measured with a separate instrument, which provides flexibility to choose which domains to measure in studies or clinical applications. PROMIS CAT requires access to a computer and internet, which may be a limitation for some people currently. In our study, 11 patients (5%) participated by telephone. However, computer and internet use is rapidly increasing. Electronic PROM systems may also assist with remote monitoring of symptoms and functions and may encourage patients to become more engaged with their care [59]. If CAT software is not available, PROMIS short forms can be used. Although short forms may perform slightly less good than CATs, they are widely used and available in more than 60 languages and scores are directly comparable to CAT scores [59]. Healthcare providers and patients need to decide which PROMs are most relevant and feasible to use. Although there is some overlap in content with the PROMIS CATs, the DSI might be of additional value because it measures disease-specific symptoms which are not covered by PROMIS.

Conclusion

We found sufficient responsiveness of the PROMIS CATs Physical Function, Fatigue, Sleep Disturbance, and Depression. The results for PROMIS CATs Pain Interference were almost sufficient, but some of the results for the Anxiety and Ability to Participate in Social Roles and Activities were not in line with predefined hypotheses. MIC values, representing the minimal important deterioration, ranged from 0.4 to 2.5 T-score points, but should be interpreted with caution because most patients did not change during the follow-up of this study.

Data Availability

The data used for this research is available upon request. Contact information: Caroline B. Terwee, cb.terwee@amsterdamumc.nl.

Abbreviations

AUC:: Area under the ROC Curve
CAT:: Computerized Adaptive Test
CKD:: Chronic Kidney Disease
DSI:: Dialysis Symptom Index
GRS:: Global Rating Scale
HRQL:: Health-Related Quality of Life
IRT:: Item Response Theory
MIC:: Minimal Important Change
PROMIS:: Patient-Reported Outcomes Measurement Information System
ROC curve:: Receiver Operating Characteristics Curve
SF-12:: Short Form Health Survey 12

References

van der Willik EM, Meuleman Y, Prantl K, van Rijn G, Bos WJW, van Ittersum FJ, Bart HAJ, Hemmelder MH, Dekker FW (2019) Patient-reported outcome measures: selection of a valid questionnaire for routine symptom assessment in patients with advanced chronic kidney disease - a four-phase mixed methods study. BMC Nephrol 20(1):344
Article PubMed PubMed Central Google Scholar
Oude Voshaar MA, Terwee CB, Haverman L, Determann D, Verburg M, van der Wees PJ, Beurskens A (2023) Development of a standardized set of generic set PROs and PROMs for Dutch medical specialist care. A consensus based co-creation approach. Qual Life Res, 2023 Feb 9. Online ahead of print
Verberne WR, Das-Gupta Z, Allegretti AS, Bart HAJ, van Biesen W, García-García G, Gibbons E, Parra E, Hemmelder MH, Jager KJ, Ketteler M, Roberts C, Al Rohani M, Salt MJ, Stopper A, Terkivatan T, Tuttle KR, Yang C-W, Wheeler DC, Bos WJW (2019) Development of an International Standard Set of Value-Based outcome measures for patients with chronic kidney disease: a report of the International Consortium for Health Outcomes Measurement (ICHOM) CKD Working Group, vol 73. American Journal of Kidney Diseases, pp 372–384. 3
Cella D, Riley W, Stone A, Rothrock N, Reeve B, Yount S, Amtmann D, Bode R, Buysse D, Choi S, Cook K, Devellis R, DeWalt D, Fries JF, Gershon R, Hahn EA, Lai JS, Pilkonis P, Revicki D, Rose M, Weinfurt K, Hays R (2010) The patient-reported outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005–2008. J Clin Epidemiol 63(11):1179–1194
Article PubMed PubMed Central Google Scholar
HealthMeasures (2021) Intro to PROMIS®. Retrieved Nov 22, 2021, from https://www.healthmeasures.net/explore-measurement-systems/promis/intro-to-promis
Cella D, Riley W, Stone A, Rothrock N, Reeve B, Yount S, Amtmann D, Bode R, Buysse D, Choi S, Cook K, DeVellis R, DeWalt D, Fries JF, Gershon R, Hahn EA, Lai J-S, Pilkonis P, Revicki D, Rose M, Weinfurt K, Hays R (2010) The patient-reported outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005–2008. J Clin Epidemiol 63(11):1179–1194
Article PubMed PubMed Central Google Scholar
Tang E, Ekundayo O, Peipert JD, Edwards N, Bansal A, Richardson C, Bartlett SJ, Howell D, Li M, Cella D, Novak M, Mucsi I (2019) Validation of the patient-reported outcomes Measurement Information System (PROMIS)-57 and – 29 item short forms among kidney transplant recipients. Qual Life Res 28(3):815–827
Article PubMed Google Scholar
Selewski DT, Massengill SF, Troost JP, Wickman L, Messer KL, Herreshoff E, Bowers C, Ferris ME, Mahan JD, Greenbaum LA, MacHardy J, Kapur G, Chand DH, Goebel J, Barletta GM, Geary D, Kershaw DB, Pan CG, Gbadegesin R, Hidalgo G, Lane JC, Leiser JD, Song PX, Thissen D, Liu Y, Gross HE, DeWalt DA, Gipson DS (2014) Gaining the patient reported Outcomes Measurement Information System (PROMIS) perspective in chronic kidney disease: a Midwest Pediatric Nephrology Consortium study. Pediatr Nephrol 29(12):2347–2356
Article PubMed PubMed Central Google Scholar
Hussain J, Chawla G, Rafiqzad H, Huang S, Bartlett SJ, Li M, Howell D, Peipert JD, Novak M, Mucsi I (2022) Validation of the PROMIS sleep disturbance item bank computer adaptive test (CAT) in patients on renal replacement therapy. Sleep Med 90:36–43
Article PubMed Google Scholar
van der Willik EM, van Breda F, van Jaarsveld BC, van der Putte M, Jetten IW, Dekker FW, Meuleman Y, van Ittersum FJ, Terwee CB (2023). Validity and reliability of patient-reported outcomes Measurement Information System (PROMIS®) using computerized adaptive testing (CAT) in patients with advanced chronic kidney disease. Nephrol Dial Transplant 2022 Aug 1. Online ahead of print.
COSMIN. COSMIN Study Design checklist https://www.cosmin.nl/tools/checklists-assessing-methodological-study-qualities/
Gagnier JJ, Lai J, Mokkink LB, Terwee CB (2021) COSMIN reporting guideline for studies on measurement properties of patient-reported outcome measures. Qual Life Res 30(8):2197–2218
Article PubMed Google Scholar
(2021). ERA-EDTA Registry Annual Report, Amsterdam UMC (2019) location AMC, Department of Medical Informatics, Amsterdam, the Netherlands: European Renal Association
Levey AS, Stevens LA, Schmid CH, Zhang YL, Castro AF 3rd, Feldman HI, Kusek JW, Eggers P, Van Lente F, Greene T, Coresh J (2009) A new equation to estimate glomerular filtration rate. Ann Intern Med 150(9):604–612
https://www.hetklikt.nu/
www.dutchflemishpromis.nl
Terwee CB, Roorda LD, de Vet HC, Dekker J, Westhovens R, van Leeuwen J, Cella D, Correia H, Arnold B, Perez B, Boers M (2014) Dutch-flemish translation of 17 item banks from the patient-reported outcomes measurement information system (PROMIS). Qual Life Res 23(6):1733–1741
CAS PubMed Google Scholar
Hays RD, Spritzer KL, Schalet BD, Cella D (2018) PROMIS((R))-29 v2.0 profile physical and mental health summary scores. Qual Life Res 27(7):1885-1891
Dewitt B, Jalal H, Hanmer J (2020) Computing PROPr Utility Scores for PROMIS® Profile Instruments. Value Health 23(3):370–378
Article PubMed Google Scholar
Ware J Jr, Kosinski M, Keller SD (1996) A 12-Item short-form Health Survey: construction of scales and preliminary tests of reliability and validity. Med Care 34(3):220–233
Article PubMed Google Scholar
Ware JE Jr (2000) SF-36 health survey update. Spine (Phila Pa 1976) 25(24):3130–3139
Article PubMed Google Scholar
Loosman WL, Hoekstra T, van Dijk S, Terwee CB, Honig A, Siegert CE, Dekker FW (2015) Short-form 12 or short-form 36 to measure quality-of-life changes in dialysis patients? Nephrol Dial Transplant 30(7):1170–1176
Article PubMed Google Scholar
Østhus TB, Preljevic VT, Sandvik L, Leivestad T, Nordhus IH, Dammen T, Os I (2012) Mortality and health-related quality of life in prevalent dialysis patients: comparison between 12-items and 36-items short-form health survey. Health Qual Life Outcomes 10:46
Article PubMed PubMed Central Google Scholar
Pakpour AH, Nourozi S, Molsted S, Harrison AP, Nourozi K, Fridlund B (2011) Validity and reliability of short form-12 questionnaire in iranian hemodialysis patients. Iran J Kidney Dis 5(3):175–181
PubMed Google Scholar
van der Willik EM, Hemmelder MH, Bart HAJ, van Ittersum FJ, Hoogendijk-van den Akker JM, Bos WJW, Dekker FW, Meuleman Y (2021) Routinely measuring symptom burden and health-related quality of life in dialysis patients: first results from the dutch registry of patient-reported outcome measures. Clin Kidney J 14(6):1535–1544
Article PubMed Google Scholar
Weisbord SD, Fried LF, Arnold RM, Rotondi AJ, Fine MJ, Levenson DJ, Switzer GE (2004) Development of a symptom assessment instrument for chronic hemodialysis patients: the Dialysis Symptom Index. J Pain Symptom Manage 27(3):226–240
Article PubMed Google Scholar
Abdel-Kader K, Unruh ML, Weisbord SD (2009) Symptom burden, depression, and quality of life in chronic and end-stage kidney disease. Clin J Am Soc Nephrol 4(6):1057–1064
Article PubMed PubMed Central Google Scholar
de Goeij MC, Ocak G, Rotmans JI, Eijgenraam JW, Dekker FW, Halbesma N (2014) Course of symptoms and health-related quality of life during specialized pre-dialysis care.PLoS One, 9(4), e93069
de Rooij ENM, Meuleman Y, de Fijter JW, Le Cessie S, Jager KJ, Chesnaye NC, Evans M, Pagels AA, Caskey FJ, Torino C, Porto G, Szymczak M, Drechsler C, Wanner C, Dekker FW, Hoogeveen EK (2022) Quality of life before and after the start of Dialysis in older patients. Clin J Am Soc Nephrol 17(8):1159–1167
Article PubMed Google Scholar
Meuleman Y, Chilcot J, Dekker FW, Halbesma N, van Dijk S (2017) Health-related quality of life trajectories during predialysis care and associated illness perceptions. Health Psychol 36(11):1083–1091
Article PubMed Google Scholar
Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW, Terwee CB (2018) COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res 27(5):1147–1157
Article CAS PubMed PubMed Central Google Scholar
de Vet HCW, Terwee CB, Mokkink LB, Knol DL (2011) Measurement in medicine. Cambridge University Press, Cambridge
Book Google Scholar
Revicki D, Hays RD, Cella D, Sloan J (2008) Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. J Clin Epidemiol 61(2):102–109
Article PubMed Google Scholar
Terluin B, Eekhout I, Terwee CB, de Vet HC (2015) Minimal important change (MIC) based on a predictive modeling approach was more precise than MIC based on ROC analysis. J Clin Epidemiol 68(12):1388–1396
Article PubMed Google Scholar
Terluin B, Eekhout I, Terwee CB (2017) The anchor-based minimal important change, based on receiver operating characteristic analysis or predictive modeling, may need to be adjusted for the proportion of improved patients. J Clin Epidemiol 83:90–100
Article PubMed Google Scholar
Kamudoni P, Johns J, Cook KF, Salem R, Salek S, Raab J, Middleton R, Henke C, Repovic P, Alschuler K, von Geldern G, Wundes A, Amtmann D (2021) Standardizing fatigue measurement in multiple sclerosis: the validity, responsiveness and score interpretation of the PROMIS SF v1.0 - fatigue (MS) 8a. Mult Scler Relat Disord 54:103117
Article PubMed Google Scholar
Yount SE, Atwood C, Donohue J, Hays RD, Irwin D, Leidy NK, Liu H, Spritzer KL, DeWalt DA (2019) Responsiveness of PROMIS® to change in chronic obstructive pulmonary disease. J Patient Rep Outcomes 3(1):65
Article PubMed PubMed Central Google Scholar
Khutok K, Janwantanakul P, Jensen MP, Kanlayanaphotporn R (2021) Responsiveness of the PROMIS-29 Scales in individuals with chronic low back Pain. Spine (Phila Pa 1976) 46(2):107–113
Article PubMed Google Scholar
Hays RD, Spritzer KL, Fries JF, Krishnan E (2015) Responsiveness and minimally important difference for the patient-reported outcomes measurement information system (PROMIS) 20-item physical functioning short form in a prospective observational study of rheumatoid arthritis. Ann Rheum Dis 74(1):104–107
Article PubMed Google Scholar
Tang E, Ekundayo O, Peipert JD, Edwards N, Bansal A, Richardson C, Bartlett SJ, Howell D, Li M, Cella D, Novak M, Mucsi I (2019) Validation of the patient-reported outcomes Measurement Information System (PROMIS)-57 and – 29 item short forms among kidney transplant recipients. Qual Life Res 28(3):815–827
Article PubMed Google Scholar
Troost JP, Gipson DS, Carlozzi NE, Reeve BB, Nachman PH, Gbadegesin R, Wang J, Modersitzki F, Massengill S, Mahan JD, Liu Y, Trachtman H, Herreshoff EG, DeWalt DA, Selewski DT (2019) Using PROMIS® to create clinically meaningful profiles of nephrotic syndrome patients. Health Psychol 38(5):410–421
Article PubMed PubMed Central Google Scholar
Baron JE, Parker EA, Wolf BR, Duchman KR, Westermann RW (2021) PROMIS Versus Legacy patient-reported outcome measures for Sports Medicine Patients undergoing arthroscopic knee, shoulder, and hip interventions: a systematic review. Iowa Orthop J 41(2):58–71
PubMed PubMed Central Google Scholar
Kuijlaars IAR, Teela L, van Vulpen LFD, Timmer MA, Coppens M, Gouw SC, Peters M, Kruip M, Cnossen MH, Muis JJ, van Hoorn ES, Haverman L, Fischer K (2021) Generic PROMIS item banks in adults with hemophilia for patient-reported outcome assessment: feasibility, measurement properties, and relevance.Res Pract Thromb Haemost, 5(8), e12621
Lu Y, Beletsky A, Nwachukwu BU, Patel BH, Okoroha KR, Verma N, Cole B, Forsythe B (2020) Performance of PROMIS physical function, Pain Interference, and Depression Computer adaptive tests Instruments in patients undergoing meniscal surgery. Arthrosc Sports Med Rehabil 2(5):e451–e459
Article PubMed PubMed Central Google Scholar
Terwee CB, Peipert JD, Chapman R, Lai J-S, Terluin B, Cella D, Griffith P, Mokkink LB (2021) Minimal important change (MIC): a conceptual clarification and systematic review of MIC estimates of PROMIS measures. Qual Life Res 30(10):2729–2754
Article PubMed PubMed Central Google Scholar
Hageman D, de Wit M, van den Houten MML, Gommans LNM, Scheltinga MRM, Teijink JAW (2022) Vascular quality of Life Questionnaire-6 before and after supervised Exercise Therapy in patients with intermittent claudication. Eur J Vasc Endovasc Surg 63(3):457–463
Article PubMed Google Scholar
Oosterveer DM, van den Berg C, Volker G, Wouda NC, Terluin B, Hoitsma E (2022) Determining the minimal important change of the 6-minute walking test in multiple sclerosis patients using a predictive modelling anchor-based method. Mult Scler Relat Disord 57:103438
Article PubMed Google Scholar
Kawahara T, Taira N, Shiroiwa T, Hagiwara Y, Fukuda T, Uemura Y, Mukai H (2022) Minimal important differences of EORTC QLQ-C30 for metastatic breast cancer patients: results from a randomized clinical trial. Qual Life Res 31(6):1829–1836
Article PubMed PubMed Central Google Scholar
Mulder MLM, Bertram AM, Wenink MH, Vriezekolk JE (2022) Defining the minimal important change (MIC) and meaningful change value (MCV) of the psoriatic arthritis disease activity score (PASDAS) in a routine practice cohort of psoriatic arthritis patients. Rheumatology (Oxford). 61(10):4119-4123
Anderson NE, McMullan C, Calvert M, Dutton M, Cockwell P, Aiyegbusi OL, Kyte D (2021) Using patient-reported outcome measures during the management of patients with end-stage kidney disease requiring treatment with haemodialysis (PROM-HD): a qualitative study.BMJ Open, 11(8), e052629
Smith AW, Jensen RE (2019) Beyond methods to applied research: realizing the vision of PROMIS®. Health Psychol 38(5):347–350
Article PubMed Google Scholar
Elsman EBM, Flens G, de Beurs E, Roorda LD, Terwee CB (2022) Towards standardization of measuring anxiety and depression: Differential item functioning for language and dutch reference values of PROMIS item banks.PLoS One, 17(8), e0273287
Terwee CB, Roorda LD (2023) Country-specific reference values for PROMIS(®) pain, physical function and participation measures compared to US reference values. Ann Med 55(1):1–11
Article PubMed Google Scholar
Terwee CB, van Litsenburg RRL, Elsman EBM, Roorda LD (2023) Psychometric properties and reference values of the patient-reported outcomes Measurement Information System (PROMIS) sleep item banks in the dutch general population. J Sleep Res 32(2):e13753
Terwee CB, Crins MHP, Boers M, de Vet HCW, Roorda LD (2019) Validation of two PROMIS item banks for measuring social participation in the dutch general population. Qual Life Res 28:211–220
Article CAS PubMed Google Scholar
Terwee CB, Elsman EBM, Roorda LD (2022) Towards standardization of fatigue measurement: psychometric properties and reference values of the PROMIS fatigue item bank in the dutch general population. Res Methods Med Health Sciences 3:86–98
Article Google Scholar
van Muilekom MM, Luijten MAJ, van Oers HA, Terwee CB, van Litsenburg RRL, Roorda LD, Grootenhuis MA, Haverman L (2021) From statistics to clinics: the visual feedback of PROMIS® CATs. J Patient Rep Outcomes 5(1):55
Article PubMed PubMed Central Google Scholar
https://www.healthmeasures.net/implement-healthmeasures/implement-for-patient-care/implementation-guides
Aiyegbusi OL, Kyte D, Cockwell P, Anderson N, Calvert M (2017) A patient-centred approach to measuring quality in kidney care: patient-reported outcome measures and patient-reported experience measures. Curr Opin Nephrol Hypertens 26(6):442–449
Article PubMed Google Scholar
https://www.healthmeasures.net/explore-measurement-systems/promis/intro-to-promis/available-translations

Download references

Acknowledgements

The authors are grateful to all the patients and healthcare professionals for their participation in and contributions to this study.

Funding

This study was supported by an unrestricted grant from the Dutch Kidney Foundation (18SWO03). The funding organization had no role in the study design; collection, analysis and interpretation of the data; writing the report and the decision to submit the report for publication.

Author information

Authors and Affiliations

Department of Epidemiology and Data Science, Amsterdam UMC location Vrije Universiteit, P.O. box 7057, Amsterdam, 1007 MB, the Netherlands
Caroline B. Terwee & Esmee M. van der Willik
Amsterdam Public Health research institute, Methodology, Amsterdam, The Netherlands
Caroline B. Terwee
Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
Esmee M. van der Willik, Friedo W. Dekker & Yvette Meuleman
Department of Nephrology, Amsterdam University Medical Centers, Amsterdam, The Netherlands
Fenna van Breda, Brigit C. van Jaarsveld, Marlon van de Putte, Isabelle W. Jetten & Frans J. van Ittersum

Authors

Caroline B. Terwee
View author publications
You can also search for this author in PubMed Google Scholar
Esmee M. van der Willik
View author publications
You can also search for this author in PubMed Google Scholar
Fenna van Breda
View author publications
You can also search for this author in PubMed Google Scholar
Brigit C. van Jaarsveld
View author publications
You can also search for this author in PubMed Google Scholar
Marlon van de Putte
View author publications
You can also search for this author in PubMed Google Scholar
Isabelle W. Jetten
View author publications
You can also search for this author in PubMed Google Scholar
Friedo W. Dekker
View author publications
You can also search for this author in PubMed Google Scholar
Yvette Meuleman
View author publications
You can also search for this author in PubMed Google Scholar
Frans J. van Ittersum
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors E.vdW., F.vB., B.vJ., F.vI., and C.T. designed the study. The authors E.vdW., M.vdP., and I.J. collected the data. Author C.T. conducted the data analysis and drafted the manuscript. Y.M., F.vI., and C.T. provided supervision and mentorship. All authors (E.vdW., F.vB., B.vJ., M.vdP., I.J., F.D., Y.M., F.vI., and C.T.) supported the interpretation of results, provided important intellectual content, and revised the final version of the manuscript. All authors provided final approval of the version to be published.

Corresponding author

Correspondence to Caroline B. Terwee.

Ethics declarations

Ethics approval and consent to participate

The study was reviewed by the Medical Ethics Review Committee of VU University Medical Center in the Netherlands, which confirmed that the Dutch Medical Research Involving Human Subjects Act (WMO) does not apply to this study. Patients provided written informed consent.

Consent for publication

Not applicable.

Competing Interests

C.T. is past board member of the PROMIS Health Organization and representative of the Dutch-Flemish PROMIS National Center. The other authors declare that they have no competing interests. None of the authors was involved in the development of the included PROMs.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Terwee, C.B., van der Willik, E.M., van Breda, F. et al. Responsiveness and minimal important change of seven PROMIS computerized adaptive tests (CAT) in patients with advanced chronic kidney disease. J Patient Rep Outcomes 7, 35 (2023). https://doi.org/10.1186/s41687-023-00574-y

Download citation

Received: 13 October 2022
Accepted: 11 March 2023
Published: 04 April 2023
DOI: https://doi.org/10.1186/s41687-023-00574-y

Responsiveness and minimal important change of seven PROMIS computerized adaptive tests (CAT) in patients with advanced chronic kidney disease

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Study design

Participants

Measures

Responsiveness

Minimal important change

Results

Participants

Responsiveness

Minimal important change

Discussion

Conclusion

Data Availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing Interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords