Skip to main content

Measuring needs-based quality of life and self-perceived health inequity in patients with multimorbidity: investigating psychometric measurement properties of the MultiMorbidity Questionnaire (MMQ) using primarily Rasch models

Abstract

Background

Multimorbidity is a burden for the individual and to the healthcare sector worldwide, leading to a rising number of intervention studies towards this patient group. To measure a possible effect of such interventions, an adequate patient-reported outcome measure (PROM) is essential. The aim of this study was to assess the draft MultiMorbidity Questionnaire (MMQ), a PROM measuring needs-based quality of life and self-perceived inequity in patients with multimorbidity, for its psychometric properties and to adjust it accordingly to create a content- and construct valid measure.

Methods

The draft MMQ was sent to 1198 eligible respondents with multimorbidity. Modern test theory and classical test theory were used to analyse data. Dimensionality of the suggested domains and invariance of the items were assessed through item analysis, examining the fit to a psychometric model.

Results

The psychometric analyses were based on responses from 390 patients with multimorbidity. In the MMQ1, measuring needs-based QoL, evidence of six unidimensional scales was confirmed: physical ability (6 items), worries (6 items), limitations in everyday life (10 items), my social life (6 items), self-image (6 items), and personal finances (3 items). The psychometric analyses of the MMQ2 outlined four unidimensional scales measuring the feeling of Self-perceived inequity in patients with multimorbidity: experiences of being stigmatised (4–5 items), Experiences of insufficient understanding of the burden of disease (3 items), Experiences of not being seen and heard (4 items), Experience of powerlessness (5 items). These scales are relevant for patients’ with multimorbidity encounters with (1) their general pratitioner, (2) staff at their general practitioner’s surgery, (3) healthcare professionals, (4) staff at the local authorities and (5) friends, family, and others.

Conclusion

The MMQ, a QoL measure for patients living with multimorbidity has been validated: the MMQ1 is a condition-specific PROM with adequate psychometric properties designed to measure needs-based QoL. The MMQ2 measuring Self-perceived inequity, has also been found to possess adequate measurement properties; however due to the risk of type 2 error a revalidation of MMQ2 is suggested.

Background

The number of adults living with two or more chronic conditions, usually defined as multimorbidity [1], is continuously rising worldwide [2,3,4]. Besides extended healthcare costs for society [5], qualitative and quantitative studies show how multimorbidity has a negative impact on the individual’s quality of life (QoL) [6,7,8]. To intervene successfully among people living with multimorbidity it is essential to be able to measure the possible effects of studies targeting this patient group. Yet, there is no existing condition-specific patient-reported outcome measure (PROM) for adequately assessing QoL in patients with multimorbidity [9]. Therefore, we have developed the MultiMorbidity Questionnaire (MMQ) for this purpose. In the first stage of developing the MMQ, we found The needs-based approach to QoL to be a relevant, but not completely covering, conceptual framework through qualitative interviews with the target group [10,11,12]. The Needs-based approach to QoL is based on the individual’s possibility of fulfilling their expectations and needs in life [13, 14] and thus, emphasises the importance of involving the target group in generating items [15]. We found it necessary to supplement the conceptual framework with theory on subjective health inequity; reactions to how you feel social determinants determine how you are perceived by healthcare professionals [12, 16]. Our studies define this theory as self-perceived health inequity [12].

In the second stage of developing the MMQ, we developed items and domains (a set of items measuring nuances of the same construct before psychometric validation) covering the constructs needs-based QoL (MMQ1) and self-perceived inequity (MMQ2) (Tables 1, 2) through qualitative interviews with patients with multimorbidity, ensuring high content validity (content relevance and content coverage) [12].

Table 1 Domains and preliminary items of the draft MultiMorbidity Questionnaire (MMQ)
Table 2 Example of an item with response options

This article will detail the third stage in the development of the MMQ; psychometric assessment of the developed items and domains using Modern Test Theory (MTT) [17]. The Rasch model, belonging to MTT is regarded as the most strict model when validating PROMs regarding psychometric properties [17,18,19]. Contrary to classical test theory (CTT), Rasch models test for unidimensionality (i.e. items in a scale measure only a single construct), invariance (i.e. different subgroups do not respond in systematically different ways to a specific item) and local independence (i.e. items within a scale are not correlated beyond the construct measured) [17, 20]. The aim of this study is to test the drafts of the MMQ1 and MMQ2 for their psychometric properties regarding dimensionality of the domains and invariance of the items using MTT and to adjust accordingly. Furthermore, discrimination abilities and reliability of the scales will be tested using CTT.

Materials and methods

Stratification of eligible survey respondents

Eligible people invited to respond to the final drafts of the MMQ1 and MMQ2 were adults ≥ 18 years living with multimorbidity. They were recruited via their participation in The Lolland–Falster health study (LOFUS). LOFUS is a household population study of the general population of Lolland and Falster, in two deprived areas of southern Denmark [21]. A stratification among the LOFUS participants was conducted to obtain respondents who we hypothesised to be affected by their chronic conditions on their Needs-based QoL (Fig. 1).

Fig. 1
figure 1

Stratification of eligible survey respondents from the Lolland–Falster health study (LOFUS). Stratification step 1–4 were conducted within the LOFUS database based on the participants' responses to the LOFUS questionnaire. aAccording to our definition of multimorbidity (see Table 3). bThe LOFUS questionnaire: A health related questionnaire completed upon inclusion in the LOFUS [22]

Table 3 Diagnoses groups to ensure respondents with multimorbidity (diagnosis from more than one group)

Excluded participants are listed in Fig. 1 (1–3). Among the remaining LOFUS participants, all participants who had stated to suffer from anxiety and/or depression (N = 698) and a random sample of patients who had stated not to have anxiety or depression (N = 500) were stratified. This step was conducted to ensure enough respondents from the following three groups were invited: (1) Participants with somatic multimorbidity. (2) Participants with psychiatric multimorbidity (anxiety and depression). (3) Participants with psychiatric and somatic multimorbidity. An invitation to complete the MMQ was distributed in December 2019 to the group of 1198 strategically selected LOFUS-participant through a link to SurveyXact via e-Boks (a Danish provider of secure digital post-boxes). A paper version was sent via postal mail to participants without email or e-Boks. A final stratification of eligible respondents was done in the invitation process where participants without multimorbidity were excluded. This was done according to the definition of multimorbidity as suggested by Willadsen et al. “as having diagnoses from at least 2 of 10 diagnosis groups” [3] by the respondents’ responses to a list of diagnoses that had also been included in the LOFUS questionnaire. For the purpose of this study, diagnoses were grouped into the diagnosis groups listed in Table 3. Participants who had diagnoses from more than one of the groups and/or both anxiety and depression (N = 972) were invited. Reminders to invited eligible respondents who had not completed the survey within four weeks were distributed in January 2020. In the response to MMQ2 “skip-questions” occurred, enabling the patient to skip questions related to encounters with local authorities and other healthcare professionals if they had no contact to these two groups.

Psychometric analysis

The data collected were analysed using the Rasch model for polytomous items [23,24,25]. We evaluated the overall model fit of the draft domains regarding unidimensionality using Andersen’s conditional likelihood ratio test (CLR-χ2) [26]. Fit to the Rasch model was assessed for each item using item fit statistics with a known asymptotic distribution [27] and categorised items according to statistical significance (after adjustment for multiple testing). Individual item fit was tested by comparing observed and expected correlations between the individual item score with the rest-score of the remaining items in the scale [28]. Invariance was assessed by Differential Item Function (DIF), testing whether items measured differently among subgroups [19] related to sex (male, female), age groups (< 55, 55–65, > 65) and diagnosis groups (Table 3). This was done using conditional likelihood ratio tests (LRT) comparing nested models. Considering the heterogeneous group of patients with multimorbidity, we removed items with evidence of DIF. Lastly, we tested the items of each domain for local response dependence (LRD), analysing if items were correlated beyond the construct measured in terms of content, structure or wordings [29]. This was done using conditional likelihood ratio tests [30] and where LRD was found, this was accounted for by extending the model loglinear interaction parameters yielding graphical Rasch models [31,32,33,34] incorporating LRD. In doing so we compared nested models using conditional likelihood ratio tests (LRT). If items possessed misfit or poor psychometric properties, we tested alternative configurations in a stepwise manner, evaluating which items to remove. These evaluations were based on empirical knowledge from the qualitative interviews [11, 12], clinical experience or theoretical considerations linked to our conceptual frameworks of Needs-based QoL [11, 13, 14] and Self-perceived inequity [12, 16] above results of our analyses. Therefore, some items were kept despite misfit if they possessed high content validity in the qualitative phase [12]. Reliability for each of the resulting scales was assessed using Cronbach’s alpha [35]. To account for multiple testing, we used the Benjamini–Hochberg procedure to control the false discovery rate at 5% [36]. Person fit was evaluated using the conditional probability of response vectors given the observed total scale score and reported as the proportion of respondents with significant misfit and corresponding 95% confidence interval. Targeting was evaluated graphically using person-item location maps.

Discriminative abilities of the MMQ1 scales were determined by comparing the mean (plus SD) sumscores of the scales between the categories of the global item [18, 37] (Table 4); the order of the means should follow the categories’ order. Discriminating ability is calculated as the number of individuals needed in a t-test with 5% significance level to find a clinically meaningful difference in the response categories between known-groups with 80% power; here a difference of 1 point in the response categories of the global item, i.e., between “very poor” and “poor”, “poor” and “acceptable”, “acceptable” and “good”, or “good” and “very good” was taken as a clinically meaningful difference. A low number of individuals indicates a high discriminating ability of the scale.

Table 4 Global item with response categories

All analyses were conducted using DIGRAM v. 3.05.3 [38, 39] and RUMM2030 software [40]. Figures were made in R v. 4.0.0 [41]. A description of the Rasch analysis implemented in DIGRAM is available in Additional file 1.

Results

From the stratification of eligible respondents in the LOFUS we ensured only patients with diagnoses from different diagnosis groups [3] or more than one psychiatric disease were invited (n = 972). The response rate was 40.1% (Fig. 2). Table 5 shows the characteristics of the surveyed respondents.

Fig. 2
figure 2

Response rates of the MMQ

Table 5 Characteristics of survey respondents

Multimorbidity Questionnaire 1

In the psychometric analysis of the responses to the MMQ1, completed by patients with multimorbidity, we removed a total of 23 items and found that the remaining 37 items had adequate fit to unidimensional Rasch models: 1. Physical ability (6 items), 2. Worries (6 items), 3. Limitations in everyday life (10 items), 4. My social life (6 items), 5. Self-image (6 items), and 6. Personal Finances (3 items). Two items were moved from the domain they originated and tested within another domain. The results from the Rasch analysis for each of these six scales will be described in detail below. The top panel in Table 6 presents fit statistics, Cronbach’s alpha, Person Separation Indices, and proportion with misfit for each of the scales in MMQ1. Table 7 shows the structure and scale profiles of the final MMQ.

Table 6 Fit statistics, Cronbach’s alpha, number of respondents and proportion with significant person misfit for each scale in the MMQ1 and MMQ2 after removing items with poor fit or DIF
Table 7 Structure and scale-score profiles of the MultiMorbidity Questionnaire (MMQ)

In Additional file 2, content condensates of the included and excluded items of the draft MMQ1 and MMQ2 can be found. Additional file 3 shows individual item fits. In the following the stepwise elimination of problematic items are summarised.

Physical ability

The draft 10-item domain did not fit the Rasch model. Items 1b (constantly tired), 1c (aware of the body), and 1i (difficulties taking care of body) were removed because of DIF (Table A in Additional file 3) diagnosis. Item 1b, with its content regarding tiredness, was evaluated as too general and therefore could be dispensed. The remaining items indicated no DIF, but item 1a (manages very little) had poor fit (Table A in Additional file 3) and was removed. The remaining six items (1d, 1e, 1f, 1g, 1h, 1j) were found to fit a graphical Rasch model with evidence of LRD for two item pairs (1d and 1j), and (1f and 1g) (Table 6).

Worries

The Rasch model rejected the draft 11-item domain. In a stepwise procedure, the following items were removed due to DIF and poor item fit: (1) Item 2k (closest relatives being worried), (2) Item 2d (worried about personal economy), (3) Item 2f (worried about status in society) (Table A in Additional file 3). The wording of Item 2d only focuses on economy and was moved to another domain. Item 2f was also moved to another domain. Two items 2g (feeling mentally vulnerable) and 2h (limited in feeling mentally well) were removed due to DIF (Table A in Additional file 3). The remaining 6 items (2a, 2b, 2c, 2e, 2i, 2j) showed good fit to a graphical Rasch model with LRD for three item pairs (2a and 2c; 2b and 2c; 2e and 2i) (Table 6).

Limitations in everyday life

The draft 15-item domain did not fit a graphical Rasch model. Five items were removed stepwise due to DIF and/or misfit: Items 3f (have to plan activities from end to end), 3o (prevented from following what happens in the world), 3j (difficult performing one’s job), 3m (difficult keeping one’s home as one wish to present it to others), and 3g (everyday life planned around illnesses) (Table A in Additional file 3). Two of these (3m and 3g) had content similar to two retained items (3l and 3e). This improved the psychometric properties of the scale and the resulting 10-item scale (3a, 3b, 3c, 3d, 3e, 3h, 3i, 3k, 3l, 3n) fitted a graphical Rasch model with LRD for three item pairs (Table 6). Item 3c showed borderline problematic item fit (p = 0.0309) but was retained because of high content validity. Five items showed borderline significant DIF (3a, with respect to age; 3b, with respect to diagnosis; 3e, 3l, 3n, with respect to sex). The p values ranged between 0.0181 and 0.0335. These items were also retained due to high content validity.

My social life

The draft 11-item domain was rejected by Rasch analysis. The five items 4b (relationship with loved ones affected), 4g (difficulties being anything to others), 4i (difficulties helping loved ones), 4j (limited sex life), and 4k (prevented from sexual activities) were removed due to DIF and/or misfit (Table A in Additional file 3). The two latter showed a high degree of LRD due to their wordings regarding sex as the only items. Omitting these poor-fitting items resulted in a 6-item scale (4a, 4c, 4d, 4e, 4f, 4h) with adequate fit to a graphical Rasch model without DIF, but with LRD between the two item pairs (4d and 4e; 4c and 4h) (Table 6).

Self-image

The draft 12-item domain showed poor fit to a graphical Rasch model. The following items were removed due to misfit and/or DIF: 5h (hides illness), 5i (angry at oneself because of illnesses), 5j (disappointed in oneself), 5k (losing one’s role related to employment situation), 5l (losing one’s role in the family) (Table A in Additional file 3). This resulted in a 6-item scale fitted a graphical Rasch model without DFI, but with LRD for four item pairs (5a and 5c; 5d and 5g; 5d and 5e; 5f and 5g) (Table 6).

Personal finances

Due to their content relating to personal finances of the omitted items 2d and 2f from the scale “Worries”, these items were added to the suggested 2-item domain “Personal economy”. A single item (6b; prevented from living as one wish to) was removed due to DIF (Table A in Additional file 3). The resulting 3-item scale (6a, 2d, 2f) fitted a Rasch model without DIF, but with LRD between the items 6a and 2f (Table 6).

Reliability

The Cronbach’s alpha coefficients for the six MMQ1 scales ranged from 0.87 to 0.92 (Table 6). The Person Separation Index varied from 0.60 to 0.88.

Person fit, targeting and discrimination

The level of person fit was acceptable for all scales and so was the targeting (Fig. 3). The scale 3. Limitations in everyday life performed best in discrimination between the response options “Acceptable” and “Poor” of the global item. Generally, all the scales performed best in the range between “Poor” and “Good”. The discrimination abilities for each scale in the MMQ1 corresponding to a difference in the response options of 1 point are presented in Table 8 and illustratively in Fig. 4.

Fig. 3
figure 3

Person-item location maps

Table 8 Discriminative abilities of the MMQ1 scales between the global item’s response options
Fig. 4
figure 4figure 4

Discrimination abilities of the MMQ1 scales. Responses to each category of the global item within each possible scale-score. X-axis: Total score on the scale. Physical ability as an example: Low score, low perceived limitation). Y-axis: Number of people affirming each category of the global item

Multimorbidity Questionnaire 2

The psychometric analyses of the MMQ2 outlined the measurement of self-perceived inequity in four unidimensional scales showing fit to the Rasch model, apart from the misfits detailed below. The scales include: 7. Experiences of being stigmatised, 8. Experiences of not being seen and heard, 9. Experiences of insufficient understanding of the burden of disease, and 10. Experience of powerlessness (repeated for encounters with the following groups: (1) The general practitioner (GP), (2) staff at the GP’ surgery, (3) local authorities, (4) other healthcare professionals, (5) friends, families, and others. The lower panel in Table 6 presents fit statistics and Cronbach’s alpha for each of the scales in MMQ1. The following describes the stepwise analyses.

In encounters with the GP

Experiences of being stigmatised

The draft 5-item scale had acceptable fit to the Rasch model after adjustment for multiple testing (Table 6). Poor item fit was present for items 7c (Pigeonholed because of society’s norms) and 7e (feeling labelled as a second-class citizen) (Table B in Additional file 3). Item 7b (feeling judged because of social status) possessed DIF regarding age (LRT: p = 0.0278).

Experiences of not being seen and heard

The draft 4-item scale was disregarded by the Rasch model (Table 6). Item 8b (feeling unfairly treated) had poor fit (Table B in Additional file 3), and item 8c presented DIF in terms of age (LRT: p = 0.0199).

Experiences of insufficient understanding of the burden of disease

The analyses supported the overall fit of the draft 3-item scale to the Rasch model (Table 6). There was evidence of DIF regarding diagnosis for item 9a (the GP having difficulties understanding problems) (LRT: p = 0.0006).

Experience of powerlessness

The draft 5-item domain was found to fit the Rasch model with no DIF. Item 10c revealed poor item fit (Table B in Additional file 3).

In encounters with staff at the GP’s surgery

Experiences of being stigmatised

The draft 5-item domain fitted the Rasch model (Table 6). DIF with regards to diagnoses was evident for items 11c (pigeonholed because of society’s norms) (LRT: p = 0.0176) and 11e (feeling labelled as a second-class citizen) (LRT: p = 0.0156). Furthermore, DIF was present for item 11b (feeling labelled due to a lack of affiliation to the labour market) regarding sex (LRT: p = 0.0009).

Experiences of not being seen and heard

The draft 4-item scale showed good fit to the Rasch model (Table 6) with no indication of DIF.

Experiences of insufficient understanding of the burden of disease

The analyses supported the overall fit of the draft 3-item scale to the Rasch model. The item 13a (staff at the GP’s surgery having difficulties understanding problems) possesed DIF in terms of age group (LRT: p = 0.0031) and sex (LRT: p = 0.0032).

In encounters with other healthcare professionals

There were 165 respondents stating they did not have contact with other healthcare professionals (e.g., hospital department), and therefore did not respond to the items in this domain.

Experiences of being stigmatised

The draft 5-item domain possessed fit to the Rasch model with no evidence of DIF (Table 6).

Experiences of not being seen and heard

The draft 4-item scale fitted a Rasch model (Table 6). Item 15c (not feeling seen) exhibited DIF regarding age (LRT: p = 0.0109).

Experiences of insufficient understanding of the burden of disease

The analyses supported the overall fit of the draft 3-item scale to the partial credit Rasch model with no evidence of DIF (Table 6).

Experience of powerlessness

The draft 5-item domain was found to fit a Rasch model with no DIF (Table 6).

In encounters with local authorities

There were 218 respondents stating they did not have contact with the local authorities (e.g., job centre or case worker), and therefore did not respond to the items in this domain.

Experiences of being stigmatised

The draft 5-item domain possessed fit to the Rasch model (Table 6). DIF was present for Item 18e (feeling labelled as a second-class citizen) in terms of age (LRT: p = 0.0075).

Experiences of not being seen and heard

The analyses supported the overall fit of the draft 4-item scale to the Rasch model (Table 6). Item 19c (not feeling seen) showed DIF regarding sex (LRT: p = 0.0163).

Experiences of insufficient understanding of the burden of disease

The analyses supported the overall fit of the draft 3-item scale to the partial credit Rasch model with no evidence of DIF (Table 6).

Experience of powerlessness

The draft 5-item domain fitted the Rasch model (Table 6). DIF was revealed regarding age group (LRT: p = 0.0213). Item 21b (using a lot of energy fighting ‘my cause’) showed poor item fit (Table E in Additional file 3).

When spending time with friends, family, and others

Experiences of being stigmatised

The draft 4-item scale possessed fit to the Rasch model with no evidence of DIF (Table 6).

Experiences of insufficient understanding of the burden of disease

The analyses supported the overall fit of the draft 3-item scale to the partial credit Rasch model with no evidence of DIF (Table 6).

Reliability

Reliability was evaluated using Cronbach’s alpha coefficients after evidence of unidimensionality of each scale was established, values ranging from 0.84 to 0.98.

Discussion

Evidence of six unidimensional scales measuring Needs-based Quality of Life in the MMQ1 were confirmed: Physical ability, Worries, Limitations in everyday life, My social life, Self-image and Personal economy encompassing in total 31 items. In the MMQ2, measuring Self-perceived inequity, four unidimensional scales were outlined: Experiences of being stigmatised (4–5 items), Experiences of not being seen and heard (4 items), Experiences of insufficient understanding of the burden of disease (3 items), and Experience of powerlessness in encounters with (1) the GP, (2) staff at the GP’s surgery, (3) other healthcare professions, (4) local authorities and (5) family, friends, and others (in total a maximum of 16–17 items).

In the psychometric assessment, 23 items were removed from the MMQ1, mainly because of DIF of the individual item regarding diagnoses or age group. This could be expected, considering the responses stemmed from the heterogeneous group of patients with multimorbidity, with different combinations of diagnoses, different severities of illnesses, treatment regimens, and ages. However, we raised evidence that invariant measure was achieved in 6 unidimensional scales encompassing 31 items and thereby comparison of Need-based quality of life across different diagnoses combination in patients living with multimorbidity is possible. Furthermore, as the final MMQ1 only possesses DIF in the scale Limitations in everyday life, it is possible to use this measure in non-randomised studies. In this context, the DIF revealed in this scale requires attention in future studies.

As we have no external objective parameter to compare the MMQ1’s measurement properties we assessed how the individual MMQ1 scales discriminated between the response options of the global item. This differed for each scale and response category, ranging from 34 participants necessary to able to measure the difference between the response options “Acceptable” and “Poor” in the scale Limitations in everyday life to 10,478 participants in the scale “Personal finances” between the response categories “Poor” and “Very poor”. Generally, the six MMQ1 scales discriminated best between the response options “Good” and “Poor”. We interpret this as due to the Danish wordings of the response options in the fare ends of the scales (“Very poor” to “poor” and “good” to “very good”) are similar in the context of the items they refer to thereby preventing the ability to discriminate responses. Another interpretation is, that distinguishing in the fare ends of the scales for the individual respondent, is questionable and possibly caused by other day-to-day factors than the latent construct.

Due to a low number of responses to the MMQ2 among the 390 responders, the analyses were based on between 47 and 165 responses for each domain (Table 6), no items from the MMQ2 were disregarded because of insufficient statistical power to detect potential problems, e.g., multidimensionality, misfit, LRD or DIF. Therefore, we suggest a revalidation of MMQ2 among a larger sample size.

Experiences of stigma in encounters with healthcare services are frequently described in vulnerable populations [12, 16]. To the best of our knowledge, no measure exists for measuring these perceptions and the influence on QoL in patients with multimorbidity. In our qualitative item generation phase, this aspect of QoL proved to be of great importance to our informants and resulted in the inclusion of measuring Self-perceived inequity through the construction of four domains, with relevant items repeated for the informants encounters with healthcare professionals, local authorities and family/friends [11, 12]. Rasch analyses indicated that the domains behaved as we expected, yet with the risk of type 2 errors due to low power from the small sample size.

Strengths and limitations

The primary strength of this study is the content-validity-driven approach of using psychometric methods confirmatory supported by the preliminary conceptual and qualitative phases [11, 12]. This process enables development of scales and items with both high content validity and adequate psychometric properties. Furthermore, the thorough stratified selection of eligible respondents providing the data is a strength for the psychometric assessment. Another strength is the assessment of the discriminating ability of each MMQ1 scale providing evidence of the scales’ measurement capabilities and enabling power calculations for future clinical trials and surveys.

The limitation of this study is primarily related to the low response rate the analyses of the MMQ2 were based on, questioning the applicability directly to other studies. Therefore, to be transparent in the process of psychometrically testing the MMQ2, we thoroughly reported each psychometric flaw the analyses revealed towards unidimensionality, item misfit and DIF. This will guide our revalidation and assessment of the MMQ in a large Danish survey study encompassing more than 1600 responses.

The lower response rates of the MMQ2 compared to the MMQ1 could be related to the response burden. MMQ2 was at the end of the questionnaire, and the same items are repeated for each contact. Yet, the lowest number of responses is due to the possibility of skipping items, if they were not relevant for the respondent, this applied to encounters with local authorities (N = 218) and other healthcare professionals (N = 165). In future studies, the MMQ1 and MMQ2 can be used independently and only encounters relevant for the specific study has to be included.

Conclusion

This study has raised evidence of adequate psychometric properties, including unidimensional scales encompassing items possessing invariant measurement properties, of the MMQ, a condition-specific PROM for patients living with multimorbidity. The MMQ1, measuring Needs-based QoL, encompasses six scales: Physical ability (6 items), Worries (6 items), Limitations in everyday life (10 items), My social life (6 items) Self-image (6 items), and Personal finances (3 items)—in total 31 items.

Furthermore, the psychometric analyses of the MMQ2 outlines four scales measuring Self-perceived inequity for patients with multimorbidity: experiences of being stigmatised (4–5 items), Experiences of insufficient understanding of the burden of disease (3 items), Experiences of not being seen and heard (4 items), Experience of 1owerlessness (5 items)—in total 16–17 items). These scales are relevant in their encounters with their (1) GP, (2) staff at their GP’s surgery, (3) healthcare professionals, (4) staff at the local authorities and (v) friends, family, and others.

Implications for research

Our initial systematic review revealed that there is no measure of QoL with adequate psychometric properties specifically for patients with multimorbidity [9]. The MMQ1 is a new multidimensional, condition-specific PROM for measuring Needs-based QoL in patients with multimorbidity with high content validity and adequate measurement properties that can be used in observational- and intervention studies. Further assessment of the MMQ, including revalidation of the MMQ2 for measuring Self-perceived health inequity, is forthcoming. As the MMQ was developed in a Danish setting, work is in progress for translating, adapting, and validating the MMQ into an English context.

Availability of data and materials

The datasets are available upon reasonable request to the corresponding author.

References

  1. van den Akker M, Buntinx F, Knottnerus JA (1996) Comorbidity or multimorbidity. Eur J Gen Pract 2:65–70. https://doi.org/10.3109/13814789609162146

    Article  Google Scholar 

  2. Uijen AA, van de Lisdonk EH (2008) Multimorbidity in primary care: prevalence and trend over the last 20 years. Eur J Gen Pract 14:28–32. https://doi.org/10.1080/13814780802436093

    Article  PubMed  Google Scholar 

  3. Willadsen TG, Siersma V, Nicolaisdóttir DR et al (2018) Multimorbidity and mortality: A 15-year longitudinal registry-based nationwide Danish population study. J Comorb. https://doi.org/10.1177/2235042X18804063

    Article  PubMed  PubMed Central  Google Scholar 

  4. Nguyen H, Manolova G, Daskalopoulou C et al (2019) Prevalence of multimorbidity in community settings: A systematic review and meta-analysis of observational studies. J Comorb. https://doi.org/10.1177/2235042X19870934

    Article  PubMed  PubMed Central  Google Scholar 

  5. Glynn LG, Valderas JM, Healy P et al (2011) The prevalence of multimorbidity in primary care and its effect on health care utilization and cost. Fam Pract 28:516–523. https://doi.org/10.1093/fampra/cmr013

    Article  PubMed  Google Scholar 

  6. Makovski TT, Schmitz S, Zeegers MP et al (2019) Multimorbidity and quality of life: systematic literature review and meta-analysis. Ageing Res Rev 53:100903

    Article  PubMed  Google Scholar 

  7. Ørtenblad L, Meillier L, Jønsson AR (2018) Multi-morbidity: a patient perspective on navigating the health care system and everyday life. Chronic Illn 14:271–282. https://doi.org/10.1177/1742395317731607

    Article  PubMed  Google Scholar 

  8. Sand C, Rahbek K, Willadsen T, Jønsson A (2021) Prioritizing social identities: patients’ perspective on living with multimorbidity. J Comorb. https://doi.org/10.1177/26335565211009375

    Article  Google Scholar 

  9. Møller A, Bissenbakker KH, Arreskov AB, Brodersen J (2020) Specific measures of quality of life in patients with multimorbidity in primary healthcare: a systematic review on patient-reported outcome measures’ adequacy of measurement. Patient Relat Outcome Meas 11:1–10. https://doi.org/10.2147/prom.s226576

    Article  PubMed  PubMed Central  Google Scholar 

  10. Bissenbakker KH, Jønsson AR, Brodersen J, Møller A (2020) PROMs og livskvalitet. Tidsskr Forsk i Sygd og Samf 17:55–74. https://doi.org/10.7146/tfss.v17i32.120975

    Article  Google Scholar 

  11. Bissenbakker K, Møller A, Brodersen JB, Jønsson ABR (2022) Conceptualisation of a measurement framework for needs-based quality of life among patients with multimorbidity. J Patient Rep Outcomes 6:83. https://doi.org/10.1186/s41687-022-00489-0

    Article  PubMed  PubMed Central  Google Scholar 

  12. Bissenbakker K, Møller A, Jønsson ABR, Brodersen JB. Generating items for measuring needs-based quality of life and self-perceived health inequity among patients with multimorbidity: Development of the MultiMorbidity Questionnaire (MMQ). In review (September 2022)

  13. Hunt SM, McKenna SP (1992) The QLDS: a scale for the measurement of quality of life in depression. Health Policy (NY) 22:307–319. https://doi.org/10.1016/0168-8510(92)90004-U

    Article  CAS  Google Scholar 

  14. McKenna SP, Doward LC (2004) The needs-based approach to quality of life assessment. Value Heal 7:S1–S3. https://doi.org/10.1111/J.1524-4733.2004.7S101.X

    Article  Google Scholar 

  15. McKenna SP, Doward LC, Niero M, Erdman R (2004) Development of needs-based quality of life instruments. Value Heal 7:17–21. https://doi.org/10.1111/j.1524-4733.2004.7s105.x

    Article  Google Scholar 

  16. Jønsson ABR (2018) Aging with multimorbidity. Illness and Inequity in Everyday Live. University of Copenhagen

  17. Christensen KB, Comins JD, Krogsgaard MR et al (2021) Psychometric validation of PROM instruments. Scand J Med Sci Sports 31:1225–1238. https://doi.org/10.1111/SMS.13908

    Article  PubMed  Google Scholar 

  18. Mokkink LB, de Vet HCW, Prinsen CAC et al (2018) COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Qual Life Res 27:1171. https://doi.org/10.1007/S11136-017-1765-4

    Article  CAS  PubMed  Google Scholar 

  19. Brodersen J, Meads D, Kreiner S et al (2007) Methodological aspects of differential item functioning in the Rasch model. J Med Econ 10:309–324. https://doi.org/10.3111/13696990701557048

    Article  Google Scholar 

  20. Marais I (2013) Local dependence. In: Christensen KB, Kreiner S, Mesbah M (eds) Rasch models in health. Wiley, London

  21. Jepsen R, Egholm CL, Brodersen J et al (2020) Lolland–Falster health study: study protocol for a household-based prospective cohort study. Scand J Public Health 48:382–390. https://doi.org/10.1177/1403494818799613

    Article  PubMed  Google Scholar 

  22. Egholm CL, Packness A, Stokholm J et al (2020) Questionnaire development for the Lolland–Falster Health Study, Denmark: an iterative and incremental process. BMC Med Res Methodol. https://doi.org/10.1186/S12874-020-00931-1

    Article  PubMed  PubMed Central  Google Scholar 

  23. Masters GN (1982) A Rasch model for partial credit scoring. Psychometrica 47:149–174. https://doi.org/10.1007/BF02296272

    Article  Google Scholar 

  24. Andersen EB (1977) Sufficient statistics and latent trait models. Psychometrika 42:69–81. https://doi.org/10.1007/BF02293746

    Article  Google Scholar 

  25. Andrich D (1978) A rating formulation for ordered response categories. Psychometrika 43:561–573. https://doi.org/10.1007/BF02293814

    Article  Google Scholar 

  26. Andersen EB (1973) A goodness of fit test for the Rasch model. Psychometrika 38:123–140

    Article  Google Scholar 

  27. Christensen KB, Kreiner S. Item fit statistics. In: Rasch Models in Health. Wiley, pp 83–104 (2013)

  28. Kreiner S (2011) A note on item–restscore association in Rasch models. Appl Psychol Meas 35:557–561. https://doi.org/10.1177/0146621611410227

    Article  Google Scholar 

  29. Christensen KB, Makransky G, Horton M (2017) Critical values for Yen’s Q3: identification of local dependence in the Rasch model using residual correlations. Appl Psychol Meas 41:178. https://doi.org/10.1177/0146621616677520

    Article  PubMed  Google Scholar 

  30. Kreiner S, Christensen K. Two tests of local independence. In: Rasch Models in Health. Wiley, pp 131–136 (2013)

  31. Kreiner S, Bang Christensen K (2011) Item screening in graphical loglinear Rasch models. Psychometrika 76:228–256. https://doi.org/10.1007/S11336-011-9203-Y

    Article  Google Scholar 

  32. Kreiner S, Christensen KB (2002) Graphical Rasch models. In: Mesbah M, Cole FC, Lee MT (eds) Statistical methods for quality of life studies. Springer, New York, pp 187–203

    Chapter  Google Scholar 

  33. Kreiner S, Christensen KB (2007) Analysis of local dependence and multidimensionality in graphical loglinear Rasch models. Commun Stat Theory Methods 33:1239–1276. https://doi.org/10.1081/STA-120030148

    Article  Google Scholar 

  34. Kreiner S, Christensen KB (2007) Validity and objectivity in health-related scales: analysis by graphical loglinear Rasch models. Multivar Mix Distrib Rasch Model. https://doi.org/10.1007/978-0-387-49839-3_21

    Article  Google Scholar 

  35. Cronbach LJ (1951) Coefficient alpha and the internal structure of tests. Psychom 163(16):297–334. https://doi.org/10.1007/BF02310555

    Article  Google Scholar 

  36. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B 57:289–300. https://doi.org/10.1111/J.2517-6161.1995.TB02031.X

    Article  Google Scholar 

  37. Cappelleri JC, Zou KH, Bushmakin AG et al (2013) Patient-reported outcomes: measurement, implementation and interpretation. CRC Press, Boca Raton

    Book  Google Scholar 

  38. Kreiner S. Introduction to DIGRAM. Research report 03/10. Department of Biostatistics. University of Copenhagen, Copenhagen (2003).

  39. DIGRAM. https://biostat.ku.dk/DIGRAM.

  40. Andrich D, Sheridan B, Luo G. RUMM 2030: Rasch Unidimensional Measurement Models [Computer software]. RUMM Laboratory, Perth (2010).

  41. R Core Team. R: A language and environment for statistical computing (2022).

Download references

Acknowledgements

The author group would like to thank the participants of the Lolland–Falster Study who responded to the draft versions of the MMQ. A very special thanks to Randi Jepsen and Christian Cato Holm conducting the stratification of patients, disseminating the questionnaires and the data-extraction in the LOFUS. Randi especially for the commitment to the study, and for critical and relevant comments.

Funding

Open access funding provided by Royal Library, Copenhagen University Library This study was funded by Fonden for Almen Praksis (A1849, A2013, A2662, A3550), Region Zealand: Sundhedsvidenskabelige Forskningsfond, Region Zealand: Tidlig opsporing, Region Zealand: PhD grant, The Novo Nordisk Foundation (NNF15OC0019568), The University of Copenhagen: Den lægevidenvidenskabelige del af det Sundhedsvidenskabelige Fakultets fond for videnskabeligt ansate kandidater og studerende (2018-0061), AP Møller Lægefonden (19-L-0328), Sara Krabbes Legat.

Author information

Authors and Affiliations

Authors

Contributions

KB: Study conceptualisation and design, data collection, formal analysis, data curation, writing original draft and revisions of the manuscript. JBB: Study conceptualisation and design, formal analysis, writing—review and editing. KBC: Study conceptualisation and design, formal analysis, data curation, software, writing—review and editing. VS: Study conceptualisation and design, formal analysis, data curation, software, writing—review and editing. AM: Study conceptualisation and design, writing—review and editing. ABRJ: Study conceptualisation and design, writing—review and editing. All authors have read and approved the final version of this manuscript.

Corresponding author

Correspondence to Kristine Bissenbakker.

Ethics declarations

Ethics approval and consent to participate

Participants in this study are registered in, and thereby covered under The Lolland–Falster Study’s (LOFUS) approval from The Danish Data Protection Agency (REG-24-2015). Our data provided by LOFUS did not involve any personally identifiable information, hence, no ethical approval was required. Information about this study was provided in the invitation to complete MMQ, yet informed consent was not required due to the anonymised data.

Consent for publication

Not applicable due to the use of anonymised data in this study.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Rasch analysis implemented in DIGRAM.

Additional file 2.

Content condensates of the included and excluded items of the draft MMQ1 and MMQ2.

Additional file 3.

Individual item fit of items in MMQ1 and MMQ2.

Additional file 4.

Requirements for using the MultiMorbidity Questionnaire.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bissenbakker, K., Siersma, V., Jønsson, A.B.R. et al. Measuring needs-based quality of life and self-perceived health inequity in patients with multimorbidity: investigating psychometric measurement properties of the MultiMorbidity Questionnaire (MMQ) using primarily Rasch models. J Patient Rep Outcomes 7, 94 (2023). https://doi.org/10.1186/s41687-023-00633-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s41687-023-00633-4

Keywords