Development and psychometric properties of the Thai Graves’ ophthalmopathy quality of life (GO-QOL) questionnaire
Journal of Patient-Reported Outcomes volume 4, Article number: 1 (2020)
To develop and assess the psychometric properties of the Thai version of the Graves’ Ophthalmopathy Quality of Life (GO-QOL) questionnaire.
Graves’ ophthalmopathy (GO) is a chronic condition that causes negative self-image and impaired visual function. These conditions impact quality of life (QOL) but are rarely documented. Graves’ Ophthalmopathy Quality of Life Questionnaire (GO-QOL) has good validity, reliability and responsiveness. In this study we developed a Thai GO-QOL questionnaire by translating the questionnaire from English to the Thai language and evaluated its reliability and validity.
Patients and methods
Forward and backward translations were performed independently by four translators with extensive experience in both English and Thai. Seventy patients at the thyroid clinic responded to the Thai translated version upon their first visit and again 2–3 weeks afterwards. Validity was assessed by the content validity index (CVI) and correlation with relevant clinical parameters. Reliability was evaluated by Cronbach’s alpha, the intraclass correlation coefficient, and the Bland-Altman plot.
The Thai GO-QOL version showed high CVI (0.97) and a moderate negative correlation of the functional QOL score with disease severity (r = − 0.49), the clinical activity score (r = − 0.31), and exposure parameter (r = − 0.32). It showed good reliability with a high intraclass correlation coefficient (0.92) and high Cronbach’ s coefficient (0.86).
The Thai GO-QOL has good validity and reliability. It can be used to evaluate the quality of life of Graves’ ophthalmopathy patients as a consequence of their disease in thyroid treatment programs.
Graves’ ophthalmopathy (GO) is an autoimmune orbital inflammatory disease that occurs in approximately 50% of patients with Graves’ disease . The pathogenesis of the disease is associated with shared antigens and cross-reactivity of thyroid and orbital tissue. Circulating antibodies activate proteins in the extraocular muscles and orbital fat, leading to fibroblast proliferation and glycosaminoglycan production. This results in exophthalmos, orbital congestion, enlarged extraocular muscles and increased orbital fat volume, with typical effects of diplopia, and eye exposure. Infrequently, optic neuropathy develops in 5% of GO. The course of GO is commonly biphasic with an active phase characterized by orbital inflammatory signs, mostly lasting 6–24 months, followed by the inactive phase of the disease . This chronic condition negatively impacts patients’ quality of life and is associated with visual impairment, psychosocial problems, and consequent disfigurement of the orbital structures. In 1997, Gerding et al. reported lower scores on the 24-Item Medical Outcomes Study Short-Form Health Survey (MOS SF-24) in GO patients compared to scores in diabetes, emphysema or heart failure patients but scores comparable to those of patients with inflammatory bowel disease . The MOS SF-24 was used to evaluate the perception of general health status in GO patients in the Netherlands before a disease-specific quality of life questionnaire was proposed .
The first GO-specific questionnaire (GO-QOL) was developed in 1998 by Terwee et al. . It contained 16 questions, 8 on visual functioning and 8 on appearance. The study showed that the GO-QOL demonstrated good validity, reliability and responsiveness [5, 6]. It also revealed good correlations with disease severity and clinical activity, even among different ethnic groups and different languages [7,8,9,10]. The GO-QOL was recommended for use by the European Group on Graves’ Orbitopathy (EUGOGO) for the assessment of clinical response parameters in clinical trials .
The goal of GO treatment is to improve visual function, appearance, and patients’ feelings. Thus, the self-assessment of eye condition has been recommended in the treatment program for GO patients. No health- related quality of life (HRQoL) or thyroid disease-specific questionnaire studies in GO patients have previously been conducted in Thailand. To monitor the clinical response to treatment and clinical trial outcomes, we developed a Thai version of the GO-QOL and evaluated its validity and reliability with respect to its value for Thai GO patients.
This study was approved by the Mettapracharak Hospital Institutional Review Board and informed consent was obtained from all participants. Permission for translation was received from the proposer of the GO-QOL questionnaire (Terwee CB). Forward translation from the English language questionnaire to Thai was performed by two native Thai speakers from Chulalongkorn University with extensive experience in both the English and Thai languages. Backward translation was performed by two native speakers in English working independently. Each item in the Thai questionnaire was reviewed and troublesome items were identified by 5 oculoplastic surgeons with expertise in thyroid eye disease.
The questionnaire was administered to 10 normal subjects, defined as healthy volunteers with no abnormal eye conditions that affected their quality of life. Discussion about the meaning and clarity of each item was performed.
We enrolled 70 consecutive participants with Graves’ ophthalmopathy (age > 18) at the oculoplastic clinic in Mettapracharak Hospital. Only patients who understood Thai were enrolled. GO patients whose quality of life could be affected by other eye diseases were excluded. Seventy participants completed the Thai-translated questionnaires twice. The first time was at the clinic, and the second was 2–3 weeks later. The short period was intended to prevent recall bias and changes in the stability of clinical signs. All patients were interviewed to determine whether any items were difficult or confusing to answer or irrelevant to the disease.
The Thai version of the GO-QOL contained 16 items with 8 questions pertaining to the consequences of diplopia and decreased vision on visual functioning and 8 questions on the psychosocial consequences of a changed appearance . Each item was answered on a 3-point Likert scale (1, “seriously limited”; 2, “a little limited”; 3, “not limited at all”). The answer scores for items 1–8 and for items 9–16 were tallied to provide 2 raw scores. Each possible subscale score ranged from 8 to 24; the first one was a functioning subscale, and the second one was an appearance subscale. The two raw subscale scores were transformed to 2 total scores by the following formula: total score = (raw score - Y) / (2 x Y) × 100, where Y is the number of completed question items. Each total score ranged from 0 to 100, with lower scores representing poorer health status. If participants did not complete any item for any reason, such as “never learned to ride a bike” or “have no driver’s license”, the missing item could be excluded, leaving the remaining completed items for calculation [6, 12].
Patients’ demographic data were collected. Current GO severity was graded by the EUGOGO classification system, which divides patients into mild, moderate to severe and very severe groups. Soft tissue inflammation and activity was graded by clinical activity scores based on inflammatory signs of the orbit (pain, redness and swelling).
Validation focused on content and construct validity. Content validity involved the systematic examination of the item content to determine whether it was applicable to, relevant to and reflective of Thai GO patients and was determined using a content validity index (CVI). Each item was rated by five experts on a 4-point ordinal scale (grade 1, not relevant; grade 2, somewhat relevant; grade 3, quite relevant; grade 4, highly relevant). The CVI was assessed both at the item level (I-CVI) and the scale level (S-CVI). The item level (I-CVI) was calculated as the following formula: I-CVI = NR / N, where NR is the number of experts giving a score of grade 3 or grade 4 and N is the total number of experts . I-CVI should be at least 0.78 for sufficient agreement . The scale level CVI (S-CVI) was computed with 2 different indices: 1) universal agreement (S-CVI/UA) was calculated by the formula S-CVI/UA = A / B, where A is the number of items that were judged relevant to grade3 or grade 4 by all experts (I-CVI = 1.0), and B is the number of total items, and 2) average agreement (S-CVI/Ave) was computed by summing all I-CVI and dividing by the number of total items . An acceptable S-CVI should be at least 0.8 [14, 15].
Construct validity refers to how well a test measures its intended construct . This was assessed by evaluating specific hypotheses through the correlation between QOL scores and other clinical parameters. The functioning subscale had moderate negative relationships with disease severity and clinical activity score (CAS) values. The appearance subscale was weakly to moderately correlated with four measures: 1) age, with younger patients reporting more problems with appearance than older patients; 2) sex, with female patients reporting more problems with appearance than men; 3) GO disease severity; and 4) eye exposure parameters, with scores worsening as eye exposure increased. The expected magnitude of the differences in GO-QOL scores between various severity groups was at least 10 points as a minimal clinically important difference . The criteria for good construct validity are defined as at least 75% of the results are coherent with the hypotheses . A one-way analysis of variance (ANOVA) was used to identify differences in the mean QOL scores between the various severity groups. Spearman’s rank correlation coefficient was performed to assess the correlation between disease severity, CAS and eye exposure parameters with QOL scores. The internal consistency of the questionnaires was assessed by calculating Cronbach’s alpha. Factor analysis was performed to categorize 16 items of the Thai GO-QOL into group domains; the cutoff point of factor loading was 0.4 . Test-retest reliability was evaluated according to the intraclass correlation coefficient (ICC) and Bland-Altman plot . A two-way mixed model for absolute agreement was used to calculate the ICC . All data were analyzed using SPSS version 21.0 software (SPSS Inc., Chicago, IL, USA).
Results of pretesting
Ten normal participants reported that the Thai GO-QOL questionnaire was not difficult to complete or confusing.
Results of testing
All 70 participants were recruited and completed every item of the Thai version of the GO-QOL. They reported that the questionnaire was not difficult, confusing or irrelevant to the disease. Table 1 shows the demographic data and clinical characteristics of the GO patients. Most of the participants were female. The mean age was 49.4 ± 12.1 years. Diplopia and dry eye were the most common current symptoms (36.6%). Approximately two-thirds of the patients had moderate to severe severity (67.1%). Six patients (8.6%) had dysthyroid optic neuropathy. The average CAS was 1.0, and 6 patients (8.6%) had active disease.
The frequencies of responses on the Thai GO-QOL are presented in Table 2. The percentage of completed responses for each item was 100%. Seriously limited activities were described by 48.6% of patients for reading, by 42.9% for driving, by 44.3% for interference with daily life, and by 40% for walking outdoors and watching TV. Most patients (95.7%) felt that their appearance had changed (a little or very much), 75.7% felt the influence of the disease on their self-confidence (a little or very much), and 71.4% felt that they were stared at on the street (a little or very much). The average score for visual functioning was 40.17 ± 28.65 (mean ± SD). The average appearance score was 57.50 ± 26.76.
There were high content validity indices for each item question (I-CVI > 0.8) and the mean of all items (S-CVI/Ave =0.97) (Table 3). The visual functioning scores were moderately negatively correlated with disease severity (r = − 0.49), CAS (r = − 0.31) and lid retraction (r = − 0.32). The appearance scores were weakly negatively correlated with disease severity (r = − 0.20) and dry eye severity (r = − 0.24). Age was weakly correlated with QOL scores, while female sex was not correlated with the scores (Table 4). The mean visual functioning in each severity group was statistically significantly different (p < 0.001, p = 0.01) (Table 5 and Fig. 1). In accordance with the hypotheses, the construct validity of the visual functioning subscale was 100% (3 of 3 criteria), and that of the appearance subscale was 80% (4 of 5 criteria) (Tables 4 and 5).
The results of the factor analysis are presented in Table 6. Factor analysis with varimax rotation was used to categorize the 16 items of the questionnaire into four group factors, explaining 72.93% of the total variance. Items that loaded high on the first factor were related to problems with near to intermediate vision. Items that loaded high on the second factor were associated with psychosocial problems. Items that loaded high on the third factor were correlated with changed appearance, and items that loaded high on the fourth factor were related to trouble with distant vision. A two-factor structure confirmed the subdivision of the questionnaire into 2 subscales for visual functioning (near to distance vision) and the psychosocial effects of changed appearance.
Cronbach’s alphas were 0.86 for visual functioning and 0.87 for appearance. The intraclass correlation coefficients were 0.92 (95% CI, 0.88–0.95) for visual functioning and 0.90 (95% CI, 0.85–0.94) for appearance scores (Table 7).
Figures 2 and 3 display a scatter diagram of the differences between the first and second measurements plotted against their means, with a presentation of the limits of agreement (mean difference ± 1.96 SD) at 1.27 ± 21.42 for the functioning subscale and − 1.69 ± 22.73 for the appearance subscale.
The results of this study showed good reliability and validity of the Thai version of the GO-QOL. High Cronbach’s alpha and intraclass correlation coefficients suggested good reliability and internal consistency of the questionnaire. The Bland-Altman plot also confirmed the repeatability of the questionnaire in both subscales; this result corresponded to the original study by Terwee . The high content validity index of each item and the average of all items supported good content validity, whereas construct validity was supported by the correlation of QOL scores with disease severity, clinical activity scores, and exposure parameters. There were significant differences in the means of visual functioning scores among the varying severity groups. Interestingly, the appearance subscale score was weakly correlated with disease severity. This result was similar to the original study by Terwee et al. . The mean appearance scores for the different severity groups were not significantly different, suggesting that GO severity might have less influence than individual perception on changing appearance.
Gerding et al. reported that the HRQoL scores in GO patients did not correlate with the duration, severity or activity of the disease. They concluded that usual clinical assessment seems to be unrelated to the negative impact on quality of life .
Regarding well-being among Graves’ disease patients with or without ophthalmopathy assessed by HRQoL, GO-QOL and Mini-Mental State Examination, Riguetto CM et al. found that the presence of ophthalmopathy was a factor related to poor quality of life .
From the perspective of patients, clinical measurements, such as extraocular muscle movement or the degree of proptosis, are of limited interest. Instead, patients usually consider to impaired physical and psychosocial issues in daily life [6, 22]. The difference between objective clinical measurements and patients’ experiences cannot be explained only by the severity of signs and symptoms but by individuals’ characteristics and the environment, such as expectations, motives, past experiences, stress coping, doctor-patient relationships and social support . Health-related quality of life is the most important indicator of successful treatment when the primary aim is to improve quality of life rather than to prolong life .
Previous GO-specific quality of life studies have shown only a moderate correlation between QOL and disease severity and a low correlation in appearance subscales [4, 9, 16, 24, 25]. This evidence accentuates the disparity between objective clinical assessment and subjective quality of life; hence, assessing both objective and subjective measurements is the best approach for GO treatment programs .
There are some limitations to this study. First the translators had no background in medicine or understanding of GO. With regard to some ambiguous terms referring to GO in the Thai language, the translators were informed of concepts included in the questionnaire during the first stage of translation. The expert committee worked closely with all the translators in the process.
Content validity refers to the relevance, comprehensiveness and comprehensibility of a questionnaire . It can be evaluated by asking experts and patients . In this study, relevance was evaluated among experts and GO patients; comprehensibility was evaluated among ‘normal subjects’ and patients. However, as another limitation of the study, comprehensiveness was not adequately addressed.
The Thai GO-QOL questionnaire indicates good reliability and validity similar to its prototype. Its scores correlated with clinical activity, disease severity and eye exposure parameters. The Thai version of the GO-QOL can be implemented into thyroid disease treatment programs to evaluate dynamic clinical outcome measurements of Graves’ ophthalmopathy.
clinical activity score
content validity index
European Group on Graves’ Orbitopathy
- MOS SF-24:
24-item Medical Outcome Study Short-Form Health Survey
quality of life
Perros, P., Crombie, A. L., Matthews, J. N., et al. (1993). Age and gender influence the severity of thyroid-associated ophthalmopathy: A study of 101 patients attending a combined thyroid-eye clinic. Clin Endocrine (Oxf), 38, 367–372.
Rao, R., Macintosh, P. W., Yoon, M. K., et al. (2015). Current trends in the management of thyroid eye disease. Curr Opin Ophthalmol, 26, 484–490.
Gerding, M. N., Terwee, C. B., Dekker, F. W., et al. (1997). Quality of life in patients with graves’ ophthalmopathy is markedly decreased: Measurement by the medical outcomes study instrument. Thyroid, 7, 885–889.
Terwee, C. B., Gerding, M. N., Dekker, F. W., et al. (1998). Development of a disease specific quality of life questionnaire for patients with graves’ ophthalmopathy: The GO-QOL. Br J Ophthalmol, 82, 773–779.
Terwee, C. B., Dekker, F. W., Mourits, M. P., et al. (2001). Interpretation and validity of changes in scores on the graves’ ophthalmopathy quality of life questionnaire (GO-Q0L) after different treatments. Clin Endocrinol, 54, 391–398.
Wiersinga, W. M. (2012). Quality of life in graves’ ophthalmopathy. Best Pract Res Clin Endocrinol Metab, 26, 359–370.
Terwee, C. B., Gerding, M. N., Dekker, F. W., et al. (1999). Test-retest reliability of the GO-QOL: A disease-specific quality of life questionnaire for patients with graves’ ophthalmopathy. J Clin Epidemiol, 52, 875–884.
Ponto, K. A., Hommel, G., Pitz, S., et al. (2011). Quality of life in a german graves orbitopathy population. Am J Ophthalmol, 152, 483–490.
Choi, Y. J., Lim, H. T., Lee, S. J., et al. (2012). Assessing graves’ ophthalmopathy specific quality of life in Korean patients. Eye, 26, 544–551.
Park, J. J., Sullivan, T. J., Mortimer, R. H., et al. (2004). Assessing quality of life in Australian patients with graves’ ophthalmopathy. Br J Ophthalmol, 88, 75–78.
Wiersinga, W. M., Perros, P., Kahaly, G. J., et al. (2006). Clinical assessment of patients with graves’ orbitopathy: The European group on graves’ Orbitopathy recommendations to generalists, specialists and clinical researchers. Eur J Endocrinol, 155, 387–389.
Lin, I. C., Lee, C. C., & Liao, S. L. (2015). Assessing quality of life in Taiwanese patients with graves’ ophthalmopathy. J Formos Med Assoc, 114, 1047–1054.
Lynn, M. R. (1986). Determination and quantification of content validity. Nurs Res, 35, 382–385.
Polit, D. F., & Beck, C. T. (2006). The content validity index: Are you sure you know what’s being reported? Critique and recommendation. Res Nurs Health, 29, 489–497.
Halek, M., Holle, D., & Bartholomeyczik, S. (2017). Development and evaluation of the content validity practicability and feasibility of the innovative dementia-oriented assessment system for challenging behaviour in residents with dementia. BMC Health Serv Res, 17, 554.
Fayers, T., & Dolman, P. J. (2011). Validity and reliability of the TED-QOL: A new three-item questionnaire to assess quality of life in thyroid eye disease. Br J Ophthalmol, 95, 1670–1674.
Terwee, C. B., Bot, S. D., de Boer, M. R., et al. (2007). Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol, 60, 34–42.
Sahlberg-Blom, E., Ternestedt, B. M., & Johansson, J. E. (2001). Is good ‘quality of life’ possible at the end of life? An explorative study of the experiences of a group of cancer patients in two different care cultures. J Clin Nurse, 10, 550–562.
Bland, J. M., & Altman, D. G. (1986). Statistical methods for assessing agreement between two methods of clinical measurement. Lancet, 1, 307–310.
Koo, T. K., & Li, M. Y. (2016). A guideline of selecting and reporting Intraclass correlation coefficients for reliability research. J Chiropr Med, 15, 155–163.
Riguetto, C. M., Neto, A. M., & Tambascia, M. A. (2019). The relationship between quality of life, cognition, and thyroid status in graves’ disease. Endocrine, 63, 87–93.
Wilson, I. B., & Cleary, P. D. (1995). Linking clinical variables with health-related quality of life. A conceptual model of patient outcomes JAMA, 273, 59–65.
Guyatt, G. H., Naylor, C. D., Juniper, E., et al. (1997). Users’ guides to the medical literature. XII How to use articles about health-related quality of life Evidence-Based Medicine Working Group JAMA, 277, 1232–1237.
Barrio-Barrio, J., Sabater, A. L., Bonet-Farriol, E., et al. (2015). Graves’ Ophthalmopathy: VISA versus EUGOGO classification, assessment, and management. J Ophthalmol, 2015, 249125.
Son, B. J., Lee, S. Y., & Yoon, J. S. (2014). Evaluation of thyroid eye disease: Quality-of-life questionnaire (TED-QOL) in Korean patients. Can J Ophthalmol, 49, 167–173.
Terwee, C. B., Prinsen, C. A. C., Chiarotto, A., et al. (2015). COSMIN methodology for evaluating the content validity of patient-reported outcome measures: A Delphi study. Qual Life Res, 27, 1159–1170.
The authors would like to thank Dr. Jirapas Anantachai as our language adviser.
Availability of data and materials
All data generated or analyzed during this study are included in this published article.
Department of Medical Service, Ministry of Public Health, Thailand. The roles of the funding body involved data collection, analysis and writing of the manuscript.
Ethics approval and consent to participate
All procedures performed in this study involving human participants were in accordance with the ethical standards of the Mettapracharak Hospital Institutional Research Committee, and informed consent was obtained from all individual participants included in the study.
Consent for publication
All authors have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Lumyongsatien, M., Keeratidamkerngsakul, B., Pornpanich, K. et al. Development and psychometric properties of the Thai Graves’ ophthalmopathy quality of life (GO-QOL) questionnaire. J Patient Rep Outcomes 4, 1 (2020). https://doi.org/10.1186/s41687-019-0164-8