- Open Access
Establishment of MOS-SF36 percentile ranks in the general youth French population
BMC Psychology volume 10, Article number: 74 (2022)
The SF-36 is a generic quality of life questionnaire, massively translated and widely used to obtain physical and mental health status. However, validation work in the French language was carried out over a generation ago. The objective of this study was to obtain the norms of the SF-36 in the French young population.
The sample consisted of 958 non-pre-screened French people aged between 18 and 24 years.
The internal consistencies of the scales were high and the metrics associated with the factor structure were satisfactory. In general, women presented significantly higher scores than men.
Our results suggest that the SF-36 remains a reliable tool for studying quality of life in the young French population.
Since the 1990s, health-related quality of life has gradually become a major theme in clinical research . Indeed, although the health status of a population is most often expressed in quantitative terms such as life expectancy, mortality, or morbidity, a growing number of studies are now interested in measuring health status, and in particular its relationship with quality of life. Nowadays, the patient's perceived quality of life is placed at the center of the care process. It can thus reflect the satisfaction and perceived benefits of an intervention, which could not necessarily be measured by other parameters . Thus, the importance of quality of life assessment is such that it has led to the establishment of indicators centered on patient-reported outcomes measures (PROMs) by the French High Authority for Health (HAS) . These indicators are beginning to be used by regulatory and reimbursement authorities, who require them as part of the decision-making process .
Thus, a large number of self-reported questionnaires have been developed to measure these dimensions, notably the very broad Medical Outcomes Questionnaire 149-item from the RAND Health Insurance Experiment .
Several tools have been developed from this original questionnaire, including the derived SF-20 and SF-36 versions, which have shown more precise discriminatory abilities in their validation studies . Thus, the Medical Outcomes Study Short Form (SF-36) has become one of the most widely used self-reported quality of life questionnaires for assessing health status, given its discriminatory properties of well-being at the level of clinical groups , and has thus been used extensively in the monitoring of clinical practice outcomes and medical treatment effects. This questionnaire measures quality of life on the basis of eight dimensions or concepts that are frequently used in health studies. These eight dimensions are estimated from eight subscales that examine general health; mental health (with respect to anxiety and depression components); physical functioning; limitation of work capacity or daily activities due to physical functioning as well as that due to emotional disorders; vitality; pain; and social functioning. The SF36 has already been evaluated numerous times for its differential performance in comparison with other perceived quality of life questionnaires in different clinical settings, including the Euroqol questionnaire ; the Sickness Impact Profile ; and the Hopkins Symptom Checklist 25  with similar qualities. However, it would appear that the SF-36 stands out for its operational qualities in the assessment of general health, as well as its ease and speed of administration [9, 10].
The questionnaire has already been translated several times into French, and norms have been obtained for the French  and Swiss  populations, but they were established almost a generation ago. In addition, no work has been done to our knowledge to establish SF-36 norms expressed as percentile ranks. The purpose of this study was to establish the norms of the SF-36 in the youth French population (15–24 years) as percentile ranks and to reassess its psychometric properties in terms of reliability and validity, in order to provide a baseline in the general young population and to provide a tool that can be used in clinical routine.
Material and methods
The questionnaire was adapted in the formulation of the items from the version proposed by (Richard et al. 2000) . It was then computerized using the Google Form tool. Sampling was carried out randomly by distributing the questionnaire on social networks, without direct contact with the participants and on the basis of anonymous voluntary contributions. Only age, gender, and date of completion were collected, ensuring complete anonymity for participants. An exclusion criterion in terms of age (> 24 years) was applied after data collection during data preprocessing.
Nine hundred and fifty-eight (n = 958) not preselected adults (mean age = 22.1 years; SD = 1.76) from the general French population participated in this study. Participants screening was completed online, and ethical consents were obtained online in agreement with the Declaration of Helsinki. The study was approved by the “Comité de Protection des Personnes Sud-Est VI”. Full measures were available for all subjects. No minors were included in the study.
Questionnaire: the MOS-SF36
The SF-36 is a short 36-item behavioural questionnaire measuring eight quality of life dimensions: general health (GH-5 items), vitality (VT-4 items), bodily pain (BP-2 items), limitation of physical problems (RP-4 items), limitation of emotional problems (RE-3 items), mental health (MH-5 items), and physical functioning (PF-10 items), social functioning (SF-2 items). The SF-36 also includes an item to estimate the change in the subject's health status during the year preceding the assessment (HC).
For each dimension, item responses were re-encoded on a scale ranging from 0 (best) to 100 (worst), following the standard SF-36 scoring algorithm , adapted for a 5-point Likert scale. The algorithm used is available in Table 1 and the full questionnaire used in the study is available in Additional file 1.
For the calculation of the composite scores, we averaged the PF, RP, BP and GH subscales for the physical composite score (PCS) and averaged the VT, SF, RE and MH subscales for the mental composite score (MCS).
Internal consistency and reliability
Internal consistency and reliability of the items were examined by Cronbach’s alpha. Reasonable acceptability criterion was set to .70 ≤ ɑ ≤ .90 with exceeding lower bound meaning a low reliability, and exceeding higher bound meaning too many similar items, decreasing the scale’s true reliability [14, 15].
In order to test our 8-factors model for SF-36 and assess construct validity, we conducted a confirmatory factor analysis. Generalized least squares method was performed in order to test the fit capability of the factor structure. Model fit was assessed using the following fit indices: we used the χ2 test statistic for absolute fit; the comparative fit index (CFI) and Tucker-Lewis Index (TLI) for fit relative to a null model [16,17,18]; the Standardized Root Mean Square Residual  and the Root Mean Square Error of Approximation  for overall fit. Accordingly to Hu and Bentler (1999) , we assumed that our 8-factors model fit well if CFI > .95; TLI > .95; RMSEA < .06 and SRMR < .08. All statistical analyses were coded in R with Lavaan library and interpreted in RStudio v1.0.143.
Descriptive statistics of the study sample are shown in Table 2. Results showed that women reported poorer health compared to men for all variables except for BP.
Internal consistency and reliability
Results concerning internal consistency and reliability are presented in Table 3 and Additional file 3: Table S12. Data showed that the SF-36 questionnaire carries high internal consistency and reliability even when an item is dropped.
The Cronbach alpha was measured at .88 [CI95% = .87–.89] for the full SF-36 questionnaire. When each of the SF-36 items was removed from the analysis in order to assess robustness, Cronbach’s alpha remained high (varying from .87 to .89 with meanɑ = .88, SD = .007; Additional file 3: Table S12). Measures for the subscales ranged from .78 to .85. All measures were above the minimum acceptable rate of .70 and was close to the maximum expected value of .9 (Table 3).
Confirmatory factor analysis
Confirmatory factor analysis suggested that the 8-factor model fit well with the SF-36 questionnaire, except for the CFI and TLI which remains slightly below the pre-defined cut-off [χ2(595) = 2247, p < .001, CFI = .89, TLI = .88, RMSEA = .058, SRMR = .053]. We assumed that, based on these indices, this sample has an acceptable fit to the 8-factor model.
Additional file 3: Table S13 shows the standardized factor loadings for the SF-36. The analysis revealed factor loadings in the range of .5 to .86 for the GH factor, .69 to .85 for the MH factor, .69 to .73 for the VT factor, .77 to .87 for the BP factor, .85 to .87 for the SF factor, .69 to .81 for the RE factor, .62 to .73 for the RP factor, and .41 to .68 for PF.
Justification of the normative approach
A two-ways ANOVA (dimension * gender) on the measured score showed a significant effect of gender (F(1,41202) = 84.75, p < .001), dimension (F(8,41202) = 8942.02, p < .001), and a significant interaction between gender and dimension (F(8,41202) = 18.59, p < .001). Since this significant interaction indicated that the distribution within the dimensions of the SF-36 was directly dependent on the factor of gender, we decided to separate them in the setting of the norms.
Normative data for the SF-36 composite scores expressed in percentiles are presented in Table 4. The full percentiles for the 8 subscales, the 2 composite factors and HC item are available in Additional file 3: Tables S1–S11. Women showed higher scores compared to men for each scale except for BP.
The present study verified the reliability and the internal consistency of the French version of the 36-Item Short Form Survey (SF-36) questionnaire in a young population.
Cronbach’s alpha measures suggested that the SF-36 questionnaire was internally reliable, with measured alphas remaining in the .70 ≤ ɑ ≤ .90 interval recommended by Bland and Altman (1997) and DeVellis (2003).
Further confirmatory factor analysis supported the eight-factor structure of the SF-36 questionnaire, with items 1; 33; 34; 35; 36 grouped in the “General Health” factor, items 23; 27; 29; 31 grouped in the “Vitality” factor, items 21; 22 grouped in the “Body Pain” factor, items 13; 14; 15; 16 grouped in the “Role limitation: Physical” factor, items 17; 18; 19 grouped in the “Role limitation: Emotional” factor, items 24; 25; 26; 28; 30 grouped in the “Mental Health” factor, items 3; 4; 5; 6; 7; 8; 9; 10; 11; 12 grouped in the “Physical Functioning” factor, and items 20; 32 grouped in the “Social Functioning” factor. Analysis suggested that this model is close to the standards defined by Hu and Bentler (1999), with only CFI and TLI which remains slightly below the cut-off.
We then performed an analysis of variance that suggested some gender differences in self-reported responses, with women reporting lower quality of life than men for all domains studied except BP. In a general manner, authors commonly agrees that women report a lower quality of life than men [21,22,23], especially in Western countries where lower quality of life scores were measured in women, in correlation with higher depression and sleep disorder score measures . However, our work is, to our knowledge, the only one to report gender differences between all scales, except for body pain. This observation could be explained by the existing difference between men and women regarding pain perception. Previous studies have indeed shown gender differences regarding the experience of pain . However, it is commonly accepted that women typically report more severe and frequent complaints about pain , including in pain thresholding experiments , suggesting that women should report higher scores. This lack of significant difference in the Body Pain dimensions could thus be explained by the phenomenon of habituation, measured in experimental pain paradigms [28,29,30], which would lead women to score more positively on items measuring perceived pain, despite experiencing greater and more frequent pain events overall. This hypothesis is strengthened by the experimental pain literature, some of whose results suggest a more rapid adaptation and habituation to pain in women in contrast to men [31,32,33], whose effects are objectivable at the neurophysiological level .
Finally, we established normative and percentile data for all eight subscales of the SF-36, as well as for its single-item subscale and in its physical and mental composite scores.
The present work strengthened existing SF36 data regarding its internal consistency in measuring physical and mental health. The study provides norms expressed in percentile ranks for the young French population.
The study has some limitations. First of all, the representativeness of the sample seems limited, as the observations were collected on the basis of volunteers frequenting the social networks. Moreover, given the lack of contact between the participants and the experimenters, it was not possible to control whether some participants completed the questionnaire more than once, nor estimate the real response rate or evaluate the test–retest reliability. Furthermore, we did not conduct an examination in terms of convergent and discriminant validities. Finally, although the description of the questionnaire clearly identified the target population as the general healthy population, it was not possible to control for the presence of individuals with medical conditions in the sample.
The dataset analyzed during the current study is available in Additional file 2 (“sf36_supplementaryFile2.xlsx”).
Murdaugh C. Health-related quality of life as an outcome in organizational research. Med Care. 1997;35:NS41.
Haslam A, Herrera-Perez D, Gill J, Prasad V. Patient experience captured by quality-of-life measurement in oncology clinical trials. JAMA Netw Open. 2020;3:e200363.
Décision n° 2021.0183/DC/SEVOQSS du 1er juillet 2021 du collège de la Haute Autorité de santé portant adoption du rapport « Panorama d’expériences étrangères et principaux enseignements sur les indicateurs de type PROMs et PREMs ». Haute Autorité de Santé. https://www.has-sante.fr/jcms/p_3277507/fr/decision-n-2021-0183/dc/sevoqss-du-1er-juillet-2021-du-college-de-la-haute-autorite-de-sante-portant-adoption-du-rapport-panorama-d-experiences-etrangeres-et-principaux-enseignements-sur-les-indicateurs-de-type-proms-et-prems.
Baldwin M, Spong A, Doward L, Gnanasakthy A. Patient-reported outcomes, patient-reported information. Patient Patient Centered Outcome Res. 2011;4:11–7.
Tarlov AR, et al. The medical outcomes study. An application of methods for monitoring the results of medical care. JAMA. 1989;262:925–30.
McHorney CA, Ware JE, Raczek AE. The MOS 36-Item Short-Form Health Survey (SF-36): II. Psychometric and clinical tests of validity in measuring physical and mental health constructs. Med Care. 1993;31:247–63.
Ware JE, Gandek B. Overview of the SF-36 Health Survey and the International Quality of Life Assessment (IQOLA) Project. J Clin Epidemiol. 1998;51:903–12.
Picavet HSJ, Hoeymans N. Health related quality of life in multiple musculoskeletal diseases: SF-36 and EQ-5D in the DMC3 study. Ann Rheum Dis. 2004;63:723–9.
Ho AK, et al. Health-related quality of life in Huntington’s disease: a comparison of two generic instruments, SF-36 and SIP. Mov Disord. 2004;19:1341–8.
Strand BH, Dalgard OS, Tambs K, Rognerud M. Measuring the mental health status of the Norwegian population: a comparison of the instruments SCL-25, SCL-10, SCL-5 and MHI-5 (SF-36). Nord J Psychiatry. 2003;57:113–8.
Leplège A. Le questionnaire MOS SF-36: Manuel de l’utilisateur et guide d’interprétation des scores. Bruxelles: De Boeck Secundair; 2001.
Richard J-L, et al. Validation et normes du SF-36 dans la population du canton de Vaud. Lausanne: Institut universitaire de médecine sociale et préventive (IUMSP); 2000.
Ware JE. SF-36 Health Survey. In: Maruish ME, editor. The use of psychological testing for treatment planning and outcomes assessment. Lawrence Erlbaum Associates Publishers; 1999. p. 1227–46.
Bland JM, Altman DG. Cronbach’s alpha. BMJ. 1997;314:572.
DeVellis RF. Scale development: theory and applications. Beverly Hills: SAGE; 2003.
Bentler PM. Comparative fit indexes in structural models. Psychol Bull. 1990;107:238–46.
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model. 1999;6:1–55.
Tucker LR, Lewis C. A reliability coefficient for maximum likelihood factor analysis. Psychometrika. 1973;38:1–10.
Bentler PM. EQS: structural equations program manual. Undefined. 1989. https://www.semanticscholar.org/paper/EQS-%3A-structural-equations-program-manual-Bentler/3b39d1d27934a461f04e0e076ddb6da5b87193b0.
Steiger JH. Statistically based tests for the number of common factors. 1980.
El Osta N, et al. Validation du SF-36, questionnaire générique de la qualité de vie liée à la santé chez les personnes âgées au Liban. East Mediterr Health J. 2019;25:706–14.
Salem S, Malouche D, Romdhane HB. Tunisian population quality of life: a general analysis using SF-36. Eastern Mediterr Health J. 2019;25:613–21.
Wong CKH, Mulhern B, Cheng GHL, Lam CLK. SF-6D population norms for the Hong Kong Chinese general population. Qual Life Res. 2018;27:2349–59.
Olsen CDH, Möller S, Ahrenfeldt LJ. Sex differences in quality of life and depressive symptoms among middle-aged and elderly Europeans: results from the SHARE survey. Aging Ment Health. 2021;1–8. https://doi.org/10.1080/13607863.2021.2013434.
Koons AL, Rayl Greenberg M, Cannon RD, Beauchamp GA. Women and the experience of pain and opioid use disorder: a literature-based commentary. Clin Therap. 2018;40:190–6.
Dao TT, LeResche L. Gender differences in pain. J Orofac Pain. 2000;14:169–84.
Soetanto AL, Chung JW, Wong TK. Are there gender differences in pain perception? J Neurosci Nurs. 2006;38:172.
Greffrath W, Baumgärtner U, Treede R-D. Peripheral and central components of habituation of heat pain perception and evoked potentials in humans. Pain. 2007;132:301–11.
Rennefeld C, Wiech K, Schoell ED, Lorenz J, Bingel U. Habituation to pain: further support for a central component. Pain®. 2010;148:503–8.
Smith BW, et al. The role of resilience and purpose in life in habituation to heat and cold pain. J Pain. 2009;10:493–500.
Bingel U, Schoell E, Herken W, Büchel C, May A. Habituation to painful stimulation involves the antinociceptive system. Pain. 2007;131:21–30.
Hashmi JA, Davis KD. Women experience greater heat pain adaptation and habituation than men. Pain. 2009;145:350–7.
Hashmi JA, Davis KD. Effects of temperature on heat pain adaptation and habituation in men and women. Pain®. 2010;151:737–43.
Wang G, Erpelding N, Davis KD. Sex differences in connectivity of the subgenual anterior cingulate cortex. Pain®. 2014;155:755–63.
Ethics approval and consent to participate
The study was approved by the Comité de Protection des Personnes Sud-Est VI (protocol number: 20.02.24.42827). Informed Consent was obtained online before the completion of the survey. The research was performed in accordance with the Declaration of Helsinki.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Trognon, A., Tinti, E., Beaupain, B. et al. Establishment of MOS-SF36 percentile ranks in the general youth French population. BMC Psychol 10, 74 (2022). https://doi.org/10.1186/s40359-022-00786-9
- French population
- Mental health scale
- Physical health scale
- Quality of Life