Investigating the factors associated with meaningful improvement on the SF-36-PFS ... young people, 2023, Crawley et al

SNT Gatchaman

Senior Member (Voting Rights)
Staff member
Investigating the factors associated with meaningful improvement on the SF-36-PFS and exploring the appropriateness of this measure for young people with ME/CFS accessing an NHS specialist service: a prospective cohort study
Daisy Gaunt; Amberly Brigden; Chris Metcalfe; Maria Loades; Esther Crawley

Objectives
Paediatric myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is relatively common and disabling, but little is known about the factors associated with outcome. We aimed to describe the number and characteristics of young people reaching the 10-point minimal clinically important difference (MCID) of SF-36-Physical Function Subscale (SF-36-PFS) and to investigate factors associated with reaching the MCID.

Design
Prospective observational cohort study.

Setting
A specialist UK National Health Service ME/CFS service, Southwest England; recruitment between March 2014 and August 2015.

Participants
193 eligible patients with ME/CFS aged 8–17 years reported baseline data. 124 (65%) and 121 (63%) with outcome data at 6 and 12 months, respectively.

Outcome measures
SF-36-PFS (primary outcome). Chalder Fatigue Questionnaire, school attendance, visual analogue pain scale, Hospital Anxiety and Depression Scale, Spence Young People Anxiety Scale, Clinical Global Impression scale and EQ-5D-Y (secondary).

Results
At 6 months 48/120 (40%) had reached the MCID for SF-36-PFS. This had increased to 63/117 (54%) at 12 months. On the Clinical Global Impressions, 77% and 79% reported feeling either a little better, much better or very much better. Those with worse SF-36-PFS at baseline assessment were more likely to achieve the MCID for SF-36-PFS at 6 months (odds ratio 0.97, 95% confidence interval 0.96 to 0.99, p value 0.003), but there was weaker evidence of effect at 12 months (OR 0.98, 95% CI 0.97 to 1.00, p value 0.038). No other factors at baseline were associated with the odds of reaching the MCID at 6 months. However, at 12 months, there was strong evidence of an effect of pain on MCID (OR 0.97, 95% CI 0.95 to 0.99, p value 0.001) and SF-36-PFS on MCID (OR 0.96, 95% CI 0.94 to 0.98, p value 0.001).

Conclusions
40% and 54% of young people reached the MCID at 6 and 12 months, respectively. No factors at assessment (other than SF-36-PFS at 6 months, and pain and SF-36-PFS at 12 months) are associated with MCID of SF-36-PFS at either 6 or 12 months. Further work is needed to explore the most appropriate outcome measure for capturing clinical meaningful improvement for young people with ME/CFS. No data are available, due to lack of consent from participants to share data.

Link | PDF (BMJ Open)
 
Paediatric myalgic encephalomyelitis or chronic fatigue syndrome (ME/CFS) is defined as generalised fatigue, lasting at least three months, ‘causing disruption of daily life, persisting after routine tests and investigations have failed to identify an obvious underlying “cause”’.1 2

References:
  1. ↵National Institue for Health and Care Excellence (NICE)
    . Chronic fatigue syndrome/myalgic cencaphalomyelitis (or cncephalopathy): diagnosis and management: national collaborating centre for primary care; 2007.
    Google Scholar
  2. ↵Royal College of Paediatrics and Child Health
    . Evidence based guideline for the management of CFS/ME (Chronic Fatigue Syndrome/Myalgic Encephalopathy) in children and young people. London, 2004.
    Google Scholar
This should never have got past peer review.
 
those with worse SF-36-PFS at baseline assessment were more likely to achieve the MCID for SF-36-PFS at 6 months (odds ratio 0.97, 95% confidence interval 0.96 to 0.99, p value 0.003),

I think there is a real issue here in the SF-36 physical function scale. The scale is a fairly random set of questions around physical abilities. Looking at the questions I would say that those in the middle are quite close to each other where as those at the ends are much different. Hence I would guess that some points in the scale are easier to move than others (which perhaps defeats the purpose of a scale?). I did some cluster analysis on SF36 scores and there were clear groupings for very low and high scores but in the middle there was no easy well defined order for scoring. Which suggests that there is a problem here.
 
The study was open to recruitment between March 2014 and August 2015

Although it's arguable to open the introduction of a paper submitted on "12 October 2022" referencing guidelines that are a year out of date.

They do say in the discussion —

Ninety-nine percent of participants (191/193) meet the eligibility as defined by the NICE guidance 2021, as all participants have postexertional malaise and 191 participants have either ‘sleep disturbance including unrefreshing sleep’ or ‘cognitive dysfunction’. All participants had to have debilitating fatigue and PEM to be included in the study. Of our sample, 89% (172/193) had sleep disturbance, including unrefreshing sleep and cognitive dysfunction and, therefore, met the diagnostic criteria for the 2021 NICE guidelines.

I couldn't see NICE 2021 formally referenced, though.
 
Recruitment
The study was open to recruitment between March 2014 and August 2015,

I guess that explains the old NICE guideline used as reference for diagnostic criteria, but this is a misuse even of the 2007 NICE guideline which include other symptoms for diagnosis. I seem to remember Crawley has played this trick before, saying something like NICE criteria used but adapted, which meant removing all other symptoms needed for diagnosis.

I guess what they've done is gone back over old patient files and picked out all the patients between those dates, and collated their scores on the various questionnaires, then decided SF36 would be a good one to write a paper about, maybe for a post grad degee.

Minimally important difference was classed as a change of 10 points or more. Since each small shift on one descriptor is 5 points, this simply means the child has filled in the questionnaire saying they have some difficulty instead of a lot of difficulty on a couple of descriptors of activities.

For those unfamiliar with SF-36 physical functioning, here it is:

The following items are about activities you might do during a typical day. Does your health now limit you in these activities? If so, how much?

Yes, limited a lot - score 0
Yes, limited a little - score 5
No, not limited at all - score 10

1. Vigorous activities, such as running, lifting heavy objects, participating in strenuous sports

2. Moderate activities, such as moving a table, pushing a vacuum cleaner, bowling, or playing golf

3. Lifting or carrying groceries

4. Climbing several flights of stairs

5. Climbing one flight of stairs

6. Bending, kneeling, or stooping

7. Walking more than a mile

8. Walking several blocks

9. Walking one block

10. Bathing or dressing yourself

Add your score to make a total out of 100.
 
Minimally important difference was classed as a change of 10 points or more. Since each small shift on one descriptor is 5 points, this simply means the child has filled in the questionnaire saying they have some difficulty instead of a lot of difficulty on a couple of descriptors of activities.

Basically two answers changed. And these can be very subjective in terms of moving say from "limited a lot" to "limited a little"

The SF36 version @Trish gives is the US version I think the UK one has some translation of walk a block (not meaningful in the UK)
 
I think there is a real issue here in the SF-36 physical function scale. The scale is a fairly random set of questions around physical abilities. Looking at the questions I would say that those in the middle are quite close to each other where as those at the ends are much different. Hence I would guess that some points in the scale are easier to move than others (which perhaps defeats the purpose of a scale?). I did some cluster analysis on SF36 scores and there were clear groupings for very low and high scores but in the middle there was no easy well defined order for scoring. Which suggests that there is a problem here.

The CBT/GET program I followed used this scale and the accompanying questions I think or at least something very similar. They did so at intake, then again when I started the procedure. Then we'd have a workweek of treatment, a week off and that was repeated for 4 or 5 weeks of program. Then a follow-up week about a half year later. Then they sent the list again a year after that. But even at the start I couldn't remember what I jotted down in the week previous. I think you had to rate things on a severity scale among other things but most of it is completely arbitrary. I'm now reading through the questionnaire again.

PHYSICAL HEALTH PROBLEMS: During the past 4 weeks, have you had any of the following problems with your work or other regular daily activities as a result of your physical health?

Cut down the amount of time you spent on work or other activities. Yes/No

Straight from the outset this first question on the physical health part poses enormous problems. Back when I started the program I was doing some sort of paper route, it was about the only work I could find and wanted to be a good little worker. When I wasn't feeling well, the time I'd spent on the route would actually increase. So a route that would normally take about 2 hours would now take 4, biking very slowly. So did I cut down the amount of time, no. Is that a sign of getting better, the opposite.

On top of that, we all have to cope with brainfog. I couldn't tell you right now how well the previous week went and how the current week stacks up against that. Impossible for me to do right now. It would be way handier if there were devices that could measure activity for us or something. Technology might not quite be there yet though.
 
Aside from being a superficial and mediocre study, published 8 years after it ended, this would then dispute all their claims about behavioral stuff, or whatever. But of course their conclusion is always the same: give us more money so we can waste it doing the same useless nonsense we've been doing for decades. Always money for them, never delivering anything for patients other than wasted lives and amplified suffering.

There is clearly a natural process of recovery, when it happens, we are seeing this plainly in Long Covid, which also disputes literally everything that has ever been claimed about recovery and treatment, since all meagre improvements they can boast about are far better explained by this being a natural process where time is the most important factor. Aside from using an invalid definition anyway, which makes this inapplicable to ME/CFS, merely to a generic idea of "fatigue".

It's also a fact that the recovery process is not final. I see so many long haulers who recovered, resumed their normal lives, and relapsed. Sometimes more than onces, and often worse and for longer. The same happened to me, although my initial recovery was not really complete, I was never back to my former healthy self. Single points in time are useless here, this is a long-term condition and it needs longitudinal studies with far more subjects.

There are no factors that are known yet, and of course doing yet another superficial study that looks at a few generic things won't accomplish that. It's been done plenty before, including by Crawley and her close colleagues. There is a biological process happening and knowledge of it has been completely impeded by mediocre research from lazy ideologues doing zero-effort copy-paste jobs like this.

But they keep getting funded anyway. They keep getting approved to do the exact same nonsense over and over again. Somehow. A giant con where we are the mark and the public pays for this blatant corruption.
 
Further evidence clinics don't help
Taking a step back, it looks to me as if these findings show that the clinic made little meaningful difference to the people they were treating. In fact, the results look similar to those for the Pace trial control group (standardised medical care) at 12 months.

At 63% follow up is better than previous ME clinic observational studies, but still produces suspect data because it excludes dropouts, many of whom are likely to have deteriorated (given the nature of the treatments).

Let’s park that for the moment.

53% of those followed (with 12 months data) improved by 10 points on a 0-100 point scale after 12 months. Critically, there is no control group. Typically, you would expect to see a control group improve modestly over the same period.

UPDATE: As a comparison, the Pace trial control group (“standardised medical care“) SF36 PF score improved by 11.6 points over 12 months (baseline 39.2, 12 mo 50.8), which is similar to the 53% improving by 10 points M (presumably 10 points or more) found in the current study.

So the data doesn’t exactly inspire confidence in the methods or efficacy of the clinic.

[Deleted an erroneous point about odd ratios for SF 36 as a predictor for outcomes.]

Surely, the clinicians would worry that they were achieving so little for their patients? I haven’t read beyond the abstract, maybe the discussion section has the handwringing.


.
 
Last edited:
From memory this teams' definition of PEM is not wonderfully accurate.

There is never any acknowledgement of key issues in paediatrics.

the huge power imbalance which can manifest in therapist pleasing or telling people what you think they want to hear

having school attendance as a measure - there is huge pressure to attend ( FII is commonly kicked off by lack of attendance ) and being in school means nothing if you can't function in school or afterwards .

Brain fog - makes comparisons dodgy if you can't really remember what you did / how you felt yesterday let alone weeks ago . Even doing daily charts are not necessarily accurate.

Anxiety and mental health scales being used - given OI issues are not understood / fully acknowledged in paediatrics, common symptoms of known comorbiditues are are attributed to anxiety and MH issues . There needs to be a meaningful ME appropriate scale.

Where are the objective measures .
 
Interesting statement - basically saying you can't check our work,
The fact that the hallowed peer-review journals still allow this, in 2023, is beyond concerning. This is just straight corruption of the scientific process.

Every time I start thinking I might be a little hard on the BPS club and their supporters (including the journals), they prove me wrong.

Every. Effing. Time. :mad:

There is, as they like to say in psych, a pattern of behaviour here, and after several persistent and unrepentant decades of it there is no possible dispute about that.

The real question now is why are they still allowed to get away with it?
 
Mean days of school attendance:
Baseline 2.1 (presumably days/week); 6 months 2.1; 12 months 2.6
Yes, between the dropouts and natural improvers, these results are nothing.

Number receiving home tuition:
Baseline 13%; 6 months 24%; 12 months 24%
Possibly a good outcome in terms of care for the young person, but recovery it is not.
 
Mean days of school attendance:
Baseline 2.1 (presumably days/week); 6 months 2.1; 12 months 2.6
Yes, between the dropouts and natural improvers, these results are nothing.

It can be more complex than that. If the service helps a school plan a bit better such as providing a rest area and suggesting rests between lessons then the school trying to make a more ME environment could help school attendance.

Or if children are told they must attend then this becomes their only energy usage.
 
Back
Top Bottom