"Query re use of post-hoc unpublished outcome data: Scoring system for the Chalder fatigue scale, Wearden 2010.
I would like to highlight what appears to be a discrepancy within the Cochrane review [1] with respect to the analysis of data from Wearden 2010 [2,3].
Throughout the Cochrane review (please see details below), the impression is given that only protocol-defined and published data or outcomes were used for the Cochrane analysis of the Wearden 2010 study.
However, this does not appear to be the case and, to the best of my knowledge, instead of using protocol-defined or published data, the Cochrane analyses of fatigue for the Wearden 2010 study, appears to have used an alternative unpublished set of data.
The relevant analyses of fatigue in the Cochrane review are: Analyses: 1.1, 1.2, 2.1 and 2.3. Each of these analyses states that the “0,1,2,3” scoring system was used for the Chalder fatigue questionnaire. This scoring system is known as the Likert scoring system and uses a fatigue scale of 0-33 points.
However, to the best of my knowledge, data or analyses using this scoring system were not proposed in the Wearden 2010 trial protocol [3], and were not included in Wearden 2010 [2], and have not previously been formally (i.e. via peer review) published by Wearden et al. A post-hoc informal analysis using this data has been informally released by Wearden et al. as a BMJ Rapid Response comment [4].
In the Cochrane review, the analyses using the 0,1,2,3 scoring system contradict text within the section “Characteristics Of Studies”, in relation to Wearden 2010: Under “Outcomes”, it is stated that Chalder fatigue was measured using the 0,0,1,1 scoring system using a scale from 0-11 points: “Fatigue (Fatigue Scale, FS; 11 items; each item was scored dichotomously on a 4-point scale (0, 0, 1 or 1)”.
Wearden 2010 pre-specified Chalder fatigue questionnaire scores as a primary outcome at 70 weeks, and as a secondary outcome immediately after treatment at 20 weeks. The scoring, in both cases, used the 0,0,1,1 system, with a scale of 0-11. This scoring system was described both in the trial protocol [3] and the main results paper published in 2010 [2].
The Likert (0,1,2,3) scoring system was neither proposed in the trial protocol, nor formally published, and so the Likert scores should be considered post-hoc. Even if it is argued that the Chalder fatigue questionnaire (irrespective of the scoring system) was pre-defined as a primary outcome measure, data using the Likert scoring system was neither proposed nor published and so the data itself must surely be considered to be post-hoc. The outcome analyses using the Likert data must be considered post-hoc.
Simply changing a scoring system may, at first glance, appear not to be a significant or major adjustment, however, we do not know what difference it made because a sensitivity analysis has not been published.
I cannot find any explanation within the Cochrane review that explains why the Cochrane review has replaced pre-defined published data with an unpublished and post-hoc set of data.
Is it normal practice for a Cochrane meta-analysis to selectively ignore the pre-defined primary outcome data for a trial, and to selectively include and analyse post-hoc data? I wonder if some clarity could be shed on this situation?
I suggest that the post-hoc data are replaced with the original published data. Otherwise, the post-hoc data should be clearly labelled as such and the risk of bias analysis amended accordingly; and an explanation should be included in the review explaining why an apparently adequate pre-defined set of data has been replaced with an apparent novel set of post-hoc data.
Also, I suggest that any discrepancies that I will outline below, should be corrected where necessary; Either the analyses (1.1, 1.2, 2.1 and 2.3) should be amended or the description of the data should be amended so it is not incorrectly labelled as protocol-defined and published data with a “low risk” of bias.
Discrepancies within the text of the Cochrane Analysis.
Please note that all page numbers used below are pertinent to the current version (version 4) of the Cochrane review in PDF format.
1. On page 28 of the Cochrane review [1], in section “Potential biases in the review process”, under the heading “Potential bias in the review process”, in relation to the review in general, it is stated that: "For this updated review, we have not collected unpublished data for our outcomes..." However, as explained above, this is not the case for the Wearden 2010 fatigue data for which unpublished data has been used in the Cochrane analysis.
2. On page 45 of the review, in section “Characteristics Of Studies”, specifically in relation to Wearden 2010 [2,3], it is stated that only protocol-defined outcomes were used: "all relevant outcomes are reported in accordance with the protocol". "Selective reporting (reporting bias)" is rated as "low risk". However, as explained above, this is not the case, because the Wearden 2010 fatigue data (used in the Cochrane analysis) was not proposed in the protocol. If the data is post-hoc, then the “low-risk” category will need to be revised.
3. On page 44 of the review, in section “Characteristics Of Studies”, in relation to Wearden 2010 [2,3], under “Outcomes”, it is stated that Chalder fatigue was measured using the 0,0,1,1 scoring system using a scale from 0-11 points: “Fatigue (Fatigue Scale, FS; 11 items; each item was scored dichotomously on a 4-point scale (0, 0, 1 or 1)”. Wearden 2010 did indeed use the 0,0,1,1 scoring system for the Chalder fatigue scale: This scoring system was proposed in the trial protocol and published with the main outcome data in Wearden 2010. However, as explained above, this scoring system has not been used in the Cochrane analysis.
4. If figures 2 and 3 also contain discrepancies, after any amendments to the review, then they should be amended accordingly.
There may be other related discrepancies and inaccuracies in the text that I haven’t noticed.
I thank the Cochrane team in advance for giving this submission careful consideration, and for making amendments to the analysis, and providing explanations, where appropriate. I hope you will agree that clarity, transparency and accuracy in relation to the analysis is paramount.
References:
1. Larun L, Brurberg KG, Odgaard-Jensen J, Price JR. Exercise therapy for chronic fatigue syndrome. Cochrane Database Syst Rev. 2016; CD003200.
2. Wearden AJ, Dowrick C, Chew-Graham C, et al. Nurse led, home based self help treatment for patients in primary care with chronic fatigue syndrome: randomised controlled trial. BMJ. 2010; 340:c1777.
3. Wearden AJ, Riste L, Dowrick C, et al. Fatigue Intervention by Nurses Evaluation – The FINE Trial. A randomised controlled trial of nurse led self-help treatment for patients in primary care with chronic fatigue syndrome: study protocol. BMC Med. 2006; 4:9.
4. Wearden AJ, Dowrick C, Chew-Graham C, et al. Fatigue scale. BMJ Rapid Response. 2010.
http://www.bmj.com/rapid-response/2011/11/02/fatigue-scale-0 (accessed April 16, 2016).