Barry
Senior Member (Voting Rights)
I think it would be crucial that existing diagnoses not be blindly accepted, but the researchers apply their own diagnostic assesments.I would worry that they ended up recruiting from fatigue clinics.
I think it would be crucial that existing diagnoses not be blindly accepted, but the researchers apply their own diagnostic assesments.I would worry that they ended up recruiting from fatigue clinics.
This is the key question. Not all the details have been finalised yet. Certainly it will include self-report of a ME/CFS diagnosis (I'm not sure this will be bright simply a doctor or by a consultant). There will also be some questionnaire screening (note that the UK ME/CFS biobank is involved).
There's also the possibility of a validation step: taking a subsample of those people who pass the questionnaire screen and giving them a thorough medical assessment/diagnosis.
Yes, with larger sample sizes you have more leeway (because non--cases should be easier to identify in a cluster) but I think it's important there is some quality control of who exactly is passing the standard screen.
Yes, handling people who don't/won't meet the selection criteria will be tricky. But it may be useful to have a group within the GWAS of people who have chronic fatigue the don't meet the criteria.The difficulty, given history, may be targeting the patient group without alienating ( or being seen to denigrate) those who don't meet strict selective criteria.
And hopefully all the different patient groups and charities will come together to back this study: it will need the support of almost everybody in the online community. We will also need to reach beyond into the much bigger pool of patients who are not part of the online community, which will mean using traditional media as well as finding ways to reach people online beyond our bubble.Could this be the project to drive wholesale cooperation?
I think it would be essential to recruit beyond the UK in order to reach the 20,000 target (even 10,000, I suspect).Would multiple countries introduce too much variability?
I don’t know why patients were not told more of what was happening this time. I expect a similar website will be put up for the new project and a Q& A would get basic facts out.maybe they are waiting for funding assurance this time.
I have no idea of the answer to those statistical questions, but I do know they will be recruiting experts in GWAS analysis to be part of the team if it gets funded.I suspect that part of the issues will come with the algorithms used to look at the data.
In the context of a lot of data analytics techniques using an L2 norm (least squares) are quite sensitive to mislabelled data (i.e. labelled ME but not). But other techniques (L1 - median and MAD) are more to robust. But I'm not sure if this is relevant here.
Looking at the Manhattan graph which I think uses a p test to derive the importance of the features. I guess if the sample is from multiple different diseases then there may be lower thresholds that apply and also more things showing up above the threshold. I'm assuming that this produces much smaller set of features that can then be clustered (or visualized with things like PCA or T-SNE) and compared with information from the patients around diagnostic info to see if there are subsets etc. The question here is around further analysis and whether this can be sufficient to pick out interesting data?
GWAS need very large samples if they are to discover the very small effects that influence the likes of height and Type II diabetes. Even a study with 2,000 patients is now considered small and so studies typically use at least 10,000 patients to generate robust results. Studies of diseases include similar numbers of healthy patients to aid comparison.
I think this is an issue that this study will possibly answer, and that doing it in the way proposed is the only sensible way to go at the moment. If/when we get 10k samples analysed in this way, the results surely would go a long way to either proving what we suspect, that what is currently called ME/CFS is a collection of illnesses or a collection of sub-groups, or showing that, despite the disparity of symptoms that many of us have, the majority seem to have the same illness/there aren't many sub-groups.Many thanks for another excellent article @Simon M.
I’m not well enough to read all the replies at the moment so apologies is this point has already been made. My only concern about the proposed study (which is the same point I made in response to MEGA) is about the number of samples that would be needed to identify abnormalities if it transpired that ME/CFS is not one illness but a number of different illnesses with different causal pathways.
Simon writes:
If 10,000 patients would be required to generate robust results for very precisely defined conditions or characteristics such as diabetes or height, then would it not be necessary to use a far great sample size if it transpired that ME/CFS included more than one illness? Say, for example, that 10% of people with ME have a different illness with a different cause and mechanism, could this be detected in a 10,000 ME/CFS sample size or would it be necessary to have 10,000 samples of this subgroup and therefore a total ME/CFS sample size of 100,000 in order to identify the abnormal SNPs in this subgroup?
I don’t mean this to be a criticism of the proposal – I am very encouraged by it – I am just interested to understand, and to ensure that the possibility of there being subgroups with different illnesses has not been overlooked.
Presumably another possibility would be that ME/CFS is one illness with a common causal pathway but that there are no significant predisposing genetic factors – ie some type of as yet unidentified pathogen. (Or that this may be the case for a subgroup.) Whatever the truth, it would seem that the proposed GWAS can only help to further our understanding – provided it is overseen by the right people, which seems probable with Chris Ponting behind it.If/when we get 10k samples analysed in this way, the results surely would go a long way to either proving what we suspect, that what is currently called ME/CFS is a collection of illnesses or a collection of sub-groups, or showing that, despite the disparity of symptoms that many of us have, the majority seem to have the same illness/there aren't many sub-groups.
If 10,000 patients would be required to generate robust results for very precisely defined conditions or characteristics such as diabetes or height, then would it not be necessary to use a far great sample size if it transpired that ME/CFS included more than one illness? Say, for example, that 10% of people with ME have a different illness with a different cause and mechanism, could this be detected in a 10,000 ME/CFS sample size or would it be necessary to have 10,000 samples of this subgroup and therefore a total ME/CFS sample size of 100,000 in order to identify the abnormal SNPs in this subgroup?
There are not many "very precisely defined" diseases. Type II diabetes, for instance, is not a single entity but several conditions, and that is actually the norm.If 10,000 patients would be required to generate robust results for very precisely defined conditions or characteristics such as diabetes or height, then would it not be necessary to use a far great sample size if it transpired that ME/CFS included more than one illness? Say, for example, that 10% of people with ME have a different illness with a different cause and mechanism, could this be detected in a 10,000 ME/CFS sample size or would it be necessary to have 10,000 samples of this subgroup and therefore a total ME/CFS sample size of 100,000 in order to identify the abnormal SNPs in this subgroup?
That's a good point. As I said earlier, no one knows the correct size until the first GWAS has been done and that is what we are here. Bigger studies will find more, but the MRC (and NIHR who might co-fund this) won't be keen to fund to too a big study for an initial result.So we will have either fairly clear results, indicating that sample size was enough and there isn't actually much variation in us all, or the results are muddled, which then indicates the need for a follow-on study with a much larger sample size. As we've seen recently with Karl Morten's application for funds, the MRC aren't keen on what they consider to be overly ambitious projects, so I think that trying for a sample size which is generally reckoned to be normally sufficient is the best way to go.
I think that is the idea. Or rather than subgroups, it might deliver signs of very different causes: perhaps some to do with the immune system and others to do with the nervous system. Or maybe even something else. And it will be possible to see if the results map to different case definitions.A variation of the question above: if enough patients take part, would it be possible to recognize several subgroups from the data?
This is worth exploring. Let's asssume that the single cause is a specific pathogen. Even here, you would expect to find some genetic signals e.g. relating to antiviral parts of the immune system if it was a specific virus. In fact, such a finding would point towards a pathogen as a/the cause.Presumably another possibility would be that ME/CFS is one illness with a common causal pathway but that there are no significant predisposing genetic factors – ie some type of as yet unidentified pathogen. (Or that this may be the case for a subgroup.)
This is very interesting and not something I had appreciated or understand. If GWAS had been available before the discovery of, say, HIV, how would GWAS have helped to identify it? My understanding is that, with a tiny number of exceptions, without treatment, anyone who is infected with HIV will eventually develop AIDS, regardless of their genes. Presumably there may be SNPs which might be helpful in identifying genes which affect behavioural characteristics which might increase the chances of being infected with HIV but I don’t understand how they could have helped to identify HIV.This is worth exploring. Let's asssume that the single cause is a specific pathogen. Even here, you would expect to find some genetic signals e.g. relating to antiviral parts of the immune system if it was a specific virus. In fact, such a finding would point towards a pathogen as a/the cause.
That is how a GWAS works: if there is an environmental factor like an infection it is likely to throw up genetic differences that affects the ability of the body to respond to that infection.
I would agree with that because that's what's needed to make progress and to develop treatments to help people with ME. I would also argue that is exactly whywe need a GWAS, because it is well designed to help identify the causes of the illness. @Jonathan Edwards put it very well here (taken from my blog):What we desperately need is basic research that ME has been deprived of for decades
I would say over the last decade the standout findings are rituximab, T cell clonal expansion and the Nanoneedle salt stress test.If you go fishing, you go to the spot where you're most likely to catch. IMO, all the people spending time on this could be putting their efforts into more fruitful areas of research.
Mark Davis is continuing to work on T cell clonal expansion
Here is an example to describe what @Simon M is saying. Some folks believe mutations in the gene MBL2 play a role in making folks susceptible to ME. However this is quite a common mutation. Without a good quality study with enough participants to rule out statistical effects we do not know if this gene mutation really is more prevalent in the ME population.That is how a GWAS works: if there is an environmental factor like an infection it is likely to throw up genetic differences that affects the ability of the body to respond to that infection.
https://ghr.nlm.nih.gov/gene/MBL2#conditionsThe change of a single amino acid in the mannose-binding lectin subunit eliminates its ability to assemble into the functional mannose-binding lectin. Similarly, certain mutations in the promoter region of the MBL2 gene reduce production of the mannose-binding lectin subunit, leading to a decreased number of subunits available for protein assembly and a reduction in the amount of functional protein. With decreased levels of mannose-binding lectin, the body does not recognize and fight foreign invaders efficiently. Consequently, infections can be more common in people with this condition.
[missed this para out originally]
"Most genetic variants identified from GWAS studies have teeny-tiny influences on the disease, and their function is unknown."
As you say, the individual genetic variants have small effect on disease, but as any example or other pathogen I gave above (quoted here) those small effects can identify important causes