Uncovering the genetic architecture of ME/CFS: a precision approach reveals impact of rare monogenic variation, 2025, Birch, Younger et al

As I think someone else may have said earlier, if there is another feasible explanation for all a person’s symptoms then the ME/CFS diagnosis is not appropriate to give. In which case their appearance in your cohort is a case of actual misdiagnosis, not evidence that ME/CFS is a convergent disease presentation from lots of different genetic abnormalities.

A very common mistake, in my view:

“If ME/CFS cannot include multiple mechanisms, then cases with other (underreported, as of yet unidentified) mechanisms are misdiagnosed.
Since those cases are misdiagnosed, ME/CFS cannot include multiple mechanisms."

That's circular reasoning, I believe.

If ME/CFS is heterogeneous, which seems plausible even if there is downstream convergence at some point, then progressively identifying subgroups will naturally narrow the residual ME/CFS category.

That process alone does not demonstrate that ME/CFS is not heterogeneous; it mainly reflects our current uncertainty (and in this case bias) about boundaries and mechanisms. We simply don't know enough.

I think Liz’s work and its general direction make a lot of sense and are clearly worth pursuing further.
 
The impact of a mitochondrial trifunctional protein variant isn’t unknown or guessed at though; it’s very well studied and classically presents with exercise intolerance and early fatigability. That’s not the same as PEM, but most reported patients are pediatric; so I have asked myself whether a baby or child would ever get a PEM diagnosis; I don’t know. Would love experts thoughts here.
Yes, that’s possible if the physician is familiar with ME/CFS. Kristian Sommerfelt (@Bivox ) works a lot with children, you can find talks he has done on youtube.
 
A very common mistake, in my view:

“If ME/CFS cannot include multiple mechanisms, then cases with other (underreported, as of yet unidentified) mechanisms are misdiagnosed.
Since those cases are misdiagnosed, ME/CFS cannot include multiple mechanisms."

That's circular reasoning, I believe.

If ME/CFS is heterogeneous, which seems plausible even if there is downstream convergence at some point, then progressively identifying subgroups will naturally narrow the residual ME/CFS category.

That process alone does not demonstrate that ME/CFS is not heterogeneous; it mainly reflects our current uncertainty (and in this case bias) about boundaries and mechanisms. We simply don't know enough.

I think Liz’s work and its general direction make a lot of sense and are clearly worth pursuing further.
It’s something that is necessary in research by virtue of not knowing the mechanism of ME/CFS and not having any objective measure to test for. It is entirely possible that someone could end up with the entity known as ME/CFS by multiple mechanisms. That being said, if someone has a genetic disorder that is consistent with all of their symptoms that one might’ve attributed to ME/CFS (and, frankly, several symptoms that are quite inconsistent with ME/CFS) then it is prudent to exclude them from your case label because there is a much larger possibility that their biology is actually different from everyone else and they might not exhibit the phenomenon you’re studying even though it is a real phenomenon.

There is just no way at an early stage in disease research to both avoid excluding anyone who might have your disease of interest and have decent chances of actually getting a positive finding—that’s why researchers try to get as homogenous a cohort as possible, design experiments so that the effect size might be big enough to overcome some outliers, and then once that is established see if the mechanism is consistent in more heterogenous cases.
 
That process alone does not demonstrate that ME/CFS is not heterogeneous; it mainly reflects our current uncertainty (and in this case bias) about boundaries and mechanisms. We simply don't know enough.

Yes, I think your basic point is right here @butter. An ME/CFS diagnosis precludes there being another explanation for symptoms. But if three different gene variants are found in people diagnosed with ME/CFS on the basis of current knowledge and it then becomes convincing that each of the three gene variants leads to the same symptoms through different routes then these are valid 'causes of ME/CFS' (or at least causal components) until a decision is made to subcategorise.

More or less this has happened in autoimmune disease with the discovery a much wider range of autoantibodies associated with myositis and scleroderma-like syndromes. In many cases once the antibody is known clinicians realise that they can separate patients clinically but it i s not always true. Scleroderma remains the common name for three separate diseases caused by three separate antibodies.
 
Yes, that’s possible if the physician is familiar with ME/CFS. Kristian Sommerfelt (@Bivox ) works a lot with children, you can find talks he has done on youtube.
Ahh great. The key notion being "is familiar with". Will watch his youtube videos and try to work out how to get a good answer to this from physicians less familiar.. I wonder if we could look in the EHRs and see what the variability is across providers for this type of things. It might help broadly re who does and does not get a PEM entry in their records... Thanks!
 
Yes, I think your basic point is right here @butter. An ME/CFS diagnosis precludes there being another explanation for symptoms. But if three different gene variants are found in people diagnosed with ME/CFS on the basis of current knowledge and it then becomes convincing that each of the three gene variants leads to the same symptoms through different routes then these are valid 'causes of ME/CFS' (or at least causal components) until a decision is made to subcategorise.

More or less this has happened in autoimmune disease with the discovery a much wider range of autoantibodies associated with myositis and scleroderma-like syndromes. In many cases once the antibody is known clinicians realise that they can separate patients clinically but it i s not always true. Scleroderma remains the common name for three separate diseases caused by three separate antibodies.
Really like this way of thinking about this @Jonathan Edwards. A back of the envelop for the p.Glu510Gln HADHA variant - so far we have it in two unrelated participants meeting all the criteria for ME/CFS out of 40 unrelated individuals or so. Out of 1,612,330 unique samples in GnomAD this variant is seen 2,162 times as a heterozygote (just one copy of the gene affected in each person) with no homozygotes (would be both copies affected). The frequency is fairly consistent across all the ancestry groups they delineate, but higher in Finnish Europeans, so ancestry will be important to consider so as not to make the mistake of thinking there is significance if everyone with it happens to be northern European :). With 100 people in the cohort, you’d need ≥4 people with the variant for it to start becoming a bit interesting (background carrier probability calculation).
 
A very common mistake, in my view:

“If ME/CFS cannot include multiple mechanisms, then cases with other (underreported, as of yet unidentified) mechanisms are misdiagnosed.
Since those cases are misdiagnosed, ME/CFS cannot include multiple mechanisms."

That's circular reasoning, I believe.

If ME/CFS is heterogeneous, which seems plausible even if there is downstream convergence at some point, then progressively identifying subgroups will naturally narrow the residual ME/CFS category.

That process alone does not demonstrate that ME/CFS is not heterogeneous; it mainly reflects our current uncertainty (and in this case bias) about boundaries and mechanisms. We simply don't know enough.

I think Liz’s work and its general direction make a lot of sense and are clearly worth pursuing further.
Love the way you phrased it. Going to steal that verbiage if ok with you? We do think of it as trying to narrow down leaving a residual category step by step.
 
This was carefully thought about: when both copies are altered you can see a pediatric-lethal disorder that doesn’t resemble ME/CFS (lethargy, hypoglycemia, hypotonia, liver and heart problems, coma, sudden death); when both are altered but with different variants, you can see later-onset, often milder disease with fewer catastrophic features but with exercise intolerance/early fatigability, chronic fatigue, weakness, peripheral neuropathy, and late-onset cardiomyopathy.

When only one copy is altered, physiologic changes; and in some case reports, symptoms, have been seen; these may well reflect an unrecognized “second hit,” but heterozygous mouse models also show liver long-chain fatty-acid abnormalities, oxidative-stress differences versus wild type, and other downstream effects.

Triggers for acute episodes are well known to include illness, infection, strenuous exercise, prolonged fasting, and stress.

And there are precedents in these same pathways: disorders with early-onset recessive forms and later-onset, milder dominant forms that were only recognized much later because everyone was looking for the catastrophic childhood phenotype. It's seen in other mitochondrial diseases too; LHON-Plus.
So my main point is still a concern here: that n-of-1 approaches will always end up quite biased towards overestimating the relevance of specific variants for symptoms that might be better explained otherwise. It bears asking what "well-supported" late-onset symptoms of a heterozygous variant actually means. As you say, for genotypes that don't present severely and with early onset within families, what we have as evidence of pathogenicitiy is most often a handful of case studies--which specifically become case studies because there is something of note about them. So in each of those instances (correct me if I'm wrong) we don't have information on what proportion of individuals have that potentially pathogenic heterozygous variant and are completely fine (but presumably it is a higher proportion given that the case studies are explicitly presented as unusual). Heterozygous mouse models, similarly, largely come from uniform genetic backgrounds that might cause those symptoms to appear when they almost never would in a human.

Which then begs the question of how we can be even reasonably confident that the variant is actually relevant to those instances of late onset milder symptoms. If we had a large cross sectional study tracking people with that variant and saw that a similar phenotype tends to come up quite often, then I would be more confident (and please let me know if this is where your information is actually coming and was just not cited for many of the variants in that table). But when the literature is only a handful of cases, as you say, these may well reflect the effect of another factor--that could be an unknown"second hit" working synergistically with the original variant, or it could be that we're just highlighting two or three people who happened to develop a chronic illness explaining the symptoms (and the only reason it got documented is because someone was already interested in the variant they have). In which case, this might just be a repeated practice of comparing individuals who happened to have a variant and unrelatedly developed something like ME/CFS in the literature (which is classified as "well-supported" evidence of pathogenicity) to someone who happened to have a variant and unrelatedly developed ME/CFS in your cohort.

It's entirely possible that the variant of interest actually predisposes to developing a chronic disease like ME/CFS, and that would definitely be interesting information. But in order to determine that you need a big cohort study--going about that at an n-of-1 level is always just going to be an unsubstantiated guess about pathology. The variants that are relevant for a given person might not be the ones that actually get pulled out by the n-of-1 analysis. So trying to use an n-of-1 strategy to narrow down subgroups would not actually be giving you the relevant subgroups.

If any part of this logic is wildly off base or missing critical context please let me know.
 
Just getting back to this, but I don’t have access to the article.

How were the rare variants discovered?
Honestly, my statement was just based on the abstracts (there's two links btw). It was just to make the point that researchers can and do talk about mixtures of different rare and common variants as being causal for the same disease.

Edit: So in answer to your question, I'm not sure of the specifics.
 
Honestly, my statement was just based on the abstracts (there's two links btw). It was just to make the point that researchers can and do talk about mixtures of different rare and common variants as being causal for the same disease.
I see your point here, but I think the key difference re: this paper is the difference between pointing to a rare variant as a potential causal factor predisposing to falling into a chronic disease feedback loop and pointing to a rare variant to claim that the person actually has a rare genetic disorder (and that there might be no unifying disease process known as ME/CFS). If someone is claiming the latter, that's when the question of misdiagnosis does become valid
 
Honestly, my statement was just based on the abstracts (there's two links btw). It was just to make the point that researchers can and do talk about mixtures of different rare and common variants as being causal for the same disease.
Oh, I clicked on the first every time so I didn’t see the second.

The AD one doesn’t say, but the liver cirrhosis is based on massive cohorts so you actually know that the rare variants are over-represented in the cases.

If you have that validation, I completely agree that the genes can be very relevant for figuring how the disease works.

Jnmaciuch put the rest better than me.
 
If any part of this logic is wildly off base or missing critical context please let me know.

I am not sure that anyone is disputing the logic here.
My thought is that if a rare gene variant has very strong evidence for pathogenicity - as in a multicase family with a Mendelian disease expression - or points very strongly to a very specific testable model, then it may take us a step forward on its own. The difficulty with ME/CFS of course is that we do not have pathology to slot a model in to, so we need to find evidence of a pathological pathway too. Otherwise rare genes of the sort picked out here are going to be candidates for reasonably large statistical replication studies as you say, with the plus of a smaller correction for multiple analyses.
 
I see your point here, but I think the key difference re: this paper is the difference between pointing to a rare variant as a potential causal factor predisposing to falling into a chronic disease feedback loop and pointing to a rare variant to claim that the person actually has a rare genetic disorder (and that there might be no unifying disease process known as ME/CFS). If someone is claiming the latter, that's when the question of misdiagnosis does become valid
Oh yeah, none of my posts were specifically about the method in the present paper. Just pushing back specifically on saying someone was probably misdiagnosed if we did find a robust rare variant connection.

Edit: Which I am now thinking might not have been Utsikt's point anyway.
 
Last edited:
My thought is that if a rare gene variant has very strong evidence for pathogenicity - as in a multicase family with a Mendelian disease expression - or points very strongly to a very specific testable model, then it may take us a step forward on its own
So I guess my concern is that I don't see that level of strong evidence for most of the variants pulled out in this study. The evidence is those handful of case studies but the conclusion is that the relevance of the results is akin to something with strong evidence

[edit: and even in this example it would be a matter of using the variant to get information on the unifying disease process of ME/CFS, not relying on variants to narrow down residual heterogeneity as described in the absence of a known mechanism]
 
Last edited:
So my main point is still a concern here: that n-of-1 approaches will always end up quite biased towards overestimating the relevance of specific variants for symptoms that might be better explained otherwise. It bears asking what "well-supported" late-onset symptoms of a heterozygous variant actually means. As you say, for genotypes that don't present severely and with early onset within families, what we have as evidence of pathogenicitiy is most often a handful of case studies--which specifically become case studies because there is something of note about them. So in each of those instances (correct me if I'm wrong) we don't have information on what proportion of individuals have that potentially pathogenic heterozygous variant and are completely fine (but presumably it is a higher proportion given that the case studies are explicitly presented as unusual). Heterozygous mouse models, similarly, largely come from uniform genetic backgrounds that might cause those symptoms to appear when they almost never would in a human.

Which then begs the question of how we can be even reasonably confident that the variant is actually relevant to those instances of late onset milder symptoms. If we had a large cross sectional study tracking people with that variant and saw that a similar phenotype tends to come up quite often, then I would be more confident (and please let me know if this is where your information is actually coming and was just not cited for many of the variants in that table). But when the literature is only a handful of cases, as you say, these may well reflect the effect of another factor--that could be an unknown"second hit" working synergistically with the original variant, or it could be that we're just highlighting two or three people who happened to develop a chronic illness explaining the symptoms (and the only reason it got documented is because someone was already interested in the variant they have). In which case, this might just be a repeated practice of comparing individuals who happened to have a variant and unrelatedly developed something like ME/CFS in the literature (which is classified as "well-supported" evidence of pathogenicity) to someone who happened to have a variant and unrelatedly developed ME/CFS in your cohort.

It's entirely possible that the variant of interest actually predisposes to developing a chronic disease like ME/CFS, and that would definitely be interesting information. But in order to determine that you need a big cohort study--going about that at an n-of-1 level is always just going to be an unsubstantiated guess about pathology. The variants that are relevant for a given person might not be the ones that actually get pulled out by the n-of-1 analysis. So trying to use an n-of-1 strategy to narrow down subgroups would not actually be giving you the relevant subgroups.

If any part of this logic is wildly off base or missing critical context please let me know.
I'm not sure we will shift each other much off our current thinking in a slow back and forward here :). I am pretty convinced that as we said; we need large studies and consideration of the n-of-1s. The n-of-1 is not so hard these days with the tools we have and should be easily part of any project where they are doing WGS or even WES...; they truly go hand in hand IMHO.

And I really don't follow the logic on why rare variants can't be relevant. I think about Occam’s razor. Going to break this out from a big paragraph as requested :).

For HADHA: if we find 5/100 ME/CFS participants (we currently have 2 I think 40 unrelated individuals; pending confirmation of the unrelated count since we have affected family members) carrying the same rare variant, then we exceed a reasonable likelihood threshold. Not proving, but more datapoints.

Where that variant is known pathogenic with documented heterozygous effects (small number of case report/animal findings) and these confirm expressed long-chain fatty-acid abnormalities, oxidative-stress differences.

Where this variant in a homozygous state is associated with a very severe neonatal disease.

Where milder/more moderate later onset forms (often with "milder" variants) have been noted presenting with exercise intolerance/early fatigability, chronic fatigue, weakness, peripheral neuropathy.

Where the disease triggers are known to be illness, infection, strenuous exercise, prolonged fasting, stress.

Where SARS‐CoV‐2 has been shown to be a preceding event in the death of a patient with this variant (homozygous) - acute respiratory failure and refractory hypotension from severe cardiomyopathy, and a different variant (homozygous) with a second death.

Where the impacted mechanism of fatty-acid oxidation/long-chain FA metabolism, has been shown altered across a dozen or more ME/CFS/Long COVID/PASC studies.

Where progressive disease worsening has been seen in HADHA associated disease with even low amounts of exercise due to rhabdomyolysis and where rhabdomyolysis has been published as a theory related to PEM in ME/CFS.

Where chronic fatigue and exercise intolerance is a core symptom of many mitochondrial diseases, including in a number like LHON where there is an early onset disorder with varied severe symptoms beyond those two symptoms, but also a extended phenotype in the carrier parents that is often very much focused on chronic fatigue, brain fog, and exercise intolerance.

Where HADHA has been noted as harboring SNPs related to outcome of COVID severity.

Then I don't think we can say that none of that is relevant. ... Leads shouldn’t be dismissed simply because they are rare or might not be large signals in a heterogeneous cohort. This signal/a signal from the higher order program may even be present in larger datasets; we just need people to look. So we felt we needed to put some testable hypotheses out there. This is why we wrote these things:

"Our findings support the hypothesis that at least a subset of ME/CFS cases represent distinct molecular disorders that converge on shared physiological pathways." "While replication in larger, more diverse cohorts remains critical, the rarity of many variants identified suggests that population-scale validation for all may be impractical." "By integrating genomics, transcriptomics, and deep patient-informed phenotyping, we uncovered plausible, testable molecular contributors in over a third of participants." "We suggest that progress will require experimental designs that integrate individual-level genomic data with deep, participant-informed deep phenotyping, capturing the combined effects of rare and common variants and environmental modifiers on disease expression and progression." "We identified likely molecular contributors in 13 families, with impacted loci mapping to biologically plausible pathways, offering hypothesis-generating insights consistent with previously reported disease mechanisms and observed symptoms." "Moving forward, we feel that to refine mechanistic understanding, efforts may need to integrate deep, participant informed phenotyping with assessments of rare variants, polygenic risk, and environmental exposures, while also accounting for the timing of sample collection."

As you note; we’re also testing enrichment for other diseases for these variants using All of Us data though with ~600k genomes, small expected counts and often poor phenotype capture (e.g. no ICD-10 for post-exertional malaise) it's a bit trickier than it sounds, but we’re pursuing it.

I will leave it there. Will be off for a while now, but happy to connect later on in person for follow ups. And always happy for experts to point out errors; that is how we all get closer to the goal.
 
Oh yeah, none of my posts were specifically about the method in the present paper. Just pushing back specifically on saying someone was probably misdiagnosed if we did find a robust rare variant connection.

Edit: Which I am now thinking might not have been Utsikt's point anyway.
Oh yes I agree; and the point was not that there isn't a unifying mechanism, just that in some cases it can be caused by a rare variant. Totally agree.
 
Back
Top Bottom