Yeah, mine has always said "Disability". In a video about Trends, they seem to imply that any word under the term other than "Search term" means a Topic.
Interesting that your R2 is higher.
Just to clarify, the trends page you're using says "CFS/ME" and not "Myalgic encephalomyelitis/chronic fatigue syndrome"? Can you link to that page?
And you're doing a correlation against the percentages from the World Population Review website?
Yeah, mine has always said "Disability". In a video about Trends, they seem to imply that any word under the term other than "Search term" means a Topic.
Interesting that your R2 is higher.
Just to clarify, the trends page you're using says "CFS/ME" and not "Myalgic encephalomyelitis/chronic fatigue syndrome"? Can you link to that page?
And you're doing a correlation against the percentages from the World Population Review website?
Will get you that information in a bit;
Meantime I've also found the Australian census has extremely detailed ancestry and place of birth data, combined with some health data. It's not going to be definitive because mecfs is not listed. But I will pre-specify the hypothesis I will set out to test.
If there is a stronger association between English heritage and "other health conditions" than between other heritages and "other health conditions". Putting extra weight on association among females.
And using association with other health conditions, e.g. stroke diabetes mental health, as a baseline.
And after doing any coldly rational bonferroni adjustments I will also probably do some highly motivated data mining!
So then I got a bit systematic, and looked to see what really was most associated with English heritage, controlling for age. Turns out it is early onset arthritis. Also early onset mental health issues. (English heritage is strongly negatively correlated with kidney disease, apparently).
It is fascinating and it certainly seems to stand up the idea that people from certain backgrounds can be more and less prone to certain diseases but it doesn't shed light on me/cfs directly nor does it say why. So I think it is worth leaving there - unless anyone has any hypotheses they would like me to dig into? Feel free to ask!
I think what could be interesting is looking for the best correlations of ME/CFS searches with other searches. For example, if the states that search most for ME/CFS also search most for "mono".
There's no publicly available API for Google Trends, though, so data would have to be downloaded one at a time through the browser. The idea I had was to start by looking at the correlation of ME/CFS searches with searches for 10 random words. For the most correlated search term, find 10 concepts related to that search term, with something like a thesaurus, and test correlation with those to see if any are even better. And keep iterating.
But I think without being able to automate doing that for thousands of search terms, it might be too slow to be fruitful. But maybe still worth testing with some hand selected terms.
I managed to do this, using gtrendsR, as suggested by Murph. I decided to have an AI code basically the whole script, and I am pleasantly surprised by how well it works.
Essentially, the script is testing how well search interest in "chronic fatigue syndrome" correlates to search interest for other terms. I chose the specific search term "chronic fatigue syndrome" (without quotes) because I worry the ME/CFS Topic includes unrelated terms, and searches for the specific term "ME/CFS" are much less common than for "chronic fatigue syndrome" (henceforth referred to as CFS).
I decided to use the scores for metro regions instead of for states because there are more of them (~210 vs 51), making the statistics more precise, and allows comparing a larger variety of regions.
The algorithm used is as follows:
I have a huge list of words as "seeds". The script starts with a seed word, say "elephant", and then tests how well search interest correlates between "elephant" and CFS.
If this initial correlation passes a lenient threshold of p<0.01, then the script identifies 10 related search terms. Google Trends helpfully provides related terms, which are accessible in the gtrendsR results.
For example, for "elephant", here are the first 3 related terms it returns: "white elephant", "baby elephant", "drunk elephant".
Then the script tests correlation of CFS with each of those terms, and identifies the most significant correlation with CFS out of these 10 terms (let's say "baby elephant"), and if that term is also more significant than the "parent" term ("elephant"), then 10 more related terms are retrieved that are related to this new term.
For example, related terms for "baby elephant" include: "baby elephant baby shower", "baby elephants", "baby elephant walk"
Then the script tests correlation of CFS with these 10 new terms, and on and on, until none of the 10 new terms is more significant than its parent term.
Then the script goes on to the next seed word and starts all over. The correlation statistics for each term are saved to a file.
For "uveitis", none of the 10 related terms was more significantly correlated than "uveitis" to CFS, so it stopped there. For "kidney infection", the related term "kidney infection back pain" was even more significant, so it identified the related terms to this new term, of which there was only one, and for that term, there was too little data to run the correlation test.
It takes about 2 to 5 seconds per term, and if retrieving too many results too fast, it starts giving errors.
So far, I've tested the correlation of CFS with around 4800 other terms. I've used some seed words that are totally random words, some that are diseases, and some that are body parts.
Here are the top 50 most significant correlations so far. (I'll attach a file with the full results.)
Search depth of 0 indicates that this was a seed term. 1 means it was a related term of the seed word, and so on.
Notably the correlation of CFS with itself is not 1. This is because the search score data isn't the same every time, as previously discussed. But reassuringly, searches for fibromyalgia are most significantly correlated with CFS, which makes sense.
Among the top correlations are the very confusing "dog food recall", "food recall", and "cross stitch pattern".
Some of the diseases highly correlated in search interest to CFS are "ocular migraine", "plantar warts", "Raynaud's", and "tendonitis".
It is interesting that multiple sclerosis is highly correlated to CFS here, but was not when I tested previously. I think that is because I previously tested the ME/CFS "Topic" vs the multiple sclerosis "Topic". We know that the Multiple Sclerosis Topic includes the short abbreviation "MS", since Mississippi (abbreviation MS) has extremely high scores, so it's possible that searches for "MS" make the Topic scores less true to searches specifically about multiple sclerosis.
Here are plots for a couple of the most significant terms. (Note that the data in the plots may not be identical to that used in the initial correlations because they are based on re-downloaded data.)
Some potential other things to try:
Use states or countries instead of metros.
Set target term to "chronic fatigue" or the ME/CFS Topic.
Test correlations using Spearman's rho instead of Pearson.
I uploaded the script to GitHub if anyone else wants to try experimenting with it.
Now just need to figure out what ME/CFS, migraines, and cross stitching have in common...
Edit: The time span used for all trends data was 2004-01-01 to 2026-03-24.
The correlation of English ancestry with early onset arthritis in Murph's Australia census analysis, and the many diseases highly correlated to ME/CFS in my USA Google Trends analysis made me wonder if places with British people have higher rates of many diseases, whether through genetic or diagnostic reasons, and ME/CFS is just one among many.
I tested correlation of search interest in various diseases by state against English and Scottish ancestry, as we did before for ME/CFS and ancestry. I picked a few diseases at random, some diseases that were highly correlated to ME/CFS searches above, and arthritis ("early onset arthritis" doesn't have enough data in Google Trends).
And yet, ME/CFS search interest still has the top correlation out of all these diseases, with both ancestries. It's possible there are other diseases that might do even better, so if anyone has any suggestions, I can test them.
* Capitalized disease names are Google Trends "Topics" that include related terms, and lower case names are just for the specific search term.
Edit: Updated to include chronic fatigue, chronic pain, and fibromyalgia. "chronic fatigue" is a higher correlation than ME/CFS for English ancestry.
Edit: We can see here how much the Google Trends numbers change when re-downloading the same data at a different time. R^2 for the same Scottish ancestry data correlated against "chronic fatigue" was previously 0.60, but here, with newly downloaded chronic fatigue data, it was 0.46.
Relatively low correlations of Scottish or English ancestry with these four search terms:
Note that the data for "type 1 narcolepsy" and "type 2 narcolepsy" might not be very reliable, because there looks to be very little search volume for these, and unlike most searches, states only have one of three values for these two searches: 100, <1, and No Data. I replaced <1 with 0 in the analysis.
Here for example is "type 2 narcolepsy" vs Scottish ancestry:
Alpha-1 antitrypsin deficiency is apparently more common in Ireland (link); cystic fibrosis may be as well (link). And at least two epidemiological studies of MS suggest a high prevalence in Scotland (link).
I tested the above suggestions, alongside ME/CFS, "tired", "always tired", and "fatigue" as well, for English, Scottish, and Irish ancestry.
I sorted by Pearson, but also tested Spearman to see if large outliers are masking an association (which seems to be the case for Mississippi being an outlier for the Multiple sclerosis Topic, making MS the highest Spearman correlation with Irish ancestry).
These are all newly downloaded Trends scores, so they may not match exactly to previous results.
Also note that there is very little variety in search scores for the "haemochromatosis" spelling (the only scores are either 100, 50, or <1), so that one is probably not very reliable.
And again, the terms that are capitalized are "Topics", so they contain multiple related terms.
So at least some of these seem to produce the expected strong correlations, especially the Irish/hemochromatosis relationship.
It's interesting that "chronic fatigue" has a large correlation with English and Scottish, but "tired" and "always tired" do not. Maybe suggests that it is more about the language used in states that have higher British ancestries?
Something weird seems to be going on with West Virginia (WV). I keep seeing WV listed as the number one ranked state by search interest, or at least highly ranked, for various diseases.
To examine this more systematically, I downloaded the Google Trends state scores for the 28 common conditions listed under A, B, and C on the NHS website. And I also downloaded the first 30 words listed in a file of common English words, like "about", "business", and "click".
Here is WV's ranking plotted for search interest for each of the common words. WV is in red, and dots at the top are ranked higher (more searches for this term). The ranking of WV looks more or less random for these common words.
On the other hand, this is WV's search interest among all states for 28 common health conditions:
Is West Virginia a particularly unhealthy state? Why are they searching so much for all sorts of diseases?
* Note: For terms where WV was tied with another state, the order of the tied states in the plot is arbitrary. (E.g. If WV was tied with 2 other states for rank 1, then it could be at position 1, 2, or 3.)
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.