I wanted to dig into that chart of phenotype associations (fig. 5: depression, COVID, etc). I realized the data they used is all on
Genebass. I tried to figure out how to work with all >4500 phenotypes, but I couldn't figure out how to access the bulk data which is hosted in Google Cloud Storage. So instead I used the browser version of Genebass to download summary statistics for the few phenotypes that Zhang et al found were significant (e.g. here's the
England COVID phenotype).
I was able to download the phenotype metadata file though to be able to find the codes to look up all phenotypes on the browser tool, because searching for things like "chronic fatigue syndrome" wasn't working on the website. This file is also hosted on Google Cloud, so it's not as straightforward as downloading from a link and requires a Google Cloud account with billing set up. It's a bit bigger than the file attachment size here, so if anyone needs it, I'll figure out how to share it.
So I downloaded the data for each phenotype they labeled in figure 5A and 5B as significant, plus a few other random ones as well as "chronic fatigue syndrome". What I think they did for a given phenotype is get the SKATO P-values for every single gene and make that one set of data. Then pick out only the 115 ME/CFS genes and use the P-Values from the same dataset for only those genes to make the other group. Then do a one-sided Mann-Whitney to compare the p values of ME/CFS genes versus all genes for a given phenotype.
I did that and I got identical results (I don't know what they're showing on the x-axis, but the y-axis is what I plotted and it matches up). The red line is p=0.05, and everything above it is more significant.
I did the same thing for "P-Value Burden" and it's also identical to the significant items in figure 5B. Except for one thing: I got IBS as the most significant phenotype, and their chart didn't show IBS at all. I think maybe they cut off the top of the chart where it would have been. [Edit: or my method isn't totally identical in some way.]
Notably, CFS is far down the chart with both methods.
Edit: Note about Ranitidine.
There seem to be two different phenotypes that mention Ranitidine. The one labeled "Medication for pain relief..." on my chart that is in the same location as their "Ranitidine" appears to represent several pain killer drugs, including ranitidine. It's not totally clear from the phenotype metadata file. The one labeled "Ranitidine" on my chart is specifically ranitidine.
Edit 2: Nevermind about Ranitidine, I was looking at it wrong. They are both specifically Ranitidine, asked about using different methods. Here are what the two rows look like. "
description" and "
description_more" are italicized and "
coding description" is bolded and underlined.
- Treatment/medication code
- Code for treatment Negative codes indicate free-text entry.
-
ranitidine
- UK Biobank Assessment Centre > Verbal interview > Medications
- Medication for pain relief, constipation, heartburn
- ACE touchscreen question "Do you regularly take any of the following? (You can select more than one answer)" The following checks were performed: If code -7 was selected, then no additional choices were allowed. If code -1 was selected, then no additional choices were allowed. If code -3 was selected, then no additional choices were allowed. If the participant activated the Help button they were shown the message: Some over the counter medicines are known by other names. Please enter the corresponding name if you take any of the following REGULARLY (that is, most days of the week for the last 4 weeks): Aspirin: Alka Rapid Crystals, Alka-Seltzer XS, Anadin Extra, Anadin Original, Askit powders, Aspro Clear, Codis 500, Disprin, Disprin Extra Ibuprofen: Anadin Ultra, Anadin Ibuprofen, Cuprofen Plus, Nurofen, Solpaflex, Ibuleve Paracetamol: Anadin Extra, Hedex Extra, Panadol, Paracodol, Paramol, Solpadeine, Syndol, Veganin, Feminax, Midrid, Migraleve Codeine: Codis 500, Cuprofen Plus, Nurofen Plus, Panadol Ultra, Paracodol, Paramol, Solpadeine Max, Sopadeine Plus, Solpafelx, Syndol, Veganin, Feminax, Migraleve
-
Ranitidine (e.g. Zantac)
- UK Biobank Assessment Centre > Touchscreen > Health and medical history > Medication
Edit 3: Turns out if I convert the metadata file from tsv to xlsx, it becomes much smaller and I can attach it. So it's here in case anyone needs.