Running FLAMES on DecodeME data

I've used the 1000G Phase3 EUR LD reference panel, will try to use the UK biobank LD panel later.
I got slightly different results using the UK biobank LD panel (UKBrelease2b10kEuropean). I specified the full rather than effective sample size (n = 275488).

FLAMES highlighted 31 instead of 32 causal genes (and VRK2 appears twice again, so it's 30 genes). The main differences are:

No longer in FLAMES with UK biobank LD FLAMES
RPP40
PLCL1
NEURL1
NEK1

New in FLAMES with UK biobank LD FLAMES
MLLT10
HTT
SSR1

It probably makes more sense to focus on the UK biobank LD panel so would focus on this list. I'll paste it below again:

ARFGEF2
CA10
UNC13C
SHISA6
SOX6
MMS22L
OLFM4
PEBP1
ZNF644
LRRC7
DCC
MLLT10
HTT
CACNA1E
VRK2
ALK
VRK2
MICALL2
KIAA1239
STT3B
VPS54
RIMS1
PTPRE
NR2F1
PTBP2
RP11-147C23.1
ADARB2
SMCHD1
SSR1
LAMA2
HABP2
 

Attachments

It probably makes more sense to focus on the UK biobank LD panel so would focus on this list. I'll paste it below again:

ARFGEF2
CA10
UNC13C
SHISA6
SOX6
MMS22L
OLFM4
PEBP1
ZNF644
LRRC7
DCC
MLLT10
HTT
CACNA1E
VRK2
ALK
VRK2
MICALL2
KIAA1239
STT3B
VPS54
RIMS1
PTPRE
NR2F1
PTBP2
RP11-147C23.1
ADARB2
SMCHD1
SSR1
LAMA2
HABP2
Nice, thanks.

When doing the enrichment again with those, only one gene set is highlighted: synapse.

Another gene set, which I guess is targets of the microRNA mir-3620, was also significant, but was not highlighted, probably due to similarity to the synapse gene set.

term_idterm_namehighlightedadjusted_p_valueterm_sizequery_sizeintersection_sizeeffective_domain_sizeintersections
GO:0045202synapseTRUE0.0159160827922155ARFGEF2,UNC13C,SHISA6,DCC,HTT,CACNA1E,VPS54,RIMS1,LAMA2
MIRNA:hsa-mir-3620hsa-mir-3620FALSE0.024084027816638ARFGEF2,SHISA6,PEBP1,MLLT10,CACNA1E,PTBP2,SMCHD1,SSR1

It seems the other synapse-related gene sets are not significant anymore mainly because they include the now removed PLCL1 and/or NEURL1.

Edit: I might have been wrong about the reason mir-3620 isn't highlighted. I think the "highlighting" is just for GO terms. It might be worth looking at this miRNA.
 
Last edited:
@forestglip, I have a question about this data.

when I click on GO:0045202, the link takes to another website and when I click on the link there to all direct and indirect annotations to synapse (excluding "regulates"), I get this:


This page says that the synapse data is from domestic cattle and domestic cats. Do we have any idea of the relevance of this information to human synapses?
 
@forestglip, I have a question about this data.

when I click on GO:0045202, the link takes to another website and when I click on the link there to all direct and indirect annotations to synapse (excluding "regulates"), I get this:


This page says that the synapse data is from domestic cattle and domestic cats. Do we have any idea of the relevance of this information to human synapses?
Good question. I don't know a lot about the gene ontology database. [Edit: For all I know, the annotations for all the different species are just the different forms of the same genes, just for a different animal. But I don't know how true that is.]

In any case, it looks like all of the DecodeME genes that showed up in the synapse gene set, and thus made it significant, are annotated under "Homo sapiens" so probably relevant to humans.

The gene set is also available on msigdb, where it says source species "Homo sapiens". If expanding the link to Show Members near the bottom, all of the following synapse DecodeME genes are there:
ARFGEF2,UNC13C,SHISA6,DCC,HTT,CACNA1E,VPS54,RIMS1,LAMA2
 
Last edited:
Thanks, quite a lot of differences unfortunately. Did it only focus on the hits above 5*10^-8?
Yes, exactly. I am not familiar with FLAMES. I used a classic approach (SusieR selects the true signal by posterior inclusion probability, PIP), no machine learning. Also, I used eQTLs from GTEx V10 to map true causal variants to genes. I wonder if this affected the results.
 
Back
Top Bottom