I just had an idea (apologies
@Andy @Chris Ponting if you have already discussed this).
With the questionnaires that people will be asked to fill out, what if you had a set of 'compulsory' questions that are necessary for the basic participation in the GWAS study (e.g. the symptom questions needed for the diagnostic algorithm, and basic patient data that you want to gather), and then also an additional set of 'optional' questionnaires that patients who have the energy to do so are able to fill in.
I am just thinking that, with the sample size aimed for, this could be an amazing opportunity to gather masses of data from a huge number of patients which might help with phenotyping, identifying subsets, understanding disease progression, patterns in patient histories, quality of life, functional impairment, etc. etc.
This could be helpful e.g. to look for correlations with any genetic findings that come from the GWAS, but also even if this study doesn't have the resources to analyse the extra data, if participants consented to it being held in a 'data bank' that other researchers or future studies could use, this could be a cheap way to gather data on thousands of ME patients for use in future studies which often suffer from small sample sizes.
If the online questionnaires were hosted on a 'participant portal' that patients could log in to and fill out section by section over time, they could potentially have, say, a year to fill out all the questionnaires, allowing them to pace themselves, thus maybe making it more accessible for more severe patients to provide additional optional data if they desire.
E.g., to register with the study, patients have to fill out and submit the 'core compulsory' questionnaires. If they are accepted, they send in their saliva sample. Then, while data collection/analysis is going on for the remaining patients, those who want to could log in to the participant portal to fill out additional optional questionnaires, log their disease history, etc. at a pace that suits them.
If this additional data is useful for this study and there are resources to include it in the analysis, then this could happen in the latter stages of the 4 year study duration, once people have had time to fill out as much as they can.
If there is no need for the additional data in this particular study, or no resources to analyse it, then it could be stored in a 'data bank' linked to the DNA samples (and blood samples for CURE ME Biobank participants), using a participant identifier code.
Researchers could then apply to access the data, like they do to the CURE ME Biobank. If participatlnt identifier codes are used, researchers could apply for DNA sequences, questionnaire data, and blood samples (for ME/CFS Biobank participants), or any combination of the three. This could help with the historical problem we have of studies which only have 30 participants, because the researchers couldn't get the funding to collect data and samples from more people.
What do people think about this? Would this be feasible
@Andy @Chris Ponting ?
There would obviously be some resource costs involved in secure storage of the data, reviewing applications from researchers who want to use it, and administering release of the data to approved researchers, but perhaps this could be tied in somehow with existing Biobank infrastructure?
Since it would be digital data, the costs associated with gathering, storing and releasing it would be much smaller than if it were additional biological samples.