Don't miss our weekly PhD newsletter | Sign up now Don't miss our weekly PhD newsletter | Sign up now

  Conditional false discovery rates in high dimensional data sets


   MRC Biostatistics Unit

This project is no longer listed on FindAPhD.com and may not be available.

Click here to search FindAPhD.com for PhD studentship opportunities
  Dr C Wallace  No more applications being accepted  Competition Funded PhD Project (Students Worldwide)

About the Project

One of the challenges in high-dimensional data analysis is the number of parallel hypotheses that are simultaneously tested. Methods to control the number of false discoveries, such as Benjamini-Hochberg, have been widely adopted as a pragmatic solution that avoids the considerable loss of power that can be induced by attempting to control family-wise error rates.

Often, multiple datasets exist which explore the same dimensions in different contexts, such as testing the same genes for association with different diseases. When the diseases are related, information provided by one study can be used to increase power in the other. Conditional false discovery rates, which exploit empirical Bayes ideas, are an intuitive and powerful method to do this, by defining variable thresholds for significance whilst maintaining strong control of false discovery rates (FDR). By using cumulative density functions, they are more robust to sampling variation in lower-powered and lower-dimensional datasets than methods to modulate FDR which model the probability density functions. However, as currently defined, they can incorporate information from only a single quantitative covariate. There is considerable scope for their extension to multiple and/or ordered covariates, or even high-dimensional covariates through dimensionality-reducing mapping functions.

This PhD offers the chance to develop these extensions, and apply them in a range of areas in high-dimensional omics data, for example to the detection of interactions in high-dimensional data, or through developing custom mapping functions to detecting rare variants that cause rare, extreme forms of autoimmunity, building on information available from their common disease counterparts already studied.

More about our group: http://chr1swallace.github.io

Funding Notes

The MRC Biostatistics Unit offers at least 6 fulltime PhDs funded by the Medical Research Council or NIHR for commencement in April 2019 or October 2019.

Academic and Residence eligibility criteria apply.

More details are available at
(https://www.mrc-bsu.cam.ac.uk/training/phd/ )

In order to be formally considered all applicants must also complete a University of Cambridge application form- full details can be found here (https://www.mrc-bsu.cam.ac.uk/training/phd/ )

However informal enquiries are welcome to [Email Address Removed]

Projects will remain open until the studentships are filled but priority will be given to applications received by the 3rd January 2019

References

Liley J, Wallace C (2015) A Pleiotropy-Informed Bayesian False Discovery Rate Adapted to a Shared Control Design Finds New Disease Associations From GWAS Summary Statistics. PLOS Genetics 11(2): e1004926. https://doi.org/10.1371/journal.pgen.1004926