Don't miss our weekly PhD newsletter | Sign up now Don't miss our weekly PhD newsletter | Sign up now

  Deep learning analysis of large-scale sequence data in health and disease


   School of Pharmacy

This project is no longer listed on FindAPhD.com and may not be available.

Click here to search FindAPhD.com for PhD studentship opportunities
  Dr Barry Devereux, Prof Michael Tunney  No more applications being accepted  Competition Funded PhD Project (Students Worldwide)

About the Project

As is the case in many other information processing domains, deep learning frameworks are emerging as powerful and highly effective tool in Bioinformatics and health analytics, including in the analysis of genetics, proteomics and clinical text data. Key to the success of deep learning solutions is their ability to extract complex problem-relevant features from raw input representations. Often, these input representations are sequential in nature (e.g. the representation of a DNA sequence, a string representation of a protein’s amino acid sequence, and unstructured textual data relating to clinical health records or biomedical literature).

Recent research in the domain of Natural Language Processing (NLP) has demonstrated the power of large scale artificial neural networks to represent, process and extract information from text data. In particular, pre-training has emerged as an effective strategy for training such networks – conceptually, pretraining involves training a model using a very large amount of unlabelled data and a very general task, before fine-tuning that model on a more specific task.

In this project, we will adapt NLP deep learning models to the task of extracting useful information from heterogeneous sources of sequence data relating to health and disease. Although highly successful in NLP, there has been little work to date applying the large-scale sequence processing and pretraining framework to other kinds of sequence structure, including the various kinds of sequence structure found in pharmacology, biochemistry, genetics and proteomics. Our goal is to develop deep learning solutions that allow us to better understand and represent these kinds of data, and the clinically relevant relationships that can exist between them.


Funding Notes

Applicants should have a 1st or 2.1 honours degree (or equivalent) in a relevant subject. Relevant subjects include Computer Science, Bioinformatics, Pharmacy, Molecular Biology, Pharmaceutical Sciences, Biochemistry, Biological/Biomedical Sciences, Chemistry, Engineering, or a closely related discipline. Students who have a 2.2 honours degree and a Master’s degree may also be considered, but the School reserves the right to interview only those applicants who have demonstrated high academic attainment. “The studentship will be funded by the Department for the Economy. Please read information on eligibility criteria: https://www.economy-ni.gov.uk/publications/student-finance-postgraduate-studentships-terms-and-conditions. However, there may be flexibility to fund a small number of exceptional International applicants”.