Imperial College London Featured PhD Programmes
University of Glasgow Featured PhD Programmes
Birkbeck, University of London Featured PhD Programmes
University of Kent Featured PhD Programmes
University of Reading Featured PhD Programmes

Clinical prediction under informative presence: Exploring what the patterns and timing of repeated observations in routinely collected data can tell us about disease risk.

  • Full or part time
    Dr J Barrett
    Dr B Tom
  • Application Deadline
    Tuesday, January 07, 2020
  • Competition Funded PhD Project (Students Worldwide)
    Competition Funded PhD Project (Students Worldwide)

Project Description

In an era of increasingly large data, the use of routinely collected healthcare data for medical research is becoming more common, particularly in the area of risk prediction/prognosis research. Examples include developing risk prediction tools for cardiovascular disease using primary care data(1), and the validation of risk prediction tools for chronic kidney disease using electronic health record data from Salford(2). When using routinely collected data, we should consider the reasons why the observed data were collected in the first place. Unhealthier people are more likely to have more frequent measurements taken of disease risk factors, such as estimated glomerular filtration rate (eGFR) for chronic kidney disease and blood pressure and cholesterol for cardiovascular disease. More generally, the pattern and timing of observed measurements may contain information about the underlying health status of the individual. In the context of prediction modelling, we may be able to use this information to predict disease risk more accurately.

The aim of this project is to explore methods for incorporating information about patterns of observations into prediction modelling, utilising simulation studies and real-world data. One method is joint modelling of the observation process with the outcomes of interest, but this may not be feasible to implement in large datasets. Other methods which may be more scalable include methods for clustering patients according to their pattern of informative observation such as latent class modelling, profile regression, and more general mixture modelling and machine learning methods. The methods may be extended to include modelling multiple longitudinal risk factors and within-individual variability which differs between individuals.

The project will be supervised by Dr Jessica Barrett and Dr Brian Tom from the MRC Biostatistics Unit, in collaboration with Prof. Niels Peek, Dr Glen Martin and Dr Matthew Sperrin from the University of Manchester.

Funding Notes

The MRC Biostatistics Unit offers 4 fulltime PhDs funded by the Medical Research Council for commencement in April 2020 (UK applicants only) or October 2020 (all applicants). Academic and Residence eligibility criteria apply.

In order to be formally considered all applicants must complete a University of Cambridge application form. Informal enquiries are welcome to

Applications received via the University application system will all be considered as a gathered field after the closing date 7th January 2020

For all queries see our website for details View Website


1. Paige et al. Landmark models for optimizing the use of repeated measurements of risk factors in electronic health records to predict future disease risk. American Journal of Epidemiology 2018; 187(7): 1530-1538.

2. Fraccaro et al. An external validation of models to predict the onset of chronic kidney disease using population-based electronic health records from Salford, UK. BMC Medicine 2016; 14:104.

Related Subjects

Email Now

Insert previous message below for editing? 
You haven’t included a message. Providing a specific message means universities will take your enquiry more seriously and helps them provide the information you need.
Why not add a message here
* required field
Send a copy to me for my own records.

Your enquiry has been emailed successfully

FindAPhD. Copyright 2005-2019
All rights reserved.