Looking to list your PhD opportunities? Log in here.
This project is no longer listed on FindAPhD.com and may not be available.
Click here to search FindAPhD.com for PhD studentship opportunitiesAbout the Project
Biology is being transformed by large-scale data sets, but how do you find the right one? The data needs a ‘dating profile’: sounds easy but it turns out to be a bottleneck that is holding back research progress and research culture. This project tackles that metadata bottleneck. We aim to help researchers to show the best side of their data.
Data-intensive bioscience depends upon online repositories that share the “Big Data”. There’s little value in sharing data, if you can’t tell which organism, sample, or conditions it came from, so the databases also need descriptions of the data, termed metadata. Top-tier repositories pay professional data curators to deal with their metadata but many other repositories cannot do so. Even curators can’t invent metadata, the original researchers have to describe their research for the curators.
This project first aims to understand current data descriptions in research data repositories, using text mining and machine learning, in particular named entity recognition in free-text descriptions. Based on this evidence, you will research the simplest ways to improve the descriptions in future. The project will test real-time feedback that encourages researchers to provide better descriptions, for example using controlled vocabularies. You will work with software developers to test and evaluate simple feedback processes, in practice, for biological data repositories. By then, you will also be an expert data steward.
Improving data descriptions will accelerate data-intensive bioscience across many research fields, as this bottleneck applies to many repositories and even electronic lab notebooks. Making the data easier to re-use will also reward the researchers who share their data, supporting the “Open Science” aspect of the new research culture.
The team: Andrew Millar (Edinburgh) and Jason Swedlow (Dundee), are biologists who also develop and run data repositories, and help researchers to manage their data. We have access to internationally-adopted repositories (e.g. https://idr.openmicroscopy.org), their metadata, and to their software developers, who can help to implement feedback processes.
Ian Simpson (Edinburgh Informatics) applies natural language processing (text mining) software tools to analyse bioscience literature, and has worked on several “Big Data” bioscience projects, including with Andrew Millar.
The project is based with the BioRDM team in the C.H. Waddington building, at the focal point of SynthSys, the interdisciplinary biology research centre at the University of Edinburgh, where many labs generate, analyse and model large-scale biological data. More information at https://www.ed.ac.uk/biology/synthsys
Individual profile - www.amillar.org
BioRDM team’s public wiki - https://www.wiki.ed.ac.uk/x/VA1wF
https://homepages.inf.ed.ac.uk/tsimpson/
https://github.com/biomedicalinformaticsgroup
https://www.dundee.ac.uk/people/jason-swedlow
The School of Biological Sciences is committed to Equality & Diversity: https://www.ed.ac.uk/biology/equality-and-diversity
How to Apply
The “Institution Website” button will take you to our online Application Checklist. From here you can formally apply online. This checklist also provides a link to EASTBIO - how to apply web page. You must follow the Application Checklist and EASTBIO guidance carefully, in particular ensuring you complete all the EASTBIO requirements, and use /upload relevant EASTBIO forms to your online application.
Funding Notes
This opportunity is open to UK and International students and provides funding to cover stipend at UKRI standard rate (£17,668 annually in 2022) and UK level tuition fees. The fee difference will be covered by the University of Edinburgh for successful international applicants, however any Visa or Health Insurance costs are not covered. UKRI eligibility guidance: Terms and Conditions: https://www.ukri.org/wp-content/uploads/2020/10/UKRI-291020-guidance-to-training-grant-terms-and-conditions.pdf International/EU: https://www.ukri.org/wp-content/uploads/2021/03/UKRI-170321-InternationalEligibilityImplementationGuidance.pdf
How good is research at University of Edinburgh in Biological Sciences?
Research output data provided by the Research Excellence Framework (REF)
Click here to see the results for all UK universities
Search suggestions
Based on your current searches we recommend the following search filters.
Check out our other PhDs in Edinburgh, United Kingdom
Check out our other PhDs in United Kingdom
Start a New search with our database of over 4,000 PhDs

PhD suggestions
Based on your current search criteria we thought you might be interested in these.
Adaptivity and machine learning techniques for PDE problems with uncertain inputs
University of Birmingham
Computer Vision with Deep Learning for Human Data Modelling
Durham University
Discovery of new materials for applications on glass using Deep Machine Learning and Data Analytics
University of Liverpool