Don't miss our weekly PhD newsletter | Sign up now Don't miss our weekly PhD newsletter | Sign up now

  Using machine learning to identify aggregation resistant biopharmaceuticals


   Faculty of Biological Sciences

This project is no longer listed on FindAPhD.com and may not be available.

Click here to search FindAPhD.com for PhD studentship opportunities
Prof David Brockwell Prof David Westhead Prof Sheena Radford  No more applications being accepted  Competition Funded PhD Project (European/UK Students Only)

About the Project

Background
The UK is a major stakeholder in biopharmaceutical development and production, a sector that had sales of $228 billion in 2016. Aggregation is a major hurdle to their manufacture resulting in the failure of promising candidate biologics even at very late stages in the development pipeline. The ability to identify sequences likely to aggregate during production, transport or storage is of crucial importance to the biologics industry. This is currently beyond our capability both for mAbs and for the arsenal of advanced therapies (antibody-drug conjugates etc) that have the potential of revolutionising medicine in the future.
Together with Astra Zeneca, we have developed an in vivo selection method in E.coli able to quantify the aggregation propensity of bio-therapeutics that include mAbs by linking aggregation to antibiotic resistance. We have shown the assay can be used to screen for aggregation-resistant proteins of therapeutic importance with different protein scaffolds (reference 1) (a previous BBSRC CASE student with Avacta/AZ)) and, most recently, have used it combined with directed evolution to generate new proteins with enhanced bioprocessing capability (under review).
Excitingly, in addition to isolating inherently developable therapeutics, this combined approach allows isolation of thousands protein sequences with known aggregation properties, opening the door to using machine learning (ML) to identify the key drivers of aggregation (whether during ageing and neurodegeneration or during advanced therapy manufacture) from such highly complex datasets.

Objectives. In collaboration with our industrial collaborators at Astra Zeneca we will:
1. Generate a large dataset of protein sequences with improved (positive selection) and worsened (negative selection) aggregation propensity. This will be achieved by performing directed evolution on five single-chain Fv (scFv) sequences with low sequence identity but poor biophysical behavior identified from the literature and our industrial partner.
2. Use these data as training sets for the development of ML algorithms to identify aggregation resistant sequences.
3. Validate the machine learning outputs by quantifying the aggregation properties of a test set of sequences ranked by the optimized ML algorithm.

Novelty and timeliness
The ability to identify aggregation-resistant protein therapeutics early in development, without the need of large scale purification is both novel and timely, especially as more complex protein therapeutics are currently in development. Additionally, our novel evolution platform will be used as a high throughput screen enabling the generation of large datasets which will be used in a ‘big data’ approach to understand the complex multi-factorial mechanisms underlying selection. This will ultimately lead to novel predictors of aggregation and an understanding of the fundamental mechanisms.

Experimental Approach
Molecular biology (error prone PCR and golden gate cloning) will be used to generate libraries of mutated scFv. High throughput sequencing and high throughput aggregation assays will be used to construct a large dataset of sequences with known aggregation behaviour.
These data will be used to carry out ML initially within Python using Scj-kit Learn with the aim of generating new predictive methods for protein aggregation.
The predictive power of the optimised classifier will be verified by expressing a range of optimised sequences in the full IgG scaffold (the student will do these experiments at AZ) and their properties assessed using industry employed methods (e.g. accelerated stability assays, SEC and AC SINS).

Work during placement
it is envisaged that several short visits to Astra Zeneca’s Cambridge site in years one and two will precede a longer visit in year 3. The aims of the visits in years 1 and 2 will be to construct, express, purify and characterize the “wild-type” IgG sequences that will be subjected to directed evolution at Leeds. In year 3, similar work will be undertaken on a larger number of constructs to quantify prediction accuracy of the developed algorithm. Proteins will be characterized using the panoply of methods used in industry e.g. SEC, AC-SINS, DSC, IEF, MS. The project will form part of a true collaboration with AZ, and visits to AZ will also be organized as the science dictates as the project develops.

Funding Notes

BBSRC White Rose Mechanistic Biology DTP CASE 4 year studentship.
Studentships covers UK/EU fees and stipend (c.£15,009) for 4 years to start in Oct 2020. Applicants should have/be expecting at least a 2.1 Hons. degree in a relevant subject. EU candidates require 3 years of UK residency in order to receive full studentship. English language requirements may apply.
Apply online https://studentservices.leeds.ac.uk/pls/banprod/bwskalog_uol.P_DispLoginNon Course is PhD in Biological Sciences and we require a CV and transcripts.

References

1. An in vivo platform for identifying inhibitors of protein aggregation. Saunders, J., Young, L., Mahood, R., Jackson, M., Revill, C., Foster, R., Smith, A., Ashcroft, A., Brockwell, D. and Radford, S. (2016) Nat Chem Biol. 12:94-101.

How good is research at University of Leeds in Biological Sciences?


Research output data provided by the Research Excellence Framework (REF)

Click here to see the results for all UK universities

Where will I study?


Project supervisors

Career overview

Professor David Brockwell is a Professor of Biochemistry and Molecular Biology at the University of Leeds, where he has been a faculty member since 2004. He completed his BSc in Pharmacy at the University of Manchester, followed by a pre-registration year at St Bartholomew''s Hospital in London, qualifying as a pharmacist in 1993. He returned to the University of Manchester for his PhD research, supervised by Dr Jill Barber, focusing on the biophysical effects of protein perdeuteriation. After a brief postdoctoral position at the same laboratory, he worked as a postdoc at the University of Leeds in Professor Sheena Radford''s lab for six years, where he began investigating force-induced unfolding and remodelling of proteins. In 2004, he was appointed to a joint URF/Lecturer position at Leeds and became an Associate Professor in 2012. With over 15 years of experience, Professor Brockwell''s research primarily investigates the effects of force on proteins and their aggregation, resulting in more than 45 publications in the field. His expertise encompasses protein (un)folding, force in biology, outer membrane protein biogenesis, biopharmaceutical aggregation and engineering, and protein hydrogels.


Research interests

Professor Brockwell''s research focuses on several key areas within biochemistry and molecular biology. His work investigates the effects of mechanical force on proteins and their complexes, utilising atomic force microscopy (AFM) to measure the mechanical properties of single protein molecules. He has explored how proteins with similar stability to chemical denaturants can exhibit different behaviours when subjected to force, and has studied the mechanical gating of outer membrane transporters. In the realm of membrane protein folding, Professor Brockwell examines the folding and insertion processes of bacterial outer membrane proteins (OMPs), collaborating with other researchers to understand how periplasmic chaperones and the b-barrel assembly machinery facilitate these processes. His research also addresses the challenges in biopharmaceutical manufacture, particularly how environmental changes can lead to unwanted protein unfolding and aggregation, which is critical in the biopharmaceutical industry. He collaborates with colleagues to investigate flow-induced aggregation and the manufacturability of biopharmaceuticals. Additionally, Professor Brockwell''s interests extend to protein hydrogels, which have applications in tissue engineering and drug delivery. He is working on developing hydrogels from folded globular proteins to exploit their full functional spectrum, including catalysis and ligand binding.

View Professor David Brockwell's profile 
Career overview

Professor David R Westhead is a Professor of Bioinformatics at the University of Leeds, within the School of Molecular and Cellular Biology. He completed his MA and DPhil at the University of Cambridge and the University of Oxford, respectively, both in 1992. Following his studies, he undertook postdoctoral work at Zeneca plc, Proteus Molecular Design Ltd., and the European Bioinformatics Institute under the supervision of Professor Janet Thornton FRS. He was appointed as a Lecturer in Bioinformatics in 1998, became a Senior Lecturer in 2003, and was promoted to Professor in 2006. From 2011 to 2018, he served as the Head of the School of Molecular and Cellular Biology.


Research interests

Professor Westhead''s research focuses on prediction methods for biological problems using machine learning and statistical methods. The main application areas include genetic regulation and cancer, working with genome-scale data sets derived from various technologies, particularly high-throughput sequencing. The motivation is to address biological problems through data and computational methods, such as understanding the molecular networks underlying cancer and their relation to prognosis and therapy. Professor Westhead collaborates extensively with data-generating groups, including local partnerships with St. James University Hospital and national networks in haematological oncology. The research group is part of the Leeds Institute of Data Analytics (LIDA) and the Leeds Omics group. Current projects include targeting transfer RNA-derived fragments during KSHV infection and investigating virus manipulation of host non-coding RNA regulatory networks. The ultimate aim of the research is to contribute to stratified medicine by identifying key cancer-driving processes in individual patients and targeting these with specific therapies.

View Professor David R Westhead's profile 
Career overview

Professor Sheena Radford joined the University of Leeds in 1995 as a Lecturer in the School of Biochemistry and Molecular Biology, progressing to Reader in 1998 and Professor in 2000. In 2009, she became the Deputy Director of the Astbury Centre for Structural Molecular Biology, and served as its Director from 2012 to 2021. She was appointed Astbury Professor of Biophysics in 2014 and became a Royal Society Research Professor in 2021. Professor Radford graduated with a BSc in Biochemistry from the University of Birmingham and completed her PhD in Biochemistry at the University of Cambridge under the supervision of Professor R.N. Perham, FRS. She has held various postdoctoral positions and a Royal Society University Research Fellowship at the Oxford Centre for Molecular Sciences. Throughout her career, Professor Radford has supervised around 25 PhD students and postdoctoral researchers in her laboratory, with over 160 individuals successfully progressing from her lab into various careers in academic research, industry, and technical editing. She has published more than 360 peer-reviewed papers and book chapters and has delivered over 475 invited lectures at national and international conferences across numerous countries. In the last five years, she has served on five major research funding panels and 20 Scientific Advisory Boards for prestigious institutions and companies. Additionally, she has been involved with editorial boards for several journals and currently serves as an Associate Editor for the Journal of Molecular Biology. She is also a Trustee and Council member of the Dementia Research Institute, UK, and the Regional Champion for the Academy of Medical Sciences. Professor Radford has received multiple awards, including the Biochemical Society Colworth Medal in 1996, the Royal Society of Chemistry AstraZeneca Prize in 2005, the Hites Award from the American Society for Mass Spectrometry in 2009, the Protein Society Carl Branden Award in 2013, and the Rita and John Cornforth Award of the Royal Society of Chemistry in 2015. She was elected a member of EMBO in 2007, a member of Academia Europaea in 2020, and has been recognised as a Fellow of the Academy of Medical Sciences (2010), the Royal Society (2014), and the Royal Society of Biology (2021). She was made an honorary member of the British Biophysical Society in 2014 and a Fellow of the Biophysical Society in 2018. In 2022, she received an honorary doctorate from the University of Liège, and in 2024, she became an International Member of the National Academy of Sciences (USA).


Research interests

Professor Radford''s research focuses on fundamental structural molecular biology, specifically the measurement of the conformational dynamics of proteins and the elucidation of the role that these motions play in protein folding and misfolding of both water-soluble and membrane proteins. Their research employs a wide range of biophysical methods, combining techniques from protein chemistry, molecular biology, chemical biology, and structural biology. Over the last 35 years, they have concentrated on delineating the mechanisms by which proteins fold or misfold, how dynamic excursions enable proteins to self-associate into amyloid fibrils—which are complex macromolecular assemblies associated with some of the deadliest human diseases—and how proteins fold into the bacterial outer membranes of Gram-negative organisms. Current major projects include: Mechanism(s) of protein misfolding and assembly into amyloid, Outer Membrane Protein (OMP) folding – The role of chaperones & BAM, Stabilising proteins of therapeutic interest against aggregation, Method development (MS, NMR, single molecule, biophysical methods).

View Professor Sheena Radford's profile