Don't miss our weekly PhD newsletter | Sign up now Don't miss our weekly PhD newsletter | Sign up now

This project is no longer listed on and may not be available.

Click here to search for PhD studentship opportunities
  Dr T Peng, Assoc Prof A Lawson  Applications accepted all year round  Self-Funded PhD Students Only

About the Project

The proliferation of Big Data analytics is developing ever more sophisticated models for intelligent data-driven insight and decision making in business and in other areas such as health and social care. However critical issues relating to the data quality that is required for these models to be effective and trustworthy are not getting the attention they deserve. This project will investigate data quality and data cleaning in Big Data, focusing on how the characteristics of Big Data affect the suitability of existing data quality/data cleaning approaches.

The successful candidate will be expected to undertake research into current data quality approaches, and then propose and evaluate a novel data quality approach/framework, which can be used in Big Data applications. The area of applications, such as banking, retail, manufacturing, internet of things, or health and social care will be for the successful candidate to determine in conversation with the supervisors.

Prospective applicants are encouraged to contact the Supervisor before submitting their applications. Applications should make it clear the project you are applying for and the name of the supervisor(s).

Academic qualifications

A first degree (at least a 2.1) ideally in Mathematics or Computing with a good fundamental knowledge of Data Science and Algorithms.

English language requirement

IELTS score must be at least 6.5 (with not less than 6.0 in each of the four components). Other, equivalent qualifications will be accepted. Full details of the University’s policy are available online.

Essential attributes:

  • Experience of fundamental database applications
  • Competent in data structures and algorithms
  • Knowledge of data science
  • Good written and oral communication skills
  • Strong motivation, with evidence of independent research skills relevant to the project
  • Good time management

Desirable attributes:

  • A basic understanding of data quality and data cleaning would be beneficial.

For enquiries about the content of the project, please email Dr Taoxin Peng [Email Address Removed]

For information about how to apply, please visit our website

To apply, please select the link for the PhD Computing FT application form

Computer Science (8) Mathematics (25)


A. Immonen, P. Paakkonen and E. Ovaska, Evaluating the Quality of Social Media Data in Big Data Architecture, IEEE Access 2015, Vol. 3 C. Batini and M. Scannapieco, Data and Infomration Quality: Dimensions, Principles and Techniques, Springer, 2016.
H. Liu, A. Kumar T.K., J. P. Thomas and X. Hou, Cleaning Framework for BigData: An Interactive Approach for Data Cleaning, 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService), 2016, pp. 174-181, doi: 10.1109/BigDataService.2016.41. F. Ridzuan and Z. Wan, A review on data cleansing methods for big data. Procedia Comput Sci. 2019,
X. Wang and C. Wang, "Time Series Data Cleaning: A Survey," in IEEE Access, vol. 8, pp. 1866-1881, 2020, doi: 10.1109/ACCESS.2019.2962152 Mayur Kishor Shende, Andrés E. Feijóo-Lorenzo, Neeraj Dhanraj Bokde, cleanTS: Automated (AutoML) tool to clean univariate time series at microscales, Neurocomputing,Volume 500, 2022, Pages 155-176
Search Suggestions
Search suggestions

Based on your current searches we recommend the following search filters.

 About the Project