Don't miss our weekly PhD newsletter | Sign up now Don't miss our weekly PhD newsletter | Sign up now

  Data Quality and Cleaning in Big Data - Project ID SOC0014


   School of Computing

This project is no longer listed on FindAPhD.com and may not be available.

Click here to search FindAPhD.com for PhD studentship opportunities
  Dr T Peng, Assoc Prof A Lawson  No more applications being accepted  Self-Funded PhD Students Only

About the Project

The proliferation of Big Data analytics is developing ever more sophisticated models for intelligent data-driven insight and decision making in business and in other areas such as health and social care. However, critical issues relating to the data quality that is required for these models to be effective and trustworthy are not getting the attention they deserve. This project will investigate data quality and data cleaning in Big Data, focusing on how the characteristics of Big Data affect the suitability of existing data quality/data cleaning approaches. The successful candidate will be expected to undertake research into current data quality approaches, and then propose and evaluate a novel data quality approach/framework, which can be used in Big Data applications. The area of applications, such as banking, retail, or health and social care will be for the successful candidate to determine in conversation with the supervisors.

Academic qualifications
A first degree (at least a 2.1) ideally in Mathematics or Computing with a good fundamental knowledge of Data Science and Algorithms.

English language requirement
IELTS score must be at least 6.5 (with not less than 6.0 in each of the four components). Other,
equivalent qualifications will be accepted. Full details of the University’s policy are available here https://www.napier.ac.uk/research-and-innovation/research-degrees/application-process

Essential attributes:
 Experience of fundamental database applications
 Competent in data structure and algorithms
 Knowledge of data science
 Good written and oral communication skills
 Strong motivation, with evidence of independent research skills relevant to the project
 Good time management

Desirable attributes:
A basic understanding of data quality and data cleaning would be beneficial.

When applying for this position please quote Project ID SOC0014

Edinburgh Napier University is committed to promoting equality and diversity in our staff and student community https://www.napier.ac.uk/about-us/university-governance/equality-and-diversity-information

Funding Notes

This is an unfunded position

References

L.Li, T. Peng and J. Kennedy, A Rule Based Taxonomy of Dirty Data. GSTF
International Journal on Computing, Vol. 1, No. 2 2011

A. Immonen, P. Paakkonen and E. Ovaska, Evaluating the Auality of Social
Media Data in Big Data Architecture, IEEE Access 2015, Vol. 3

C. Batini and M. Scannapieco, Data and INfomration Quality: Dimensions,
Principles and Techniques, Springer, 2016.