Specifying and Optimising Data Wrangling Tasks

  • Full or part time
  • Application Deadline
    Applications accepted all year round
  • Self-Funded PhD Students Only

Project Description

Data wrangling is "the process of cleaning, structuring and enriching raw data into a desired format for better decision making in less time" [2].

To clean data before analysis, data scientists use a wide variety of data quality techniques and tools [1]. These techniques and tools trade off flexibility, performance and usability [1]. Highly flexible tools tend to overburden the end user with the need to use complex application programming interfaces to express quality-aware manipulations over the data. The balance lies somewhere on a spectrum between highly flexible, extensible solutions and less flexible but efficient and user-friendly frameworks. In practice, a data quality management project may need a combination of complementary tools and techniques.

This PhD project aims to investigate popular techniques and tools used by data scientists to conduct data wrangling tasks prior to big data analytics, and to develop domain-specific methods and languages.

Funding Notes

If you have the correct qualifications and access to your own funding, either from your home country or your own finances, your application to work with this supervisor will be considered.

References

[1] Sampaio, Sandra; Al-Jubairah, Mashael; Permana, Hapsoro Adi; Sampaio, Pedro. A Conceptual Approach for Supporting Traffic Data Wrangling Tasks. In: The Computer Journal. 2018 (to appear).
[2] What is Data Wrangling? https://www.trifacta.com/data-wrangling/

How good is research at The University of Manchester in Computer Science and Informatics?

FTE Category A staff submitted: 44.86

Research output data provided by the Research Excellence Framework (REF)

FindAPhD. Copyright 2005-2019
All rights reserved.