
  Data Integration & Exploration on Data Lakes


   Department of Computer Science

This project is no longer listed on FindAPhD.com and may not be available.

  Dr A Freitas, Prof N Paton
  Applications accepted all year round
  Self-Funded PhD Students Only

About the Project

Data Lakes are emerging as data management infrastructures for storing data in various schemata and structural forms. Their goal is to serve as a single entry point for the data analysis process across highly heterogeneous datasets, supporting analytical tasks following a schema-on-read approach, in which data is discovered and integrated when it is to be used. Due to their semantic and structural heterogeneity, Data Lakes bring integration challenges to a new scale of complexity.

The Information Management Group at the University of Manchester invites applications from PhD candidates in the area of data integration and exploration on Data Lakes. PhD projects in this area will explore how contemporary techniques in Natural Language Processing (such as Open Information Extraction, Distributional Semantics and Semantic Parsing) can be used as a foundation to support exploratory data analysis on real-world Data Lakes.

Examples of research challenges include:

How to scale the integration of unstructured, semi-structured and structured datasets.
How to support end-users in exploratory data analysis (for example, using natural language questions).
How to use information embedded in large-scale corpora to support data integration.
How to use contemporary techniques in one-shot machine learning to support data integration.

Applicants are expected to have:

An excellent undergraduate degree in Computer Science or Mathematics (or a related discipline) and, preferably, a relevant M.Sc. degree.
Confidence and independence in programming complex systems in Java or Python.
Previous academic or industry experience in Natural Language Processing or Data Science (desired).
Excellent report writing and presentation skills.

Please note that applicants must also satisfy the standard requirements for postgraduate studies at the University of Manchester, such as a first-class or high upper-second-class degree (or an equivalent international qualification) and English language qualifications, as stated in the PGR guidelines.

Qualified applicants are strongly encouraged to contact Norman Paton ([Email Address Removed]) and Andre Freitas ([Email Address Removed]) informally to discuss their application before applying.

Funding Notes

If you have the correct qualifications and access to your own funding, either from your home country or your own finances, your application will be considered.
