Project objectives: Data systems are going through a major transition due to the challenges of Big Data processing. The volume and velocity of data generated from a variety of sources are far outpacing the available storage and processing capacity. Data science brings structure to large quantities of data and makes analysis possible. However, existing data systems are not able to meet the computational challenges of data science applications. Through this project, the researcher will devise new approaches to data processing that can support analysis of data at massive scale. The goal of the project is to develop a scalable runtime system for data science applications and big data processing. The focus is therefore on the systems aspects rather than on data analysis or data mining.
Location: The University of New Brunswick, Fredericton, is one of the top comprehensive universities in Canada. The Faculty of Computer Science is the first faculty of computer science in Canada and has been a leader in Atlantic Canada since 1968, with the oldest and most successful co-op program in Atlantic Canada.
Description: This research project will develop high-performance big data and data science systems. The researcher will explore high-performance SQL query processing approaches using cutting-edge query compilation techniques, taking advantage of modern multi-core hardware as well as distributed Big Data frameworks such as Hadoop and Spark. The researcher will also investigate parallel runtime infrastructure for data processing on modern hardware. This is a fully funded (i.e. full scholarship) PhD position.
Qualifications: A solid background in Computer Science (or Computer Engineering), including a thesis-based research Master’s degree from a reputable university with excellent grades, is required. Strong programming (coding and debugging) skills in C/C++ are necessary, and sound knowledge of Python and/or Java is expected. A solid understanding of, and experience with, database system internals, parallel programming, compiler design, and Linux systems programming is advantageous. Familiarity with relational databases such as PostgreSQL and MySQL, the Python data science ecosystem, and machine learning libraries is appreciated.
Contact: Please send your CV and your Bachelor’s and Master’s degree transcripts by email to [email protected]