Machine Learning for Dimension Reduction in High‐Dimensional Datasets


Cardiff School of Mathematics

About the Project

In today’s environment, where computer processors are powerful and computer memory is cheap, researchers are able to collect and store huge amounts of data. Analysing that data requires sophisticated statistical and computational methods, as most classical statistical methodology was developed in an era when data collection was not as easy and datasets were many orders of magnitude smaller. Sufficient dimension reduction (SDR) is a class of methods for feature extraction in regression and classification problems, with the purpose of reducing a multidimensional dataset to a few important features.
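To make the idea of SDR concrete, the following is a minimal sketch of sliced inverse regression (SIR), a classical SDR estimator. It is shown purely as an illustration and is not the SVM-based methodology this project builds on. The function name `sliced_inverse_regression`, its parameters, and the simulated data are all assumptions made for this example.

```python
import numpy as np


def sliced_inverse_regression(X, y, n_slices=10, n_directions=1):
    """Illustrative SIR sketch: estimate directions B such that y depends on X mainly through X @ B."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    # Standardise the predictors: Z = (X - mean) @ cov^{-1/2}.
    mean = X.mean(axis=0)
    cov = np.cov(X, rowvar=False)
    evals, evecs = np.linalg.eigh(cov)
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    Z = (X - mean) @ inv_sqrt

    # Slice the observations by the sorted response and average Z within each slice.
    order = np.argsort(y)
    M = np.zeros((p, p))
    for idx in np.array_split(order, n_slices):
        slice_mean = Z[idx].mean(axis=0)
        M += (len(idx) / n) * np.outer(slice_mean, slice_mean)

    # The leading eigenvectors of M span the estimated reduced space (on the Z scale).
    _, vecs = np.linalg.eigh(M)
    top = vecs[:, ::-1][:, :n_directions]

    # Map back to the original predictor scale and normalise each direction.
    B = inv_sqrt @ top
    return B / np.linalg.norm(B, axis=0)


if __name__ == "__main__":
    # Simulated example (assumed setup): y depends on 5 predictors only through one direction.
    rng = np.random.default_rng(0)
    X = rng.standard_normal((2000, 5))
    beta = np.array([1.0, 1.0, 0.0, 0.0, 0.0]) / np.sqrt(2)
    y = np.exp(X @ beta) + 0.1 * rng.standard_normal(2000)

    B = sliced_inverse_regression(X, y)
    print("estimated direction:", B[:, 0])
    print("absolute cosine with true direction:", abs(B[:, 0] @ beta))
```

In this simulated setting the five predictors are reduced to a single feature, `X @ B`, which can then be plotted against the response. The sign of an estimated direction is arbitrary, which is why the comparison uses the absolute cosine.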

This has the potential to improve visualization of the most important relationships between the variables. This project will focus on improving existing methodology to obtain more accurate and computationally faster estimation algorithms for SDR. One of the most interesting suggestions in the literature uses machine learning algorithms, specifically Support Vector Machines (SVM). The method, although powerful, can be improved in several ways, so there are a number of directions that a student can take on this project. A few examples are: to derive new SDR methodology that is robust to outliers; to derive sparse SDR methodology; to derive SDR methodology for data with missing predictors; to derive SDR methodology for functional data; and many more.

Moreover, there are many modern applications (such as text data analysis) where the data are very high-dimensional and not derived from a Gaussian distribution. In those cases, the literature offers few computationally effective methods for efficient dimension reduction. We are looking to develop both supervised and unsupervised dimension reduction methods (such as non-Gaussian PCA and non-Gaussian CCA) that are computationally efficient and accurate, especially in the nonlinear feature extraction setting. Interested students can pursue a number of directions: sparse methodology, real-time algorithms, or applications to real datasets.


Funding Notes

We are interested in pursuing this project and welcome applications if you are self-funded or have funding from other sources, including government sponsorships or your employer.

HOW TO APPLY

Applicants should submit an application for postgraduate study via the online application service.



In the research proposal section of your application, please specify the project title and supervisors of this project.

If you are applying for more than one Cardiff University project, please note this in the research proposal section.


FindAPhD. Copyright 2005-2021
All rights reserved.