Feature selection with NaN

8 views (last 30 days)
Azura Hashim
Azura Hashim on 14 Mar 2018
Answered: Prajit T R on 22 Mar 2018
I have a high dimensional data where I've managed to build a classification model using fitctree that is returning satisfactory accuracy. The predictors contain a decent proportion of unknown values represented as NaN.
I chose fitctree because it can handle the unknowns. Now I need to reduce the number of predictors using feature selection because recording all the predictors in the final model is not practical.
Is there a feature selection function that will ignore unknown values? I have looked at fscnca and stepwiselm but both don't seem to work. Removing rows containing NaN in the predictor will ignore many other potentially useful predictors and there is no easy way to replace/estimate the unknowns.
Thank you.

Accepted Answer

Prajit T R
Prajit T R on 22 Mar 2018
Hi Azura,
Have you tried the 'fillmissing' function in MATLAB? See the following link: Fillmissing
F = fillmissing(A,method) fills missing entries using the method specified by method, which can be one of the following: previous, next, nearest, linear,spline, pchip.

More Answers (0)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!