How to Split test and training set
33 views (last 30 days)
Garmit Pant on 23 Jun 2022
It is my understanding that you have data in the form of CSV files that you wish to load and split into train and test subsets.
You can use the 'cvpartition' function of the Statistics and Machine Learning Toolbox to do the same.
You'll can use the following code snippet to divide your data into training and test subsets.
XTrain = readtable("urbanGB.csv");
YTrain = readtable("urbanGB.labels.csv");
XTrain.Y = table2cell(YTrain); %assuming YTrain to be a single column
rng('default') % For reproducibility
n = length(XTrain.Y);
hpartition = cvpartition(n,'Holdout',0.3); % Nonstratified partition
idxTrain = training(hpartition);
tblTrain = tbl(idxTrain,:);
idxTest = test(hpartition);
tblTest = tbl(idxNew,:);
You can refer to the documentation of cvpartition for more details: cvpartition