How to partition a table or matrix of observation data so that samples from every group are included?
4 views (last 30 days)
Show older comments
I have a table of categorized data with 41 categories. Every category has at least 5 samples. If I partition it into a 90:10 training:test split like so:
cvp = cvpartition(data.Category,'Holdout',0.1);
it'll split it into the quantities I want but one group will frequently not contain samples from all 41 categories. How do I split it so that all categories are represented in both the training and test datasets? It's in Table format at the moment but can switch to cell array if needed.
0 Comments
Answers (1)
Sachin Meena
on 21 Sep 2018
Try the
cvpartition(group,'HoldOut',p,'Stratify',stratifyOption)
0 Comments
See Also
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!