MATLAB Answers

List the 10% highest values of a distribution

38 views (last 30 days)
Alessandro Ruda
Alessandro Ruda on 2 Mar 2021
Commented: Jan on 2 Mar 2021
Dear MatLab comunity,
I would like to obtain a list of values that are within 10% the highest value of the distribution that I have. The data is a list of aminoacids of a protein and their associated root mean square fluctuation.
My idea tough was to have not only the values of RMSF but also the residue associated with it in the list, so that I knew which value belongs to each residue.
So like:
Residue 234 3.3301
Residue 121 2.5150
et cetera
Here's the beginning of the script, I can get a list out of the 10% highest values but I need the association with the residue.
RMSF = load('rmsf.dat');
n = RMSF(:,1); %number of residue
x = RMSF(:,2); %RMSF for given residue
hold on
title('RMSF UDP');
w1 = 0.5;
xlim([0 364])
ylim([0 5])
Ms = sort(x_UDP,'descend'); % Sort Descending
Result = Ms(1:ceil(length(Ms)*0.1))
Hope I have been somehow clear in stating the problem. Any suggestion about this would be highly appreciated!
All the best,

Accepted Answer

Jan on 2 Mar 2021
Edited: Jan on 2 Mar 2021
I guess that "x_UDP" is the same as "x".
RMSF = load('rmsf.dat');
n = RMSF(:,1); %number of residue
x = RMSF(:,2); %RMSF for given residue
[Ms, index] = sort(x, 'descend');
Result = Ms(1:ceil(length(Ms)*0.1));
Result2 = n(index(1:ceil(length(Ms)*0.1)));
% Or easier:
RMSF_sorted = sortrows(RMSF, 2, 'descend');
Result = RMSF_sorted(1:ceil(size(RMSF, 1) * 0.1), :)

Sign in to comment.

More Answers (1)

Veronica Taurino
Veronica Taurino on 2 Mar 2021
Change the last part:
[Ms, idx] = sort(x,'descend'); % Sort Descending
Result = Ms(1:ceil(length(Ms)*0.1));
idx_result = idx(1:length(Result));




Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!