Vector Manipulation

Hi Everyone,
I have a vector that is an index containing row numbers that need to be removed from a dataset. However, due the nature of my data, I know that not only do these rows need to be removed, but also the 30 rows after any of them.
My question is:
How would I add new entries to the existing vector to include each of the 30 rows following any existing entry. There should end up being 30 new entries for every existing one and I could then unique(vector) to remove any duplicate row numbers.
I hope this isn't too much of a doit4me. I don't really use anything other than dataset arrays from the statistics toolbox, so my vector manipulation skills are sort of limited.

 Accepted Answer

row_index = [1;33;67]; %row indices
row_index = unique(cumsum(horzcat(row_index,ones(size(row_index,1),30)),2))
I threw the unique in there in case you have say (15,30) where 15 of the 15's values would overlap with some of the 30's. If there's no fear of that:
row_index = reshape(cumsum(horzcat(row_index,ones(size(row_index,1),30)),2).',[],1)
Definitely not a doit4me, as you posed your problem well, asked politely, made it interesting (matrix manipulation usually is).

5 Comments

DJ
DJ on 22 Jul 2011
I do have periods where it definitely overlaps, so I used the unique version. It worked like a charm.
Thanks!
Interestingly, you don't actually need unique, or for the vector to be in ascending order:
A = magic(10);
B = [3;3;1;1];
A(B,:) = [];
That simplifies the above to:
reshape(cumsum(horzcat(row_index,ones(size(row_index,1),30)),2),[],1)
Is there a way to check if the row_index exceeds the size of the data? e.g.
A=1:70;
A(row_index) cause an error
but (A(row_index)=0 still works through.
DJ
DJ on 26 Jul 2011
I just ran into the index exceeding the size of data problem actually. The the last row contained a bad entry. If any of the last 29 rows were in the index, it would exceed the size of the data :s.
I know this is old but I've been away from the computer for a very happy 10 days.
Use min() with the row size so if it's exceeded it's deleted.
row_index = min(row_index,size(A,1));

Sign in to comment.

More Answers (1)

The best way is to provide some example data so we can understand your question better. Below is my guess.
Data=(1:1000)';
Index=1:100:1000
for k=1:length(Index)
EndIndex=min(Index(k)+30,length(Data));
% remove
%Data(Index(k):EndIndex)=[];
% or provide new data
Data(Index(k):EndIndex)=0;
end

2 Comments

Removing won't work as the vector will change positions on each deletion. You'd either have to run it backwards (where there would still be inconsistencies with duplicates) or build an index vector and to the deletion all at once. The deletion all at once is the best method since you won't be resizing the array on each iteration.
Good point, thank you! I missed it.

Sign in to comment.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!