The number of occurrences of each character of one string in another

Question

hiva on 28 Dec 2014

Direct link to this question

https://ch.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another

Edited: Luuk van Oosten on 24 Jan 2015

I have a string of more than 100 characters (a protein sequence in FASTA format), for example

'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'

(shortened here for simplicity), and I want to find out whether or not it is hydrophobic. To do that, I have to count the occurrences of each of the characters in the set 'A C F I L M P V W Y' (the hydrophobic amino acids) in my FASTA string. Given how long FASTA strings can be, is there an easy way to do this with MATLAB string functions?

Answer 1

Azzi Abdelmalek on 28 Dec 2014

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163456

Edited: Azzi Abdelmalek on 28 Dec 2014

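str = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
p = {'A' 'C' 'F' 'I' 'L' 'M' 'P' 'V' 'W' 'Y'}';  % the trailing ' transposes p into a column cell array
% Count each residue in str; 'UniformOutput',false keeps the counts in a cell array next to the letters
out = [p cellfun(@(x) nnz(ismember(str,x)), p, 'UniformOutput', false)]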

2 Comments

hiva on 29 Dec 2014

Thanks a lot. I guess this works well for many similar cases in my code that are supposed to work the same way (since it is feature extraction and there are lots of features). It also shows me how much I don't know about MATLAB. Thanks.

Stephen23 on 30 Dec 2014

Edited: Stephen23 on 30 Dec 2014

This could be simplified and sped up by using arrayfun instead of cellfun, and removing the ismember:

>> t = 'ACFILMPVWY';
>> arrayfun(@(x)sum(str==x), t)
ans =
     6     2     4     6    13     2     7     7     1     7

Answer 2

Peter Perkins on 29 Dec 2014

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163537

Another possibility:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> n = hist(double(s),1:90);
>> n(t)
ans =
     6     2     4     6    13     2     7     7     1     7

1 Comment

Jan on 30 Dec 2014

This is a histogram problem, so histc is an efficient and direct solution.
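
A minimal sketch of what the histogram approach could look like, since the comment above names histc without showing code. It assumes the sequence contains only uppercase letters (character codes up to 90, i.e. 'Z'); the variable names counts and counts2 are just illustrative. The accumarray line is an alternative tally for releases where histc is discouraged.

```matlab
s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
t = 'ACFILMPVWY';

% One bin per character code 1..90, so every bin counts exactly one character.
% Assumption: s holds uppercase letters only (codes <= 90).
n = histc(double(s), 1:90);
counts = n(double(t))          % pick out the bins of the hydrophobic residues

% Same tally with accumarray: add 1 at the index given by each character code.
n2 = accumarray(double(s(:)), 1, [90 1]).';
counts2 = n2(double(t))
```

Both variants make a single pass over the sequence, regardless of how many characters are in t.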

Answer 3

Luuk van Oosten on 24 Jan 2015

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_165835

Edited: Luuk van Oosten on 24 Jan 2015

I reckon you are using the Bioinformatics Toolbox. In that case you can probably use:

All = aacount(SEQ)

where SEQ is of course your sequence of interest: MEQNGLDHDSRSSIDTTINDTQKTFLEF....

Then, using

nr_A = All.A
nr_C = All.C
nr_F = All.F

etc. (you get the idea)

you get the counts of your hydrophobic residues. Sum these and you have your hydrophobic score. You might want to normalize this score by dividing it by the total number of amino acids in the sequence.

Of course you can write a loop for this and calculate the hydrophobic score for all your sequences in your FASTA file.
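
As a rough sketch of that loop, here is one way the normalized score could be computed without the toolbox, using ismember instead of aacount. The cell array seqs, the second sequence 'ACDE', and the variable names are hypothetical and only serve as an illustration; reading the sequences from a FASTA file is left out.

```matlab
hydrophobic = 'ACFILMPVWY';

% Hypothetical input: sequences already loaded into a cell array of char vectors.
seqs = {'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH', ...
        'ACDE'};

score = zeros(1, numel(seqs));
for k = 1:numel(seqs)
    seq = seqs{k};
    % Fraction of residues that belong to the hydrophobic set.
    score(k) = nnz(ismember(seq, hydrophobic)) / numel(seq);
end
score
```

Each entry of score lies between 0 and 1, which makes sequences of different lengths comparable.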

Answer 4

Shoaibur Rahman on 28 Dec 2014

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163455

s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
numA = sum(s=='A')
numC = sum(s=='C')
numF = sum(s=='F')
numI = sum(s=='I')
numL = sum(s=='L')
numM = sum(s=='M')
numP = sum(s=='P')
numV = sum(s=='V')
numW = sum(s=='W')
numY = sum(s=='Y')

1 Comment

hiva on 29 Dec 2014

Very simple and elegant. Thanks a lot.

Answer 5

Stephen23 on 30 Dec 2014

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163616

Edited: Stephen23 on 30 Dec 2014

A neat solution using bsxfun:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> sum(bsxfun(@eq,s.',t))
ans =
     6     2     4     6    13     2     7     7     1     7
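
For readers on MATLAB R2016b or later, the same comparison can probably be written without bsxfun, because == expands a column against a row implicitly. The sketch below also passes the dimension to sum explicitly; that is an addition not present in the answer above, made so that a one-character sequence still yields one count per letter.

```matlab
s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
t = 'ACFILMPVWY';

% s.' is numel(s)-by-1 and t is 1-by-numel(t); == expands both to a
% numel(s)-by-numel(t) logical matrix (implicit expansion, R2016b or later).
% Summing along dimension 1 gives one count per character of t.
counts = sum(s.' == t, 1)
```

The intermediate logical matrix has numel(s)*numel(t) elements, which is negligible for protein sequences but worth keeping in mind for very long strings.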

1 Comment

hiva on 30 Dec 2014

Edited: hiva on 30 Dec 2014

Wow! Just wonderful. It works pretty well. Thanks a lot.
