Replace strings with a different string and save the data

Question

Avik Mahata on 9 Dec 2023

0
Link

Direct link to this question

https://ch.mathworks.com/matlabcentral/answers/2058309-replace-strings-with-a-different-string-and-save-the-data

Answered: Voss on 21 Dec 2023

graphene.txt

I have a file for atomic coordinates, where I wanted to replace the 'c' with a certain pertange of maerials 'Fe', 'Al', 'Au'. The column C needs to be replaced with 40% Fe, 30% Al and 30% Au. Nothing else changes. The rest of the files stays same. I also plan to extend this to more elements lets say, Fe, Al, Ni, Cu, Zr all 20% each. I have attached a sample file. Below is a exceprt from the file. I will really appreciate any suggestions.

168

generated by VMD

C 0.000000 0.000000 0.000000

C -0.866025 0.500000 0.000000

C -0.866025 1.500000 0.000000

C 0.000000 2.000000 0.000000

C 1.732051 0.000000 0.000000

C 0.866025 0.500000 0.000000

C 0.866025 1.500000 0.000000

C 1.732051 2.000000 0.000000

C 3.464102 0.000000 0.000000

C 2.598076 0.500000 0.000000

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

VBBV on 9 Dec 2023

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/2058309-replace-strings-with-a-different-string-and-save-the-data#answer_1368439

Edited: VBBV on 9 Dec 2023

Open in MATLAB Online

data.txt

D = readtable('data.txt');
C = char(['40% Fe';'30% Al']) % e.g. element data to be filled in 1st column
C = 2×6 char array
    '40% Fe'
    '30% Al'
D.(1) = repmat(C,168/2,1) % replace 1st column of the table with element data
D = 168×4 table
     Var1       Var2      Var3    Var4
    ______    ________    ____    ____

    40% Fe           0      0      0  
    30% Al    -0.86603    0.5      0  
    40% Fe    -0.86603    1.5      0  
    30% Al           0      2      0  
    40% Fe      1.7321      0      0  
    30% Al     0.86603    0.5      0  
    40% Fe     0.86603    1.5      0  
    30% Al      1.7321      2      0  
    40% Fe      3.4641      0      0  
    30% Al      2.5981    0.5      0  
    40% Fe      2.5981    1.5      0  
    30% Al      3.4641      2      0  
    40% Fe      5.1962      0      0  
    30% Al      4.3301    0.5      0  
    40% Fe      4.3301    1.5      0  
    30% Al      5.1962      2      0  

2 Comments
Show NoneHide None

VBBV on 9 Dec 2023

You can rpleace the 1st column of the datatable with percentage of elements you want as vector shown above

Avik Mahata on 9 Dec 2023

I really appreciate your comment. I didn't do a good job explaining the problem. What I meant to say is , that 'C' will be replace by 2/3/4 different elements (characters) of centain pertanage. That means if I have 100 'C' rows in the first column, it will be replace by 40 Fe, 30 Al and lets say 30 Ni. I should have a way to change the amount/percentage I want to change 'C' in the first column.

So basically what I have is below,

168

generated by VMD

C 0.000000 0.000000 0.000000

C -0.866025 0.500000 0.000000

C -0.866025 1.500000 0.000000

C 0.000000 2.000000 0.000000

C 1.732051 0.000000 0.000000

What I want is this,

168

generated by VMD

Fe 0 0 0

Al -0.866025000000000 0.500000000000000 0

Ni -0.866025000000000 1.50000000000000 0

Fe 0 2 0

Al 1.73205100000000 0 0

Ni 0.866025000000000 0.500000000000000 0

Sign in to comment.

Answer 2

Voss on 21 Dec 2023

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/2058309-replace-strings-with-a-different-string-and-save-the-data#answer_1375917

Open in MATLAB Online

graphene.txt

I assume that you want to replace the C's with the specified elements (e.g., Fe, Al, Ni) at random according to the specified proportions (e.g., 40, 30, 30).

% name of the original file:
filename = 'graphene.txt';
% name of the new file that will be created, with the new elements in the
% specified (approximate) proportions:
new_filename = 'graphene_modified.txt';
% elements to replace the "C"s with:
elements = ["Fe","Al","Ni"];
% (approximate) proportion of each element to use:
proportions = [40,30,30];
% show the original file's contents, for reference:
dbtype(filename)
   168
    generated by VMD
    C         0.000000        0.000000        0.000000
    C        -0.866025        0.500000        0.000000
    C        -0.866025        1.500000        0.000000
    C         0.000000        2.000000        0.000000
    C         1.732051        0.000000        0.000000
    C         0.866025        0.500000        0.000000
    C         0.866025        1.500000        0.000000
   C         1.732051        2.000000        0.000000
   C         3.464102        0.000000        0.000000
   C         2.598076        0.500000        0.000000
   C         2.598076        1.500000        0.000000
   C         3.464102        2.000000        0.000000
   C         5.196152        0.000000        0.000000
   C         4.330127        0.500000        0.000000
   C         4.330127        1.500000        0.000000
   C         5.196152        2.000000        0.000000
   C         6.928203        0.000000        0.000000
   C         6.062178        0.500000        0.000000
   C         6.062178        1.500000        0.000000
   C         6.928203        2.000000        0.000000
   C         8.660254        0.000000        0.000000
   C         7.794229        0.500000        0.000000
   C         7.794229        1.500000        0.000000
   C         8.660254        2.000000        0.000000
   C         0.000000        3.000000        0.000000
   C        -0.866025        3.500000        0.000000
   C        -0.866025        4.500000        0.000000
   C         0.000000        5.000000        0.000000
   C         1.732051        3.000000        0.000000
   C         0.866025        3.500000        0.000000
   C         0.866025        4.500000        0.000000
   C         1.732051        5.000000        0.000000
   C         3.464102        3.000000        0.000000
   C         2.598076        3.500000        0.000000
   C         2.598076        4.500000        0.000000
   C         3.464102        5.000000        0.000000
   C         5.196152        3.000000        0.000000
   C         4.330127        3.500000        0.000000
   C         4.330127        4.500000        0.000000
   C         5.196152        5.000000        0.000000
   C         6.928203        3.000000        0.000000
   C         6.062178        3.500000        0.000000
   C         6.062178        4.500000        0.000000
   C         6.928203        5.000000        0.000000
   C         8.660254        3.000000        0.000000
   C         7.794229        3.500000        0.000000
   C         7.794229        4.500000        0.000000
   C         8.660254        5.000000        0.000000
   C         0.000000        6.000000        0.000000
   C        -0.866025        6.500000        0.000000
   C        -0.866025        7.500000        0.000000
   C         0.000000        8.000000        0.000000
   C         1.732051        6.000000        0.000000
   C         0.866025        6.500000        0.000000
   C         0.866025        7.500000        0.000000
   C         1.732051        8.000000        0.000000
   C         3.464102        6.000000        0.000000
   C         2.598076        6.500000        0.000000
   C         2.598076        7.500000        0.000000
   C         3.464102        8.000000        0.000000
   C         5.196152        6.000000        0.000000
   C         4.330127        6.500000        0.000000
   C         4.330127        7.500000        0.000000
   C         5.196152        8.000000        0.000000
   C         6.928203        6.000000        0.000000
   C         6.062178        6.500000        0.000000
   C         6.062178        7.500000        0.000000
   C         6.928203        8.000000        0.000000
   C         8.660254        6.000000        0.000000
   C         7.794229        6.500000        0.000000
   C         7.794229        7.500000        0.000000
   C         8.660254        8.000000        0.000000
   C         0.000000        9.000000        0.000000
   C        -0.866025        9.500000        0.000000
   C        -0.866025       10.500000        0.000000
   C         0.000000       11.000000        0.000000
   C         1.732051        9.000000        0.000000
   C         0.866025        9.500000        0.000000
   C         0.866025       10.500000        0.000000
   C         1.732051       11.000000        0.000000
   C         3.464102        9.000000        0.000000
   C         2.598076        9.500000        0.000000
   C         2.598076       10.500000        0.000000
   C         3.464102       11.000000        0.000000
   C         5.196152        9.000000        0.000000
   C         4.330127        9.500000        0.000000
   C         4.330127       10.500000        0.000000
   C         5.196152       11.000000        0.000000
   C         6.928203        9.000000        0.000000
   C         6.062178        9.500000        0.000000
   C         6.062178       10.500000        0.000000
   C         6.928203       11.000000        0.000000
   C         8.660254        9.000000        0.000000
   C         7.794229        9.500000        0.000000
   C         7.794229       10.500000        0.000000
   C         8.660254       11.000000        0.000000
   C         0.000000       12.000000        0.000000
  C        -0.866025       12.500000        0.000000
  C        -0.866025       13.500000        0.000000
  C         0.000000       14.000000        0.000000
  C         1.732051       12.000000        0.000000
  C         0.866025       12.500000        0.000000
  C         0.866025       13.500000        0.000000
  C         1.732051       14.000000        0.000000
  C         3.464102       12.000000        0.000000
  C         2.598076       12.500000        0.000000
  C         2.598076       13.500000        0.000000
  C         3.464102       14.000000        0.000000
  C         5.196152       12.000000        0.000000
  C         4.330127       12.500000        0.000000
  C         4.330127       13.500000        0.000000
  C         5.196152       14.000000        0.000000
  C         6.928203       12.000000        0.000000
  C         6.062178       12.500000        0.000000
  C         6.062178       13.500000        0.000000
  C         6.928203       14.000000        0.000000
  C         8.660254       12.000000        0.000000
  C         7.794229       12.500000        0.000000
  C         7.794229       13.500000        0.000000
  C         8.660254       14.000000        0.000000
  C         0.000000       15.000000        0.000000
  C        -0.866025       15.500000        0.000000
  C        -0.866025       16.500000        0.000000
  C         0.000000       17.000000        0.000000
  C         1.732051       15.000000        0.000000
  C         0.866025       15.500000        0.000000
  C         0.866025       16.500000        0.000000
  C         1.732051       17.000000        0.000000
  C         3.464102       15.000000        0.000000
  C         2.598076       15.500000        0.000000
  C         2.598076       16.500000        0.000000
  C         3.464102       17.000000        0.000000
  C         5.196152       15.000000        0.000000
  C         4.330127       15.500000        0.000000
  C         4.330127       16.500000        0.000000
  C         5.196152       17.000000        0.000000
  C         6.928203       15.000000        0.000000
  C         6.062178       15.500000        0.000000
  C         6.062178       16.500000        0.000000
  C         6.928203       17.000000        0.000000
  C         8.660254       15.000000        0.000000
  C         7.794229       15.500000        0.000000
  C         7.794229       16.500000        0.000000
  C         8.660254       17.000000        0.000000
  C         0.000000       18.000000        0.000000
  C        -0.866025       18.500000        0.000000
  C        -0.866025       19.500000        0.000000
  C         0.000000       20.000000        0.000000
  C         1.732051       18.000000        0.000000
  C         0.866025       18.500000        0.000000
  C         0.866025       19.500000        0.000000
  C         1.732051       20.000000        0.000000
  C         3.464102       18.000000        0.000000
  C         2.598076       18.500000        0.000000
  C         2.598076       19.500000        0.000000
  C         3.464102       20.000000        0.000000
  C         5.196152       18.000000        0.000000
  C         4.330127       18.500000        0.000000
  C         4.330127       19.500000        0.000000
  C         5.196152       20.000000        0.000000
  C         6.928203       18.000000        0.000000
  C         6.062178       18.500000        0.000000
  C         6.062178       19.500000        0.000000
  C         6.928203       20.000000        0.000000
  C         8.660254       18.000000        0.000000
  C         7.794229       18.500000        0.000000
  C         7.794229       19.500000        0.000000
  C         8.660254       20.000000        0.000000
% read the file into string array L, each element of which is a line of
% text from the file:
L = readlines(filename);
% fidx: indices of the lines that start with " C":
fidx = find(startsWith(L," C"));
% total number of lines that will be altered:
N_lines = numel(fidx);
% number of lines that will be altered, for each element:
N = round(proportions/sum(proportions)*N_lines);
% enforce that sum(N) == N_lines:
N(end) = N_lines-sum(N(1:end-1));
% permute fidx to get a random replacement order:
fidx = fidx(randperm(N_lines));
% get the start and end index (in fidx) for each element:
temp = cumsum([0 N]);
s_idx = temp(1:end-1)+1;
e_idx = temp(2:end);
% loop over elements and replace the "C"s:
for ii = 1:numel(elements)
    
    % indices of the random lines of text that will be changed to
    % element(ii):
    idx = fidx(s_idx(ii):e_idx(ii));
    
    % replace "C" with element(ii) in those lines:
    L(idx) = replace(L(idx),"C",elements(ii));
    
end
% write the result to a new file:
writelines(L,new_filename)
% verification: check the new file's contents:
dbtype(new_filename)
   168
    generated by VMD
    Ni         0.000000        0.000000        0.000000
    Al        -0.866025        0.500000        0.000000
    Al        -0.866025        1.500000        0.000000
    Al         0.000000        2.000000        0.000000
    Ni         1.732051        0.000000        0.000000
    Ni         0.866025        0.500000        0.000000
    Al         0.866025        1.500000        0.000000
   Fe         1.732051        2.000000        0.000000
   Fe         3.464102        0.000000        0.000000
   Fe         2.598076        0.500000        0.000000
   Ni         2.598076        1.500000        0.000000
   Al         3.464102        2.000000        0.000000
   Ni         5.196152        0.000000        0.000000
   Fe         4.330127        0.500000        0.000000
   Fe         4.330127        1.500000        0.000000
   Fe         5.196152        2.000000        0.000000
   Fe         6.928203        0.000000        0.000000
   Al         6.062178        0.500000        0.000000
   Fe         6.062178        1.500000        0.000000
   Ni         6.928203        2.000000        0.000000
   Ni         8.660254        0.000000        0.000000
   Al         7.794229        0.500000        0.000000
   Ni         7.794229        1.500000        0.000000
   Fe         8.660254        2.000000        0.000000
   Al         0.000000        3.000000        0.000000
   Ni        -0.866025        3.500000        0.000000
   Fe        -0.866025        4.500000        0.000000
   Fe         0.000000        5.000000        0.000000
   Fe         1.732051        3.000000        0.000000
   Al         0.866025        3.500000        0.000000
   Fe         0.866025        4.500000        0.000000
   Fe         1.732051        5.000000        0.000000
   Fe         3.464102        3.000000        0.000000
   Fe         2.598076        3.500000        0.000000
   Ni         2.598076        4.500000        0.000000
   Al         3.464102        5.000000        0.000000
   Al         5.196152        3.000000        0.000000
   Fe         4.330127        3.500000        0.000000
   Al         4.330127        4.500000        0.000000
   Ni         5.196152        5.000000        0.000000
   Al         6.928203        3.000000        0.000000
   Al         6.062178        3.500000        0.000000
   Ni         6.062178        4.500000        0.000000
   Fe         6.928203        5.000000        0.000000
   Fe         8.660254        3.000000        0.000000
   Al         7.794229        3.500000        0.000000
   Fe         7.794229        4.500000        0.000000
   Ni         8.660254        5.000000        0.000000
   Ni         0.000000        6.000000        0.000000
   Fe        -0.866025        6.500000        0.000000
   Fe        -0.866025        7.500000        0.000000
   Ni         0.000000        8.000000        0.000000
   Ni         1.732051        6.000000        0.000000
   Fe         0.866025        6.500000        0.000000
   Ni         0.866025        7.500000        0.000000
   Fe         1.732051        8.000000        0.000000
   Fe         3.464102        6.000000        0.000000
   Fe         2.598076        6.500000        0.000000
   Fe         2.598076        7.500000        0.000000
   Al         3.464102        8.000000        0.000000
   Ni         5.196152        6.000000        0.000000
   Fe         4.330127        6.500000        0.000000
   Fe         4.330127        7.500000        0.000000
   Ni         5.196152        8.000000        0.000000
   Fe         6.928203        6.000000        0.000000
   Fe         6.062178        6.500000        0.000000
   Fe         6.062178        7.500000        0.000000
   Al         6.928203        8.000000        0.000000
   Ni         8.660254        6.000000        0.000000
   Fe         7.794229        6.500000        0.000000
   Al         7.794229        7.500000        0.000000
   Ni         8.660254        8.000000        0.000000
   Fe         0.000000        9.000000        0.000000
   Ni        -0.866025        9.500000        0.000000
   Al        -0.866025       10.500000        0.000000
   Al         0.000000       11.000000        0.000000
   Al         1.732051        9.000000        0.000000
   Al         0.866025        9.500000        0.000000
   Ni         0.866025       10.500000        0.000000
   Ni         1.732051       11.000000        0.000000
   Al         3.464102        9.000000        0.000000
   Ni         2.598076        9.500000        0.000000
   Fe         2.598076       10.500000        0.000000
   Ni         3.464102       11.000000        0.000000
   Al         5.196152        9.000000        0.000000
   Fe         4.330127        9.500000        0.000000
   Fe         4.330127       10.500000        0.000000
   Ni         5.196152       11.000000        0.000000
   Al         6.928203        9.000000        0.000000
   Fe         6.062178        9.500000        0.000000
   Fe         6.062178       10.500000        0.000000
   Ni         6.928203       11.000000        0.000000
   Fe         8.660254        9.000000        0.000000
   Fe         7.794229        9.500000        0.000000
   Fe         7.794229       10.500000        0.000000
   Al         8.660254       11.000000        0.000000
   Ni         0.000000       12.000000        0.000000
  Ni        -0.866025       12.500000        0.000000
  Al        -0.866025       13.500000        0.000000
  Fe         0.000000       14.000000        0.000000
  Fe         1.732051       12.000000        0.000000
  Al         0.866025       12.500000        0.000000
  Al         0.866025       13.500000        0.000000
  Al         1.732051       14.000000        0.000000
  Ni         3.464102       12.000000        0.000000
  Ni         2.598076       12.500000        0.000000
  Al         2.598076       13.500000        0.000000
  Fe         3.464102       14.000000        0.000000
  Fe         5.196152       12.000000        0.000000
  Al         4.330127       12.500000        0.000000
  Ni         4.330127       13.500000        0.000000
  Al         5.196152       14.000000        0.000000
  Ni         6.928203       12.000000        0.000000
  Al         6.062178       12.500000        0.000000
  Fe         6.062178       13.500000        0.000000
  Ni         6.928203       14.000000        0.000000
  Al         8.660254       12.000000        0.000000
  Fe         7.794229       12.500000        0.000000
  Ni         7.794229       13.500000        0.000000
  Fe         8.660254       14.000000        0.000000
  Fe         0.000000       15.000000        0.000000
  Al        -0.866025       15.500000        0.000000
  Al        -0.866025       16.500000        0.000000
  Al         0.000000       17.000000        0.000000
  Ni         1.732051       15.000000        0.000000
  Fe         0.866025       15.500000        0.000000
  Al         0.866025       16.500000        0.000000
  Al         1.732051       17.000000        0.000000
  Al         3.464102       15.000000        0.000000
  Fe         2.598076       15.500000        0.000000
  Al         2.598076       16.500000        0.000000
  Fe         3.464102       17.000000        0.000000
  Al         5.196152       15.000000        0.000000
  Fe         4.330127       15.500000        0.000000
  Al         4.330127       16.500000        0.000000
  Al         5.196152       17.000000        0.000000
  Ni         6.928203       15.000000        0.000000
  Al         6.062178       15.500000        0.000000
  Fe         6.062178       16.500000        0.000000
  Fe         6.928203       17.000000        0.000000
  Fe         8.660254       15.000000        0.000000
  Ni         7.794229       15.500000        0.000000
  Fe         7.794229       16.500000        0.000000
  Fe         8.660254       17.000000        0.000000
  Ni         0.000000       18.000000        0.000000
  Ni        -0.866025       18.500000        0.000000
  Ni        -0.866025       19.500000        0.000000
  Ni         0.000000       20.000000        0.000000
  Fe         1.732051       18.000000        0.000000
  Ni         0.866025       18.500000        0.000000
  Ni         0.866025       19.500000        0.000000
  Fe         1.732051       20.000000        0.000000
  Ni         3.464102       18.000000        0.000000
  Fe         2.598076       18.500000        0.000000
  Al         2.598076       19.500000        0.000000
  Fe         3.464102       20.000000        0.000000
  Al         5.196152       18.000000        0.000000
  Al         4.330127       18.500000        0.000000
  Fe         4.330127       19.500000        0.000000
  Ni         5.196152       20.000000        0.000000
  Ni         6.928203       18.000000        0.000000
  Fe         6.062178       18.500000        0.000000
  Al         6.062178       19.500000        0.000000
  Fe         6.928203       20.000000        0.000000
  Ni         8.660254       18.000000        0.000000
  Ni         7.794229       18.500000        0.000000
  Ni         7.794229       19.500000        0.000000
  Fe         8.660254       20.000000        0.000000
% verification: read the new file and verify that each element starts the
% expected number of lines:
new_L = readlines(new_filename);
for ii = 1:numel(elements)
    assert(nnz(startsWith(new_L," "+elements(ii))) == N(ii), ...
        sprintf('Unexpected number of "%s" lines',elements(ii)));
end

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Replace strings with a different string and save the data

0 Comments
Show -2 older commentsHide -2 older comments

Answers (2)

2 Comments
Show NoneHide None

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

Replace strings with a different string and save the data

0 Comments Show -2 older commentsHide -2 older comments

Answers (2)

2 Comments Show NoneHide None

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

2 Comments
Show NoneHide None

0 Comments
Show -2 older commentsHide -2 older comments