Remove accent from struct
5 views (last 30 days)
Show older comments
I have this info_circuitos.Name_circuito. I need to shearch in a name. but the struct have accents. a e i o u OR A E I O U. Could have accent in any of that letter.
I want to create a new info_circuitos.Name_circuito without accent Help pls.
Thanks
2 Comments
Accepted Answer
sixwwwwww
on 9 Dec 2013
Edited: sixwwwwww
on 9 Dec 2013
you can do it like this:
names = {files.name};
str = {'Á', 'É', 'Í', 'Ó', 'Ú', 'á', 'é', 'í', 'ó', 'ú'};
strreplace = {'A', 'E', 'I', 'O', 'U', 'a', 'e', 'i', 'o', 'u'};
for j = 1:length(names)
for i = 1:numel(str)
names{j} = regexprep(names{j}, str{i}, strreplace{i});
end
end
Good luck!
14 Comments
Walter Roberson
on 9 Dec 2013
This code does not change the structure, just finds the equivalent strings.
More Answers (1)
Walter Roberson
on 9 Dec 2013
Edited: Walter Roberson
on 9 Dec 2013
Assuming ISO-8859-1 rather than full Unicode:
This code is untested.
function new_structure = remove_accents_from_struct(old_structure)
% Note: this code is only designed for size(structure) = 1 x 1
% and does not descend recursively into substructures.
old_fields = fieldnames(old_structure);
new_fields = cellfun( @remove_accents_from_string, old_fields, 'Uniform', 0);
if isequal(old_fields, new_fields) %optimization when there are no accents
new_structure = old_structure;
return
end
num_fields = length(old_fields);
% create empty structure populated with the translated field names but not their contents
new_structure = cell2struct( cell( num_fields, 1 ), new_fields );
% now copy the field contents over
for K = 1 : num_fields
new_structure.(new_fields{K}) = new_structure.(old_fields{K});
end
end
function newS = remove_accents_from_string(S)
% table extracted from % http://core.trac.wordpress.org/browser/tags/3.7.1/src/wp-includes/formatting.php
accented_chars = [ ...
char(128), char(131), char(138), char(142), char(154), ...
char(158), char(159), char(162), char(165), char(181), ...
char(192), char(193), char(194), char(195), char(196), ...
char(197), char(199), char(200), char(201), char(202), ...
char(203), char(204), char(205), char(206), char(207), ...
char(209), char(210), char(211), char(212), char(213), ...
char(214), char(216), char(217), char(218), char(219), ...
char(220), char(221), char(224), char(225), char(226), ...
char(227), char(228), char(229), char(231), char(232), ...
char(233), char(234), char(235), char(236), char(237), ...
char(238), char(239), char(241), char(242), char(243), ...
char(244), char(245), char(246), char(248), char(249), ...
char(250), char(251), char(252), char(253), char(255) ];
unaccented_chars = 'EfSZszYcYuAAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy';
newS = S;
[tf, idx] = ismember(S, accented_chars);
newS(tf) = unaccented_chars(idx);
end
2 Comments
Walter Roberson
on 9 Dec 2013
You said you could have an accent on any letter, so I could not assume that you only want to deal with grave accents on English vowels.
Most of the rest of the length is comments or efficiency.
See Also
Categories
Find more on Characters and Strings in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!