How do I match nested parenthesis (brackets, or braces) with dynamic regular expressions?

50 views (last 30 days)
One can read all over the web how it is impossible to use regular expressions to match nexted parenthesis. However MATLAB has this cool feature called 'dynamic regular expressions' that allow one to insert some MATLAB code to do all kinds of special 'gymnastics'. Is there a way to use this feature to count instances of parenthesis and, in turn, find their matches? Consider the following string:
g = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5';
or
g = 'asdf (( (dwer ) e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5';
or
g = 'asdf ( dwer e: )asd fg ( qwe 4 dfy5 57) q34 dqa5';
Specifically, my need is only to match the first left parenthesis with its partner but one would think the more general solution of matching all sets of parenthesis is feasible with dynamic regular expressions. If anyone can help with this, it would be much appreciated.

Accepted Answer

Stephen23
Stephen23 on 1 Apr 2020
Edited: Stephen23 on 1 Apr 2020
This matches the outer-most matched pair of parentheses:
>> str = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5';
>> fun = @(s)sprintf('.{%d}',find(cumsum((s==')' )-(s=='('))>0,1,'first'));
>> out = regexp(str,'\((??@fun($''))','match')
out =
'(( dwer e: ( asdedsdskek))::)' '( qwe 4 dfy5 57)'

More Answers (2)

Walter Roberson
Walter Roberson on 17 Mar 2014
It might be possible, but it will not be easy.
The regular expressions supported by MATLAB are very similar to the regular expressions supported by Perl.
Here is one way to use Perl just to count to see if parens are matched:
In pattern matching in Perl in which you are trying to balance pairs, see
and the (?PARNO) construct described at
The (?PARNO) and recurse constructs are not supported by MATLAB.
You just might be able to use the dynamic expressions to invoke a function that names itself inside of a dynamic expression, thus achieving recursion.
Warning: you will spend a lot of time getting it right. It would be much easier to write some code that did the analysis then to try to use regular expressions for it.
  1 Comment
Dan
Dan on 18 Mar 2014
Thanks, Walter ... I ended up pulling in the string and writing a little MATLAB subroutine to do the task so I'm not motivated to research your references. Hopefully someone else can use the information in the future. Dan

Sign in to comment.


Daniel Renjewski
Daniel Renjewski on 15 Mar 2023
Edited: Daniel Renjewski on 15 Mar 2023
I have got a similar problem as I wanted to identify fractions in the string of an equation to replace it with proper latex code. The following function gives you the position of all pairs of open and closing brackets with their respective position in the string, assuming there are indeed only pairs.
str = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5'
str = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5'
br = detect_brackets(str)
br = 4×2
17 30 7 31 6 34 42 57
for idx = 1:size(br,1)
display(str(br(idx,1):br(idx,2)))
end
( asdedsdskek) ( dwer e: ( asdedsdskek)) (( dwer e: ( asdedsdskek))::) ( qwe 4 dfy5 57)
function [oc] = detect_brackets(str)
oc = [];
% find all opening and closing brackets in the string
op=strfind(str,'(');
cl=strfind(str,')');
% search for pairs until all are identified
while ~isempty(op | cl)
% find opening bracket for first closing bracket
idx = find(op < cl(1),1,'last');
% append this pair to function output
oc = [oc;op(idx) cl(1)];
% remove found opening bracket from vector
op(idx) = [];
% remove found closing bracket from vector
cl(1) = [];
end
end

Products

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!