removing parentheses around digits using regular expressions
Show older comments
Dear all, I am slowly making progress on my learning of regular expressions. At the moment, I am trying to solve the following problem: replace all occurrences of (n) with n, where n is a number, provided that no alphabetical letter occurs before the first parenthesis. As an example,
str='(2)+p_5*(3)-(0.3)'
would become
2+p_5*3-0.3
I wrote the following
regexprep(str,'(\W)(\()([.012345789]+)(\))','$1$3')
but it does not solves the problem if one of the expressions to change occurs at the beginning as in the example above. More concretely, the answer I get from running this is
(2)+p_5*3-0.3
which is not the expected result.
Thanks in advance for any help
Pat.
Accepted Answer
More Answers (2)
Walter Roberson
on 5 Oct 2012
0 votes
Consider using a "look-behind"
per isakson
on 5 Oct 2012
Edited: per isakson
on 5 Oct 2012
>> regexprep( str, '\(([\d.]+)\)', '$1' )
ans =
2+p_5*3-0.3
str =
(2)+p_5*(3)-(0.3)+exp(3)
>> regexprep( str, '\(([\d.]+)\)', '$1' )
ans =
2+p_5*3-0.3+exp3
- *\(* represents "("
- \) represents ")"
- [\d.]+ represents one or more digits and periods, e.g. "....0" and "2"
- (expr) "Group regular expressions and capture tokens." The token may be refered to in the replacement string by $1 - "1" because it is the first
Every substring in the string that matches this expression is replaced, i.e. a number enclosed by parentheses is replaced by the number.
.
--- in response to a comment ---
>> regexprep( str, '(?<!\w)\(([\d.]+)\)', '$1' )
ans =
2+p_5*3-0.3+exp(3)
better
>> regexprep( str, '(?<![a-zA-Z])\(([\d.]+)\)', '$1' )
because \w includes digits.
- (?<![a-zA-Z]) "Look behind from current position and test if expr is not found." Where expr evaluates to a letter. Thus, if preceded by a letter there is no match.
7 Comments
Patrick Mboma
on 5 Oct 2012
per isakson
on 5 Oct 2012
Edited: per isakson
on 5 Oct 2012
Yes, it does. What should it give? OK preceded by letter
per isakson
on 5 Oct 2012
What about atan2()?
Matt Fig
on 5 Oct 2012
The solution I posted in the comments to my answer handles atan2.
str='(2)+p_5*(3)-(0.3)+cos(5.7)+(3)^(2-3) + (.55) + atan2(9)';
regexprep(str,'(?<![\w])(\()([\d*\.]+)(\))','$2')
ans =
2+p_5*3-0.3+cos(5.7)+3^(2-3) + .55 + atan2(9)
per isakson
on 5 Oct 2012
Edited: per isakson
on 5 Oct 2012
Yes, and that is because "\w" stands for "[A-Za-z0-9]". However, it is difficult to know whether including "0-9" might have any unintended side effects.
I find it difficult to construct robust expressions. However, I have never tried to learn regular expressions in a systematic way.
Patrick Mboma
on 5 Oct 2012
per isakson
on 6 Oct 2012
Edited: per isakson
on 6 Oct 2012
Categories
Find more on Programmatic Model Editing in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!