bug/231/fix regexp restore paren handling #244

mtlewis · 2016-10-20T23:02:42Z

Fixes #231.

Inline comments on the code explain the related issues. The test I added fails on master and passes with the changes in this branch (though I haven't included the updated build in the PR).

mtlewis · 2016-10-20T23:04:25Z

src/util.js

-            lm = regExpCache.lastMatch.replace(esc, '\\$&'),
-            reg = new List();
+            lastMatch = regExpCache.lastMatch.replace(esc, '\\$&'),
+            exprStr = '';


Not strictly necessary, but I made a couple of changes for readability. One is removing the reg variable and building up exprStr from the start of the function. I think this makes it a little clearer that we're processing lastMatch and adding the processed parts to exprStr bit by bit.

mtlewis · 2016-10-20T23:07:03Z

src/util.js

                }
-
-                // Push it to the reg and chop lm to make sure further groups come after
-                arrPush.call(reg, lm.slice(0, lm.indexOf('(') + 1));


First significant problem with the previous implementation: lm.indexOf('(') is meant to find the paren we added on line 170, but it can also match an escaped paren that occurs earlier in the string.
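To make the failure mode concrete, here is a minimal sketch, not taken from src/util.js. The input string and the helper indexOfUnescapedParen are hypothetical and only illustrate why a plain indexOf('(') is unreliable once the string contains escaped parens.

```javascript
// Hypothetical input: text that has already been escaped, so a literal "("
// in the original match now appears as "\(".
const escaped = 'f\\(x\\) = ';       // the characters: f\(x\) =
const withGroup = escaped + '(42)';   // a capturing group appended afterwards

// indexOf cannot tell an escaped paren from a group-opening paren,
// so it reports the paren inside "\(" rather than the group.
const naiveIndex = withGroup.indexOf('(');

// Hypothetical helper: return the index of the first "(" that is not escaped.
function indexOfUnescapedParen(str) {
    for (let i = 0; i < str.length; i++) {
        if (str[i] === '\\') {
            i++;        // skip the character this backslash escapes
            continue;
        }
        if (str[i] === '(') {
            return i;
        }
    }
    return -1;
}

console.log(naiveIndex);                        // 2
console.log(indexOfUnescapedParen(withGroup));  // 9
```

Slicing at index 2 would cut the string in the middle of the escaped paren, which is the kind of breakage described above.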

mtlewis · 2016-10-20T23:15:09Z

src/util.js


        // Shorten the regex by replacing each part of the expression with a match
        // for a string of that exact length.  This is safe for the type of
        // expressions generated above, because the expression matches the whole
        // match string, so we know each group and each segment between capturing
        // groups can be matched by its length alone.
-        exprStr = exprStr.replace(/(\\\(|\\\)|[^()])+/g, (match) => {


Second significant problem with the code. This second operation on exprStr takes a series of characters other than unescaped parentheses and replaces it with [\s\S]{length}. This allows our RegEx to work with much longer input strings. However, the prior version of this regex does not handle _un_escaped parentheses that come immediately after escaped backslashes. The new version is much more complicated, but accounts for sequences of escaped and unescaped backslashes and parentheses.
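A small reproduction may help here. The old expression below is copied from the diff. Everything else is an assumption for illustration: pairAwarePattern is not the expression introduced by this PR, compact is a hypothetical stand-in for the replace callback, and computing the length after stripping escapes is my guess at what the callback should count.

```javascript
// The expression removed in this PR, copied from the diff above.
const oldPattern = /(\\\(|\\\)|[^()])+/g;

// Assumed alternative, for illustration only (not the expression from this PR):
// consume each backslash together with the character it escapes, so an
// escaped backslash can never swallow the paren that follows it.
const pairAwarePattern = /(?:\\[\s\S]|[^()\\])+/g;

// Hypothetical helper mirroring the compaction step described above.
function compact(exprStr, pattern) {
    return exprStr.replace(pattern, (match) => {
        // Count the characters the segment matches, not the escaping backslashes.
        const length = match.replace(/\\([\s\S])/g, '$1').length;
        return `[\\s\\S]{${length}}`;
    });
}

// Regex source for: "a", one literal backslash, then a group containing "b".
const exprStr = 'a\\\\(b)';

const oldResult = compact(exprStr, oldPattern);        // [\s\S]{4})
const newResult = compact(exprStr, pairAwarePattern);  // [\s\S]{2}([\s\S]{1})
```

With the old expression, the second backslash and the opening paren are read as an escaped paren, so the group's opening paren disappears and new RegExp(oldResult) throws on the unmatched ")". The pair-aware version keeps both parens, and the resulting expression still captures "b" from the three-character input a\b.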

Looking at it now, I think the whole thing will be much more maintainable if we stop using a RegEx here and instead iterate over the characters in exprStr and check for escaped characters and match groups character-by-character. Happy to work on that tomorrow under this PR or a separate one post-merge.

@caridy can I suggest we merge this as-is and circle back on the removal of this regex?

@caridy: Just had a play around with implementing this bit of functionality via iteration rather than the existing regex-based approach. You can see the changes here. The iteration-based approach didn't turn out to be as clean as I was hoping it would be, so I'd propose we leave the regex-based approach in place.
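For readers without access to the linked changes, the following is a rough sketch of what a character-by-character version could look like. It is not the code from that link; compactByIteration is a hypothetical name, and the sketch assumes exprStr contains only literal characters, backslash escapes, and capturing-group parens.

```javascript
// Hypothetical iteration-based compaction; not the code linked above.
function compactByIteration(exprStr) {
    let out = '';
    let run = 0; // number of input characters matched by the current segment

    // Emit the pending segment, if any, as a fixed-length wildcard.
    const flush = () => {
        if (run > 0) {
            out += `[\\s\\S]{${run}}`;
            run = 0;
        }
    };

    for (let i = 0; i < exprStr.length; i++) {
        const ch = exprStr[i];
        if (ch === '\\') {
            // A backslash plus the character it escapes matches one input character.
            i++;
            run++;
        } else if (ch === '(' || ch === ')') {
            // An unescaped paren is a group boundary: keep it verbatim.
            flush();
            out += ch;
        } else {
            run++;
        }
    }
    flush();
    return out;
}

console.log(compactByIteration('a\\\\(b)')); // [\s\S]{2}([\s\S]{1})
```

The loop needs explicit state (the running length and the flush step), which is one plausible reason the iterative version ends up no cleaner than a single replace call.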


caridy · 2016-11-11T19:31:40Z

@mtlewis let me know when this is ready for review.

mtlewis · 2016-11-11T19:34:09Z

@caridy hey! It's ready from my perspective 👍

caridy · 2016-11-16T13:46:49Z

thanks @mtlewis, this is looking good!

mtlewis mentioned this pull request Oct 20, 2016

createRegExpRestore explodes with an unmatched parenthesis #231

Closed

(bug andyearnshaw#231) Initial fix for escaped paren related regex re…

06012a9

…storation issues

mtlewis force-pushed the bug/231/fix-regexp-restore-paren-handling branch 2 times, most recently from 74edeca to d150c95 Compare October 21, 2016 09:44

(bug andyearnshaw#231) Simplifications and improvements to createRege…

f91377b

…xpRestore

mtlewis force-pushed the bug/231/fix-regexp-restore-paren-handling branch from d150c95 to f91377b Compare October 21, 2016 13:06

(andyearnshaw#231) simplify regex compacting expression in createRege…

c3ad4d4

…xRestore Previously this expression contained a fragment similar to /aa*/ - have simplified this to /a+/.

vicb mentioned this pull request Oct 29, 2016

Issue with Regexp on mobile safari 7 (DatePipe error on Browser Stack) angular/angular#12597

Closed

caridy merged commit c3ad4d4 into andyearnshaw:master Nov 16, 2016

caridy added bug enhancement labels Nov 16, 2016

This was referenced Nov 21, 2016

Invalid regular expression #257

Closed

Intl.js with polyfill not working on Safari #256

Open

ghostd mentioned this pull request Sep 14, 2018

Release a new version! #306

Open
