Fix performance bug in regexp's citerdissect/creviterdissect.
authorTom Lane <tgl@sss.pgh.pa.us>
Fri, 20 Aug 2021 18:19:04 +0000 (14:19 -0400)
committerTom Lane <tgl@sss.pgh.pa.us>
Fri, 20 Aug 2021 18:19:04 +0000 (14:19 -0400)
commitb30f7f399ea87f0621dfe900547cda401f8da762
tree0672b9f12e842a096c725a82ff57346b5917e484
parent7fa367d96bb1ae25f422d6c2a78424f0f9227b5a
Fix performance bug in regexp's citerdissect/creviterdissect.

After detecting a sub-match "dissect" failure (i.e., a backref match
failure) in the i'th sub-match of an iteration node, we should proceed
by adjusting the attempted length of the i'th submatch.  As coded,
though, these functions changed the attempted length of the *last*
sub-match, and only after exhausting all possibilities for that would
they back up to adjust the next-to-last sub-match, and then the
second-from-last, etc; all of which is wasted effort, since only
changing the start or length of the i'th sub-match can possibly make
it succeed.  This oversight creates the possibility for exponentially
bad performance.  Fortunately the problem is masked in most cases by
optimizations or constraints applied elsewhere; which explains why
we'd not noticed it before.  But it is possible to reach the problem
with fairly simple, if contrived, regexps.

Oversight in my commit 173e29aa5.  That's pretty ancient now,
so back-patch to all supported branches.

Discussion: https://postgr.es/m/1808998.1629412269@sss.pgh.pa.us
src/backend/regex/regexec.c