Fix creation of partition descriptor during concurrent detach+drop
authorAlvaro Herrera <alvherre@alvh.no-ip.org>
Mon, 12 Aug 2024 22:17:56 +0000 (18:17 -0400)
committerAlvaro Herrera <alvherre@alvh.no-ip.org>
Mon, 12 Aug 2024 22:17:56 +0000 (18:17 -0400)
commit0820f80622ed415a88d2c79b04d292ae79023f50
treee0f2913c303cd140519b2318a7efc7c6546f5e2a
parente57296ed4867b3a3734db9ca621223c30eebb90d
Fix creation of partition descriptor during concurrent detach+drop

If a partition undergoes DETACH CONCURRENTLY immediately followed by
DROP, this could cause a problem for a concurrent transaction
recomputing the partition descriptor when running a prepared statement,
because it tries to dereference a pointer to a tuple that's not found in
a catalog scan.

The existing retry logic added in commit dbca3469ebf8 is sufficient to
cope with the overall problem, provided we don't try to dereference a
non-existant heap tuple.

Arguably, the code in RelationBuildPartitionDesc() has been wrong all
along, since no check was added in commit 898e5e3290a7 against receiving
a NULL tuple from the catalog scan; that bug has only become
user-visible with DETACH CONCURRENTLY which was added in branch 14.
Therefore, even though there's no known mechanism to cause a crash
because of this, backpatch the addition of such a check to all supported
branches.  In branches prior to 14, this would cause the code to fail
with a "missing relpartbound for relation XYZ" error instead of
crashing; that's okay, because there are no reports of such behavior
anyway.

Author: Kuntal Ghosh <kuntalghosh.2007@gmail.com>
Reviewed-by: Junwang Zhao <zhjwpku@gmail.com>
Reviewed-by: Tender Wang <tndrwang@gmail.com>
Discussion: https://postgr.es/m/18559-b48286d2eacd9a4e@postgresql.org
src/backend/partitioning/partdesc.c