git.postgresql.org Git - users/simon/postgres.git/commit

projects / users / simon / postgres.git / commit

summary | shortlog | log | commit | commitdiff | tree
(parent: 77ad0ba) | patch

author	Tom Lane <tgl@sss.pgh.pa.us>
	Sun, 16 Mar 2008 23:15:08 +0000 (23:15 +0000)
committer	Tom Lane <tgl@sss.pgh.pa.us>
	Sun, 16 Mar 2008 23:15:08 +0000 (23:15 +0000)
commit	907f1c9888654666bd1488b74049854510a59430
tree	83f0064c7ad0ee8c837b39851a16d60cbbe1329e	tree
parent	77ad0ba6ad782c10e8010395711b8fcd3f7f9a61	commit \| diff

When creating a large hash index, pre-sort the index entries by estimated
bucket number, so as to ensure locality of access to the index during the
insertion step. Without this, building an index significantly larger than
available RAM takes a very long time because of thrashing. On the other
hand, sorting is just useless overhead when the index does fit in RAM.
We choose to sort when the initial index size exceeds effective_cache_size.

This is a revised version of work by Tom Raney and Shreya Bhargava.

src/backend/access/hash/Makefile		diff \| blob \| blame \| history
src/backend/access/hash/hash.c		diff \| blob \| blame \| history
src/backend/access/hash/hashpage.c		diff \| blob \| blame \| history
src/backend/access/hash/hashsort.c	[new file with mode: 0644]	blob
src/backend/access/nbtree/nbtsort.c		diff \| blob \| blame \| history
src/backend/utils/sort/tuplesort.c		diff \| blob \| blame \| history
src/include/access/hash.h		diff \| blob \| blame \| history
src/include/utils/tuplesort.h		diff \| blob \| blame \| history

Simon's dev repository