Commits · 608195a3a3656145a7eec7a47d903bc684011d73 · Greenplum / Gpdb

03 Dec, 2008 1 commit

Introduce visibility map. The visibility map is a bitmap with one bit per · 608195a3

Committed by Heikki Linnakangas on Dec 03, 2008

heap page, where a set bit indicates that all tuples on the page are
visible to all transactions, and the page therefore doesn't need
vacuuming. It is stored in a new relation fork.

Lazy vacuum uses the visibility map to skip pages that don't need
vacuuming. Vacuum is also responsible for setting the bits in the map.
In the future, this can hopefully be used to implement index-only-scans,
but we can't currently guarantee that the visibility map is always 100%
up-to-date.

In addition to the visibility map, there's a new PD_ALL_VISIBLE flag on
each heap page, also indicating that all tuples on the page are visible to
all transactions. It's important that this flag is kept up-to-date. It
is also used to skip visibility tests in sequential scans, which gives a
small performance gain on seqscans.
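
The bitmap and the way lazy vacuum consults it can be modeled in a few lines. This is a minimal Python sketch, not the PostgreSQL C implementation: the names `VisibilityMap` and `lazy_vacuum` are hypothetical, tuples are reduced to the strings "live" and "dead", and the sketch assumes every surviving tuple is visible to all transactions.

```python
class VisibilityMap:
    """Toy bitmap: one bit per heap page; a set bit means the page is all-visible."""

    def __init__(self, n_pages):
        self.bits = bytearray((n_pages + 7) // 8)

    def set(self, page):
        self.bits[page // 8] |= 1 << (page % 8)

    def clear(self, page):
        # Anything that modifies a page must clear its bit again.
        self.bits[page // 8] &= ~(1 << (page % 8)) & 0xFF

    def test(self, page):
        return bool(self.bits[page // 8] & (1 << (page % 8)))


def lazy_vacuum(heap, vm):
    """heap is a list of pages; each page is a list of "live"/"dead" strings.

    Returns the page numbers that were actually scanned.
    """
    scanned = []
    for pageno, page in enumerate(heap):
        if vm.test(pageno):
            continue  # all-visible page: nothing to vacuum, skip it
        scanned.append(pageno)
        page[:] = [t for t in page if t != "dead"]  # remove dead tuples
        vm.set(pageno)  # assumption: the survivors are visible to everyone
    return scanned
```

A second vacuum pass over an unmodified table scans nothing, which is the saving the commit message describes.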

19 Nov, 2008 1 commit

Rethink the way FSM truncation works. Instead of WAL-logging FSM · 33960006

Committed by Heikki Linnakangas on Nov 19, 2008

truncations in FSM code, call FreeSpaceMapTruncateRel from smgr_redo. To
make that cleaner from a modularity point of view, move the WAL-logging one
level up to RelationTruncate, and move RelationTruncate and all the
related WAL-logging to new src/backend/catalog/storage.c file. Introduce
new RelationCreateStorage and RelationDropStorage functions that are used
instead of calling smgrcreate/smgrscheduleunlink directly. Move the
pending rel deletion stuff from smgrcreate/smgrscheduleunlink to the new
functions. This leaves smgr.c as a thin wrapper around md.c; all the
transactional stuff is now in storage.c.

This will make it easier to add new forks with similar truncation logic,
like the visibility map.

10 Nov, 2008 1 commit

Make relhasrules and relhastriggers work like relhasindex, namely we let · c5451c22

Committed by Tom Lane on Nov 10, 2008

VACUUM reset them to false rather than trying to clean 'em up during DROP.

31 Oct, 2008 1 commit

Unite ReadBufferWithFork, ReadBufferWithStrategy, and ZeroOrReadBuffer · 19c8dc83

Committed by Heikki Linnakangas on Oct 31, 2008

functions into one ReadBufferExtended function that takes the strategy
and mode as arguments. There are three modes: RBM_NORMAL, which is the default
used by plain ReadBuffer(); RBM_ZERO, which replaces ZeroOrReadBuffer; and
a new mode, RBM_ZERO_ON_ERROR, which allows callers to read corrupt pages
without throwing an error. The FSM needs the new mode to recover from
corrupt pages, which could happen if we crash after extending an FSM file
and the new page is "torn".

Add fork number to some error messages in bufmgr.c, that still lacked it.
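
The three read modes named in this commit can be illustrated with a toy model. This is a hedged Python sketch, not the bufmgr.c code: `read_buffer_extended`, `CorruptPageError`, the 16-byte page size, and the validity check (a page of the wrong length stands in for a "torn" page) are all assumptions made for the example, and the strategy argument and buffer cache are left out.

```python
from enum import Enum

PAGE_SIZE = 16  # tiny page size, chosen only for the example


class ReadBufferMode(Enum):
    RBM_NORMAL = "normal"
    RBM_ZERO = "zero"
    RBM_ZERO_ON_ERROR = "zero_on_error"


class CorruptPageError(Exception):
    pass


def page_is_valid(page):
    # Hypothetical check: a short page models a torn write.
    return len(page) == PAGE_SIZE


def read_buffer_extended(blocks, blockno, mode=ReadBufferMode.RBM_NORMAL):
    zeros = bytes(PAGE_SIZE)
    if mode is ReadBufferMode.RBM_ZERO:
        return zeros  # caller will initialize the page; storage is not read
    page = blocks[blockno]
    if page_is_valid(page):
        return page
    if mode is ReadBufferMode.RBM_ZERO_ON_ERROR:
        return zeros  # hand back an empty page instead of failing
    raise CorruptPageError(f"invalid page in block {blockno}")
```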

30 Sep, 2008 1 commit

Rewrite the FSM. Instead of relying on a fixed-size shared memory segment, the · 15c121b3

Committed by Heikki Linnakangas on Sep 30, 2008

free space information is stored in a dedicated FSM relation fork, with each
relation (except for hash indexes; they don't use FSM).

This eliminates the max_fsm_relations and max_fsm_pages GUC options; remove any
trace of them from the backend, initdb, and documentation.

Rewrite contrib/pg_freespacemap to match the new FSM implementation. Also
introduce a new variant of the get_raw_page(regclass, int4, int4) function in
contrib/pageinspect that lets you return pages from any relation fork, and
a new fsm_page_contents() function to inspect the new FSM pages.

12 May, 2008 1 commit

Restructure some header files a bit, in particular heapam.h, by removing some · f8c4d7db

Committed by Alvaro Herrera on May 12, 2008

unnecessary #include lines in it.  Also, move some tuple routine prototypes and
macros to htup.h, which allows removal of heapam.h inclusion from some .c
files.

For this to work, a new header file access/sysattr.h needed to be created,
initially containing attribute numbers of system columns, for pg_dump usage.

While at it, make contrib ltree, intarray and hstore header files more
consistent with our header style.

27 Mar, 2008 1 commit

Move the HTSU_Result enum definition into snapshot.h, to avoid including · 73b0300b

Committed by Alvaro Herrera on Mar 26, 2008

tqual.h into heapam.h.  This makes all inclusion of tqual.h explicit.

I also sorted alphabetically the includes on some source files.

25 Mar, 2008 1 commit

Fix various infelicities that have snuck into usage of errdetail() and · 32b58d02

Committed by Tom Lane on Mar 24, 2008

friends. Avoid double translation of some messages, ensure other messages
are exposed for translation (and make them follow the style guidelines),
avoid unsafe passing of an unpredictable message text as a format string.

10 Mar, 2008 1 commit

Reduce memory consumption during VACUUM of large relations, by using · 3fcc7e8e

Committed by Tom Lane on Mar 10, 2008

FSMPageData (6 bytes) instead of PageFreeSpaceInfo (8 or 16 bytes)
for the temporary array of page-free-space information.

Itagaki Takahiro

02 Jan, 2008 1 commit

Update copyrights in source tree to 2008. · 9098ab9e

Committed by Bruce Momjian on Jan 01, 2008

16 Nov, 2007 1 commit

pgindent run for 8.3. · fdf5a5ef

Committed by Bruce Momjian on Nov 15, 2007

27 Sep, 2007 1 commit

Adjust the new memory limit in the lazy vacuum code to use MaxHeapTuplesPerPage · b83e1163

Committed by Alvaro Herrera on Sep 26, 2007

tuples per page instead of fixed 200, to better cope with systems that use a
different block size.

24 Sep, 2007 2 commits

Reduce the size of memory allocations by lazy vacuum when processing a small · 58536626

Committed by Alvaro Herrera on Sep 24, 2007

table, by allocating just enough for a hardcoded number of dead tuples per
page.  The current estimate is 200 dead tuples per page.

Per reports from Jeff Amiel, Erik Jones and Marko Kreen, and subsequent
discussion.
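
The sizing rule can be sketched as a small calculation. This is a rough Python illustration rather than the code in vacuumlazy.c: the function name `dead_tuple_slots` is hypothetical, the overall memory limit is assumed to be a byte budget supplied by the caller, and each dead-tuple entry is assumed to be a 6-byte tuple identifier.

```python
def dead_tuple_slots(mem_limit_bytes, rel_pages, tuples_per_page=200, tid_size=6):
    """Number of dead-tuple slots to allocate for one lazy vacuum run.

    The array never needs more entries than the table could hold
    (rel_pages * tuples_per_page), and never more than the memory limit allows.
    """
    by_memory = mem_limit_bytes // tid_size
    by_table_size = rel_pages * tuples_per_page
    return min(by_memory, by_table_size)
```

For a 10-page table and a 16 MB budget this yields 2000 slots (about 12 kB) instead of an array sized for the whole budget. The later commit b83e1163 replaces the fixed 200 with MaxHeapTuplesPerPage, which in this sketch corresponds to passing a different `tuples_per_page`.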

Simplify and rename some GUC variables, per various recent discussions: · 48f7e643

Committed by Tom Lane on Sep 24, 2007

* stats_start_collector goes away; we always start the collector process,
unless prevented by a problem with setting up the stats UDP socket.

* stats_reset_on_server_start goes away; it seems useless in view of the
availability of pg_stat_reset().

* stats_block_level and stats_row_level are merged into a single variable
"track_counts", which controls all reports sent to the collector process.

* stats_command_string is renamed to track_activities.

* log_autovacuum is renamed to log_autovacuum_min_duration to better reflect
its meaning.

The log_autovacuum change is not a compatibility issue since it didn't exist
before 8.3 anyway.  The other changes need to be release-noted.

21 Sep, 2007 2 commits

Revert ill-fated patch to release exclusive lock early after vacuum · eb5f4d6c

Committed by Tom Lane on Sep 20, 2007

truncates a table.  Introduces race condition, as shown by buildfarm
failures.

HOT updates. When we update a tuple without changing any of its indexed · 282d2a03

Committed by Tom Lane on Sep 20, 2007

columns, and the new version can be stored on the same heap page, we no longer
generate extra index entries for the new version. Instead, index searches
follow the HOT-chain links to ensure they find the correct tuple version.

In addition, this patch introduces the ability to "prune" dead tuples on a
per-page basis, without having to do a complete VACUUM pass to recover space.
VACUUM is still needed to clean up dead index entries, however.

Pavan Deolasee, with help from a bunch of other people.
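
The chain-following idea can be shown with a toy single-page heap. This is a simplified Python sketch under stated assumptions, not the actual heap access code: `insert`, `update`, and `search` are hypothetical helpers, the page is assumed to always have room, and a boolean `live` flag stands in for real MVCC visibility checks.

```python
def insert(page, index, key, payload):
    """Store a new tuple and add an index entry pointing at it."""
    off = len(page["tuples"])
    page["tuples"].append({"key": key, "payload": payload, "next": None, "live": True})
    index.setdefault(key, []).append(off)
    return off


def update(page, index, offset, key, payload):
    """Replace the tuple at offset. Returns True if the update was HOT."""
    old = page["tuples"][offset]
    new_off = len(page["tuples"])
    page["tuples"].append({"key": key, "payload": payload, "next": None, "live": True})
    old["live"] = False
    if key == old["key"]:
        old["next"] = new_off  # HOT: link the versions, no new index entry
        return True
    index.setdefault(key, []).append(new_off)  # indexed column changed
    return False


def search(page, index, key):
    """Follow each index entry's chain until a live version is found."""
    results = []
    for off in index.get(key, []):
        while off is not None:
            tup = page["tuples"][off]
            if tup["live"]:
                results.append(tup["payload"])
                break
            off = tup["next"]
    return results
```

After a HOT update the index still holds a single entry for the key, yet `search` returns the newest payload by walking the chain from the original tuple.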

16 Sep, 2007 1 commit

Fix aboriginal mistake in lazy VACUUM's code for truncating away · 43b0c918

Committed by Tom Lane on Sep 16, 2007

no-longer-needed pages at the end of a table. We thought we could throw away
pages containing HEAPTUPLE_DEAD tuples; but this is not so, because such
tuples very likely have index entries pointing at them, and we wouldn't have
removed the index entries. The problem only emerges in a somewhat unlikely
race condition: the dead tuples have to have been inserted by a transaction
that later aborted, and this has to have happened between VACUUM's initial
scan of the page and then rechecking it for empty in count_nondeletable_pages.
But that timespan will include an index-cleaning pass, so it's not all that
hard to hit. This seems to explain a couple of previously unsolved bug
reports.

13 Sep, 2007 1 commit

Redefine the lp_flags field of item pointers as having four states, rather · 68893035

Committed by Tom Lane on Sep 12, 2007

than two independent bits (one of which was never used in heap pages anyway,
or at least hadn't been in a very long time). This gives us flexibility to
add the HOT notions of redirected and dead item pointers without requiring
anything so klugy as magic values of lp_off and lp_len. The state values
are chosen so that for the states currently in use (pre-HOT) there is no
change in the physical representation.
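
A four-state field fits in the same two bits that previously held two independent flags. The Python sketch below is only an illustration: the state names follow PostgreSQL's item-identifier flags, but `pack_item_id` and `unpack_item_id` are hypothetical helpers, and the bit positions chosen here (15-bit offset, 2-bit state, 15-bit length in one 32-bit word) are an assumption that does not claim to match the compiler's actual bitfield layout.

```python
from enum import IntEnum


class LpFlags(IntEnum):
    LP_UNUSED = 0    # same value as "no bits set" before the change
    LP_NORMAL = 1    # same value as the old "used" bit
    LP_REDIRECT = 2  # HOT: points at another line pointer
    LP_DEAD = 3      # HOT: dead, storage may already be gone


def pack_item_id(off, flags, length):
    if not (0 <= off < 2**15 and 0 <= length < 2**15):
        raise ValueError("offset and length must fit in 15 bits")
    return off | (int(flags) << 15) | (length << 17)


def unpack_item_id(word):
    return word & 0x7FFF, LpFlags((word >> 15) & 0x3), word >> 17
```

Because the two pre-HOT states keep the values 0 and 1, existing pages read the same way before and after the redefinition.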

12 Sep, 2007 1 commit

Add a CHECK_FOR_INTERRUPTS call in the site where the vacuum delay point · 9588e1bd

Committed by Alvaro Herrera on Sep 12, 2007

was removed.

11 Sep, 2007 2 commits

Release the exclusive lock on the table early after truncating it in lazy · 6a10f0f7

Committed by Alvaro Herrera on Sep 10, 2007

vacuum, instead of waiting till commit.

Remove the vacuum_delay_point call in count_nondeletable_pages, because we hold · 21c27af6

Committed by Alvaro Herrera on Sep 10, 2007

an exclusive lock on the table at this point, which we want to release as soon
as possible. This is called in the phase of lazy vacuum where we truncate the
empty pages at the end of the table.

An alternative solution would be to lower the vacuum delay settings before
starting the truncating phase, but this doesn't work very well in autovacuum
due to the autobalancing code (which can cause other processes to change our
cost delay settings). This case could be considered in the balancing code, but
it is simpler this way.

06 Sep, 2007 1 commit

Implement lazy XID allocation: transactions that do not modify any database · 295e6398

Committed by Tom Lane on Sep 05, 2007

rows will normally never obtain an XID at all. We already did things this way
for subtransactions, but this patch extends the concept to top-level
transactions. In applications where there are lots of short read-only
transactions, this should improve performance noticeably; not so much from
removal of the actual XID-assignments, as from reduction of overhead that's
driven by the rate of XID consumption. We add a concept of a "virtual
transaction ID" so that active transactions can be uniquely identified even
if they don't have a regular XID. This is a much lighter-weight concept:
uniqueness of VXIDs is only guaranteed over the short term, and no on-disk
record is made about them.

Florian Pflug, with some editorialization by Tom.
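
The split between virtual and regular transaction IDs can be modeled briefly. This Python sketch is an assumption-laden illustration, not the transaction manager's code: `XidAllocator`, `Backend`, and `Transaction` are hypothetical classes, a VXID is modeled as a (backend id, local counter) pair, and the starting XID value is arbitrary.

```python
class XidAllocator:
    """Global counter for regular XIDs; every allocation is 'expensive'."""

    def __init__(self, first_xid=100):
        self.next_xid = first_xid
        self.consumed = 0

    def allocate(self):
        xid = self.next_xid
        self.next_xid += 1
        self.consumed += 1
        return xid


class Transaction:
    def __init__(self, vxid, allocator):
        self.vxid = vxid  # always present, cheap, never written to disk
        self.xid = None   # assigned lazily on the first write
        self._allocator = allocator

    def read(self):
        return self.xid  # reading never forces an XID

    def write(self):
        if self.xid is None:
            self.xid = self._allocator.allocate()
        return self.xid


class Backend:
    def __init__(self, backend_id, allocator):
        self.backend_id = backend_id
        self.allocator = allocator
        self.local_counter = 0

    def begin(self):
        self.local_counter += 1
        return Transaction((self.backend_id, self.local_counter), self.allocator)
```

A workload of read-only transactions leaves `consumed` at zero, which models the reduced XID consumption rate the message describes.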

31 May, 2007 1 commit

Make large sequential scans and VACUUMs work in a limited-size "ring" of · d526575f

Committed by Tom Lane on May 30, 2007

buffers, rather than blowing out the whole shared-buffer arena. Aside from
avoiding cache spoliation, this fixes the problem that VACUUM formerly tended
to cause a WAL flush for every page it modified, because we had it hacked to
use only a single buffer. Those flushes will now occur only once per
ring-ful. The exact ring size, and the threshold for seqscans to switch into
the ring usage pattern, remain under debate; but the infrastructure seems
done. The key bit of infrastructure is a new optional BufferAccessStrategy
object that can be passed to ReadBuffer operations; this replaces the former
StrategyHintVacuum API.

This patch also changes the buffer usage-count methodology a bit: we now
advance usage_count when first pinning a buffer, rather than when last
unpinning it. To preserve the behavior that a buffer's lifetime starts to
decrease when it's released, the clock sweep code is modified to not decrement
usage_count of pinned buffers.

Work not done in this commit: teach GiST and GIN indexes to use the vacuum
BufferAccessStrategy for vacuum-driven fetches.

Original patch by Simon, reworked by Heikki and again by Tom.
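
The usage-count change described in the second paragraph of this message can be sketched as a toy clock sweep. This is an illustrative Python model, not the freelist code: `Buffer`, `pin`, `unpin`, and `clock_sweep` are hypothetical names, the cap of 5 on the usage count is an assumed value, and every `pin` call is treated as a backend's first pin of that buffer.

```python
MAX_USAGE_COUNT = 5  # assumed cap for the example


class Buffer:
    def __init__(self):
        self.usage_count = 0
        self.pin_count = 0


def pin(buf):
    # usage_count advances at pin time, not at unpin time
    buf.pin_count += 1
    buf.usage_count = min(buf.usage_count + 1, MAX_USAGE_COUNT)


def unpin(buf):
    buf.pin_count -= 1


def clock_sweep(buffers, hand):
    """Return (victim_index, new_hand). Pinned buffers are skipped untouched."""
    if all(b.pin_count > 0 for b in buffers):
        raise RuntimeError("no unpinned buffers available")
    while True:
        idx = hand
        buf = buffers[idx]
        hand = (hand + 1) % len(buffers)
        if buf.pin_count > 0:
            continue  # do not decrement usage_count while pinned
        if buf.usage_count == 0:
            return idx, hand
        buf.usage_count -= 1
```

Because the sweep leaves pinned buffers alone, a buffer's count only starts to decay once it has been released, which preserves the earlier behavior.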

17 May, 2007 1 commit

Move the tuple freezing point in CLUSTER to a point further back in the past, · 3b0347b3

Committed by Alvaro Herrera on May 17, 2007

to avoid losing useful Xid information in not-so-old tuples.  This makes
CLUSTER behave the same as VACUUM as far as tuple-freezing behavior goes
(though CLUSTER does not yet advance the table's relfrozenxid).

While at it, move the actual freezing operation in rewriteheap.c to a more
appropriate place, and document it thoroughly.  This part of the patch from
Tom Lane.

30 Apr, 2007 1 commit

Implement rate-limiting logic on how often backends will attempt to send · 957d08c8

Committed by Tom Lane on Apr 30, 2007

messages to the stats collector. This avoids the problem that enabling
stats_row_level for autovacuum has a significant overhead for short
read-only transactions, as noted by Arjen van der Meijden. We can avoid
an extra gettimeofday call by piggybacking on the one done for WAL-logging
xact commit or abort (although that doesn't help read-only transactions,
since they don't WAL-log anything).

In my proposal for this, I noted that we could change the WAL log entries
for commit/abort to record full TimestampTz precision, instead of only
time_t as at present. That's not done in this patch, but will be committed
separately.
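
The rate limit amounts to accumulating counters locally and sending only when enough time has passed since the last message. The Python sketch below is a hedged illustration, not the pgstat code: `StatsReporter` is a hypothetical class, the 500 ms minimum interval is an assumed value, and the caller passes in the current time so that one clock reading can be shared with other work.

```python
class StatsReporter:
    def __init__(self, send, min_interval_ms=500):
        self.send = send  # callable that delivers one message
        self.min_interval_ms = min_interval_ms
        self.pending = {}
        self.last_send_ms = None

    def report(self, now_ms, counts):
        """Merge counts into the pending totals; send only if the interval elapsed.

        Returns True if a message was sent.
        """
        for key, value in counts.items():
            self.pending[key] = self.pending.get(key, 0) + value
        too_soon = (
            self.last_send_ms is not None
            and now_ms - self.last_send_ms < self.min_interval_ms
        )
        if too_soon:
            return False  # keep accumulating, no message this time
        self.send(dict(self.pending))
        self.pending.clear()
        self.last_send_ms = now_ms
        return True
```

Short transactions that finish within the interval add to the pending totals without sending anything, so nothing is lost and the per-transaction overhead drops.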

20 Apr, 2007 1 commit

Silence compiler warnings, per Bruce. · dfa58878

Committed by Alvaro Herrera on Apr 19, 2007

19 Apr, 2007 1 commit

Enable configurable log of autovacuum actions. Initial patch from Simon · ef23a774

Committed by Alvaro Herrera on Apr 18, 2007

Riggs, additional code and docs by me.  Per discussion.

22 Feb, 2007 2 commits

Update new optional VACUUM FULL hint for translations, per Alvaro. · 50c7e83c

Committed by Bruce Momjian on Feb 21, 2007

Move increase FSM warning to after lazy_truncate_heap() because the · 3aa37600

Committed by Bruce Momjian on Feb 21, 2007

function might reduce the number of free pages in the table.  Recommend
VACUUM FULL only if 20% free.

Simon Riggs.

04 Feb, 2007 1 commit

Change vacuum lazy "compacting" warning message to: · c29a0bd5

Committed by Bruce Momjian on Feb 04, 2007

  errhint("Consider using VACUUM FULL on this relation or increasing the configuration parameter \"max_fsm_pages\".")));

06 Jan, 2007 1 commit

Update CVS HEAD for 2007 copyright. Back branches are typically not · 29dccf5f

Committed by Bruce Momjian on Jan 05, 2007

back-stamped for this.

06 Nov, 2006 1 commit

Fix recently-understood problems with handling of XID freezing, particularly · 48188e16

Committed by Tom Lane on Nov 05, 2006

in PITR scenarios. We now WAL-log the replacement of old XIDs with
FrozenTransactionId, so that such replacement is guaranteed to propagate to
PITR slave databases. Also, rather than relying on hint-bit updates to be
preserved, pg_clog is not truncated until all instances of an XID are known to
have been replaced by FrozenTransactionId. Add new GUC variables and
pg_autovacuum columns to allow management of the freezing policy, so that
users can trade off the size of pg_clog against the amount of freezing work
done. Revise the already-existing code that forces autovacuum of tables
approaching the wraparound point to make it more bulletproof; also, revise the
autovacuum logic so that anti-wraparound vacuuming is done per-table rather
than per-database. initdb forced because of changes in pg_class, pg_database,
and pg_autovacuum catalogs. Heikki Linnakangas, Simon Riggs, and Tom Lane.

04 Oct, 2006 1 commit

pgindent run for 8.2. · f99a569a

Committed by Bruce Momjian on Oct 04, 2006

22 Sep, 2006 1 commit

Fix free space map to correctly track the total amount of FSM space needed · 9e936693

Committed by Tom Lane on Sep 21, 2006

even when a single relation requires more than max_fsm_pages pages. Also,
make VACUUM emit a warning in this case, since it likely means that VACUUM
FULL or other drastic corrective measure is needed. Per reports from Jeff
Frost and others of unexpected changes in the claimed max_fsm_pages need.

14 Sep, 2006 1 commit

Code review for patch to avoid second scan when vacuuming index-less · 33d3ad46

Committed by Tom Lane on Sep 13, 2006

table: avoid invoking LockBufferForCleanup without need, put out the
same log message we would have before, minor code beautification.

05 Sep, 2006 1 commit

Trivial patch to double vacuum speed on tables with no indexes (prevent · ed8969b1

Committed by Bruce Momjian on Sep 04, 2006

second scan of table).

Gregory Stark

01 Aug, 2006 1 commit

Change the relation_open protocol so that we obtain lock on a relation · 09d3670d

Committed by Tom Lane on Jul 31, 2006

(table or index) before trying to open its relcache entry.  This fixes
race conditions in which someone else commits a change to the relation's
catalog entries while we are in process of doing relcache load.  Problems
of that ilk have been reported sporadically for years, but it was not
really practical to fix until recently --- for instance, the recent
addition of WAL-log support for in-place updates helped.

Along the way, remove pg_am.amconcurrent: all AMs are now expected to support
concurrent update.

14 Jul, 2006 2 commits

Remove 576 references of include files that were not needed. · e0522505

Committed by Bruce Momjian on Jul 14, 2006

Allow include files to compile on their own. · a22d76d9

Committed by Bruce Momjian on Jul 13, 2006

Strip unused include files out of include files, and add needed
includes to C files.

The next step is to remove unused include files in C files.

11 Jul, 2006 1 commit

Improve vacuum code to track minimum Xids per table instead of per database. · d4cef0aa

Committed by Alvaro Herrera on Jul 10, 2006

To this end, add a couple of columns to pg_class, relminxid and relvacuumxid,
based on which we calculate the pg_database columns after each vacuum.

We now force all databases to be vacuumed, even template ones. A backend
noticing too old a database (meaning pg_database.datminxid is in danger of
falling behind Xid wraparound) will signal the postmaster, which in turn will
start an autovacuum iteration to process the offending database. In principle
this is only there to cope with frozen (non-connectable) databases without
forcing users to set them to connectable, but it could force regular user
databases to go through a database-wide vacuum at any time. Maybe we should
warn users about this somehow. Of course the real solution will be to use
autovacuum all the time ;-)

There are some additional improvements we could have in this area: for example
the vacuum code could be smarter about not updating pg_database for each table
when called by autovacuum, and do it only once the whole autovacuum iteration
is done.

I updated the system catalogs documentation, but I didn't modify the
maintenance section. Also having some regression tests for this would be nice
but it's not really a very straightforward thing to do.

Catalog version bumped due to system catalog changes.
