Commits · 78a09145e0f8322e625bbc7d69fcb865ce4f3034 · Greenplum / Gpdb

30 Jul, 2009 1 commit

Support deferrable uniqueness constraints. · 25d9bf2e

Committed by Tom Lane on Jul 29, 2009

The current implementation fires an AFTER ROW trigger for each tuple that
looks like it might be non-unique according to the index contents at the
time of insertion.  This works well as long as there aren't many conflicts,
but won't scale to massive unique-key reassignments.  Improving that case
is a TODO item.

Dean Rasheed

11 Jun, 2009 1 commit

8.4 pgindent run, with new combined Linux/FreeBSD/MinGW typedef list · d7471402

Committed by Bruce Momjian on Jun 11, 2009

provided by Andrew.

02 Jan, 2009 1 commit

Update copyright for 2009. · 511db38a

Committed by Bruce Momjian on Jan 01, 2009

31 Dec, 2008 1 commit

The flag to mark dead tuples is nowadays called LP_DEAD, not LP_DELETE. · 4942ea28

Committed by Heikki Linnakangas on Dec 30, 2008

Simon Riggs.

14 Jul, 2008 1 commit

Clean up the use of some page-header-access macros: principally, use · 9d035f42

Committed by Tom Lane on Jul 13, 2008

SizeOfPageHeaderData instead of sizeof(PageHeaderData) in places where that
makes the code clearer, and avoid casting between Page and PageHeader where
possible. Zdenek Kotala, with some additional cleanup by Heikki Linnakangas.

I did not apply the parts of the proposed patch that would have resulted in
slightly changing the on-disk format of hash indexes; it seems to me that's
not a win as long as there's any chance of having in-place upgrade for 8.4.
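
The difference between the two size expressions named in this commit can be sketched in C. This is a minimal illustration, not PostgreSQL's actual definitions: `DemoItemId`, `DemoPageHeader`, `SizeOfDemoPageHeader`, and `demo_free_space` are hypothetical names, and the field layout is invented. The assumption is that the header struct ends in a one-element array marking the start of the line-pointer array, so `sizeof` also counts that first element and any trailing padding, while an `offsetof`-based macro measures only the fixed part.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a line pointer; not PostgreSQL's ItemIdData. */
typedef struct DemoItemId
{
    uint32_t    bits;
} DemoItemId;

/* Hypothetical page header whose last member starts the line-pointer array. */
typedef struct DemoPageHeader
{
    uint64_t    lsn;
    uint16_t    lower;
    uint16_t    upper;
    uint16_t    special;
    DemoItemId  linp[1];        /* really extends into the rest of the page */
} DemoPageHeader;

/* Size of the fixed header only: everything before the first line pointer. */
#define SizeOfDemoPageHeader offsetof(DemoPageHeader, linp)

/* Bytes left for line pointers and tuples on an otherwise empty page. */
size_t
demo_free_space(size_t page_size, size_t special_size)
{
    assert(page_size >= SizeOfDemoPageHeader + special_size);
    return page_size - SizeOfDemoPageHeader - special_size;
}
```

Using the macro in space calculations avoids silently charging the page for a line pointer that may not exist.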

19 Jun, 2008 1 commit

Improve our #include situation by moving pointer types away from the · a3540b0f

Committed by Alvaro Herrera on Jun 19, 2008

corresponding struct definitions. This allows other headers to avoid including
certain highly-loaded headers such as rel.h and relscan.h, instead using just
relcache.h, heapam.h or genam.h, which are more lightweight and thus cause less
unnecessary dependencies.

07 Jun, 2008 1 commit

Change xlog.h to xlogdefs.h in bufpage.h, and fix fallout. · e4ca6cac

Committed by Alvaro Herrera on Jun 06, 2008

17 Apr, 2008 1 commit

Repair two places where SIGTERM exit could leave shared memory state · d1cbd26d

Committed by Tom Lane on Apr 16, 2008

corrupted. (Neither is very important if SIGTERM is used to shut down the
whole database cluster together, but there's a problem if someone tries to
SIGTERM individual backends.) To do this, introduce new infrastructure
macros PG_ENSURE_ERROR_CLEANUP/PG_END_ENSURE_ERROR_CLEANUP that take care
of transiently pushing an on_shmem_exit cleanup hook. Also use this method
for createdb cleanup --- that wasn't a shared-memory-corruption problem,
but SIGTERM abort of createdb could leave orphaned files lying around.

Backpatch as far as 8.2. The shmem corruption cases don't exist in 8.1,
and the createdb usage doesn't seem important enough to risk backpatching
further.

11 Apr, 2008 1 commit

Replace "amgetmulti" AM functions with "amgetbitmap", in which the whole · 4e82a954

Committed by Tom Lane on Apr 10, 2008

indexscan always occurs in one call, and the results are returned in a
TIDBitmap instead of a limited-size array of TIDs.  This should improve
speed a little by reducing AM entry/exit overhead, and it is necessary
infrastructure if we are ever to support bitmap indexes.

In an only slightly related change, add support for TIDBitmaps to preserve
(somewhat lossily) the knowledge that particular TIDs reported by an index
need to have their quals rechecked when the heap is visited.  This facility
is not really used yet; we'll need to extend the forced-recheck feature to
plain indexscans before it's useful, and that hasn't been coded yet.
The intent is to use it to clean up 8.3's horrid @@@ kluge for text search
with weighted queries.  There might be other uses in future, but that one
alone is sufficient reason.

Heikki Linnakangas, with some adjustments by me.
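
The contrast between the two calling styles described above can be sketched as follows. This is a toy model under stated assumptions, not the real access-method API: `DemoScan`, `DemoBitmap`, `demo_getmulti`, `demo_getbitmap`, and `demo_bitmap_test` are hypothetical, TIDs are modeled as small integers, and the bitmap is a plain bit array rather than a TIDBitmap.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define DEMO_MAX_TID 1024

typedef struct DemoScan
{
    const int  *matches;    /* matching TIDs, each in [0, DEMO_MAX_TID) */
    int         nmatches;
    int         next;       /* scan position, used by the batch style only */
} DemoScan;

typedef struct DemoBitmap
{
    uint8_t     bits[DEMO_MAX_TID / 8];
} DemoBitmap;

/* Batch style: fills a limited-size array; returns true if more may remain. */
bool
demo_getmulti(DemoScan *scan, int *tids, int max_tids, int *returned)
{
    int         n = 0;

    while (n < max_tids && scan->next < scan->nmatches)
        tids[n++] = scan->matches[scan->next++];
    *returned = n;
    return scan->next < scan->nmatches;
}

/* Bitmap style: the whole scan happens in one call; returns TIDs added. */
int64_t
demo_getbitmap(DemoScan *scan, DemoBitmap *bm)
{
    int64_t     added = 0;

    for (int i = 0; i < scan->nmatches; i++)
    {
        int         tid = scan->matches[i];

        assert(tid >= 0 && tid < DEMO_MAX_TID);
        bm->bits[tid / 8] |= (uint8_t) (1u << (tid % 8));
        added++;
    }
    return added;
}

/* Membership test on the toy bitmap. */
bool
demo_bitmap_test(const DemoBitmap *bm, int tid)
{
    return ((bm->bits[tid / 8] >> (tid % 8)) & 1) != 0;
}
```

With five matches and a two-slot array, the batch style needs three entries into the access method, whereas the bitmap style needs one. That entry/exit overhead is what the commit message expects to save.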

02 Jan, 2008 1 commit

Update copyrights in source tree to 2008. · 9098ab9e

Committed by Bruce Momjian on Jan 01, 2008

17 Nov, 2007 1 commit

Repair still another bug in the btree page split WAL reduction patch: · 93190c30

Committed by Tom Lane on Nov 16, 2007

it failed for splits of non-leaf pages because in such pages the first
data key on a page is suppressed, and so we can't just copy the first
key from the right page to reconstitute the left page's high key.
Problem found by Koichi Suzuki, patch by Heikki.

16 Nov, 2007 1 commit

pgindent run for 8.3. · fdf5a5ef

Committed by Bruce Momjian on Nov 15, 2007

12 Apr, 2007 1 commit

Code review for btree page split WAL reduction patch. Make it actually work · 226a1005

Committed by Tom Lane on Apr 11, 2007

(original code *always* created a full-page image for the left page, thus
leaving the intended savings unrealized), avoid risk of not having enough room
on the page during xlog restore, squeeze out another couple bytes in the xlog
record, clean up neglected comments.

10 Apr, 2007 1 commit

Minor tweaking of index special-space definitions so that the various · 56218fbc

Committed by Tom Lane on Apr 09, 2007

index types can be reliably distinguished by examining the special space
on an index page. Per my earlier proposal, plus the realization that
there's no need for btree's vacuum cycle ID to cycle through every possible
16-bit value. Restricting its range a little costs nearly nothing and
eliminates the possibility of collisions.
Memo to self: remember to make bitmap indexes play along with this scheme,
assuming that patch ever gets accepted.
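
The idea of a cycle counter that skips part of the 16-bit range can be sketched as below. This is only an illustration: `demo_next_cycle_id` is a hypothetical helper, and the cap `DEMO_MAX_CYCLE_ID` (0xFF7F) is an assumed value chosen for the example. Zero is also skipped here, on the assumption that it means "no vacuum in progress".

```c
#include <assert.h>
#include <stdint.h>

/* Assumed cap: values above it are left free to identify other index types. */
#define DEMO_MAX_CYCLE_ID 0xFF7F

/* Advance the cycle ID, wrapping to 1 instead of using 0 or reserved values. */
uint16_t
demo_next_cycle_id(uint16_t current)
{
    uint16_t    next = (uint16_t) (current + 1);

    if (next == 0 || next > DEMO_MAX_CYCLE_ID)
        next = 1;
    return next;
}
```

Giving up the top of the range costs only a slightly shorter cycle, while guaranteeing that the last bytes of btree special space never look like another index type's marker.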

08 Feb, 2007 1 commit

Reduce WAL activity for page splits: · b79575ce

Committed by Bruce Momjian on Feb 08, 2007

> Currently, an index split writes all the data on the split page to
> WAL. That's a lot of WAL traffic. The tuples that are copied to the
> right page need to be WAL logged, but the tuples that stay on the
> original page don't.

Heikki Linnakangas

05 Feb, 2007 1 commit

Rename MaxTupleSize to MaxHeapTupleSize to clarify that it's not meant to · 23c4978e

Committed by Tom Lane on Feb 05, 2007

describe the maximum size of index tuples (which is typically AM-dependent
anyway); and consequently remove the bogus deduction for "special space"
that was built into it.

Adjust TOAST_TUPLE_THRESHOLD and TOAST_MAX_CHUNK_SIZE to avoid wasting two
bytes per toast chunk, and to ensure that the calculation correctly tracks any
future changes in page header size. The computation had been inaccurate in a
way that didn't cause any harm except space wastage, but future changes could
have broken it more drastically.

Fix the calculation of BTMaxItemSize, which was formerly computed as 1 byte
more than it could safely be. This didn't cause any harm in practice because
it's only compared against maxalign'd lengths, but future changes in the size
of page headers or btree special space could have exposed the problem.

initdb forced because of change in TOAST_MAX_CHUNK_SIZE, which alters the
storage of toast tables.
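
Why a limit that is one byte too large can be harmless when it is only compared against maxaligned lengths, as the BTMaxItemSize paragraph argues, can be sketched with a small experiment. The 8-byte alignment, the macro `DEMO_MAXALIGN`, the helper names, and the limits used in the usage note are all assumptions for illustration, not values from the source tree.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Assumed maximum alignment for the demo. */
#define DEMO_ALIGNOF 8

/* Round a length up to the next multiple of DEMO_ALIGNOF. */
#define DEMO_MAXALIGN(len) \
    (((size_t) (len) + (DEMO_ALIGNOF - 1)) & ~((size_t) (DEMO_ALIGNOF - 1)))

/* An item fits if its aligned length does not exceed the limit. */
bool
demo_item_fits(size_t raw_len, size_t limit)
{
    return DEMO_MAXALIGN(raw_len) <= limit;
}

/* True if both limits accept exactly the same raw lengths in [1, max_len]. */
bool
demo_limits_equivalent(size_t limit_a, size_t limit_b, size_t max_len)
{
    for (size_t len = 1; len <= max_len; len++)
    {
        if (demo_item_fits(len, limit_a) != demo_item_fits(len, limit_b))
            return false;
    }
    return true;
}
```

In this model, limits of 2710 and 2711 accept the same items, because no multiple of 8 lies between them. Limits of 2711 and 2712 do not, since 2712 is itself a multiple of 8. That is the sense in which a later change to header or special-space sizes could have exposed the off-by-one.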

21 Jan, 2007 1 commit

Refactor the index AM API slightly: move currentItemData and · 2b7334d4

Committed by Neil Conway on Jan 20, 2007

currentMarkData from IndexScanDesc to the opaque structs for the
AMs that need this information (currently gist and hash).

Patch from Heikki Linnakangas, fixes by Neil Conway.

09 Jan, 2007 1 commit

Support ORDER BY ... NULLS FIRST/LAST, and add ASC/DESC/NULLS FIRST/NULLS LAST · 44317582

Committed by Tom Lane on Jan 09, 2007

per-column options for btree indexes. The planner's support for this is still
pretty rudimentary; it does not yet know how to plan mergejoins with
nondefault ordering options. The documentation is pretty rudimentary, too.
I'll work on improving that stuff later.

Note incompatible change from prior behavior: ORDER BY ... USING will now be
rejected if the operator is not a less-than or greater-than member of some
btree opclass. This prevents less-than-sane behavior if an operator that
doesn't actually define a proper sort ordering is selected.

06 Jan, 2007 1 commit

Update CVS HEAD for 2007 copyright. Back branches are typically not · 29dccf5f

Committed by Bruce Momjian on Jan 05, 2007

back-stamped for this.

02 Nov, 2006 1 commit

Fix "failed to re-find parent key" btree VACUUM failure by revising page · 70ce5c90

Committed by Tom Lane on Nov 01, 2006

deletion code to avoid the case where an upper-level btree page remains "half
dead" for a significant period of time, and to block insertions into a key
range that is in process of being re-assigned to the right sibling of the
deleted page's parent. This prevents the scenario reported by Ed L. wherein
index keys could become out-of-order in the grandparent index level.

Since this is a moderately invasive fix, I'm applying it only to HEAD.
The bug exists back to 7.4, but the back branches will get a different patch.

04 Oct, 2006 1 commit

pgindent run for 8.2. · f99a569a

Committed by Bruce Momjian on Oct 04, 2006

24 Aug, 2006 1 commit

Optimize the case where a btree indexscan has current and mark positions · 08ae5edc

Committed by Tom Lane on Aug 24, 2006

on the same index page; we can avoid data copying as well as buffer refcount
manipulations in this common case.  Makes for a small but noticeable
improvement in mergejoin speed.

Heikki Linnakangas

08 Aug, 2006 1 commit

Make recovery from WAL be restartable, by executing a checkpoint-like · e0028369

Committed by Tom Lane on Aug 07, 2006

operation every so often. This improves the usefulness of PITR log
shipping for hot standby: formerly, if the standby server crashed, it
was necessary to restart it from the last base backup and replay all
the WAL since then. Now it will only need to reread about the same
amount of WAL as the master server would. The behavior might also
come in handy during a long PITR replay sequence. Simon Riggs,
with some editorialization by Tom Lane.

26 Jul, 2006 1 commit

Modify btree to delete known-dead index entries without an actual VACUUM. · e6284649

Committed by Tom Lane on Jul 25, 2006

When we are about to split an index page to do an insertion, first look
to see if any entries marked LP_DELETE exist on the page, and if so remove
them to try to make enough space for the desired insert.  This should reduce
index bloat in heavily-updated tables, although of course you still need
VACUUM eventually to clean up the heap.

Junji Teramoto
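
The "clean up before splitting" behavior described in this commit can be sketched with a toy page model. Everything here is hypothetical (`DemoPage`, `DemoItem`, `demo_remove_dead`, `demo_insert`), and the model simplifies heavily: items are appended rather than kept in key order, and space is tracked as a single byte counter.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define DEMO_MAX_ITEMS 64

typedef struct DemoItem
{
    int         key;
    size_t      size;
    bool        dead;       /* stands in for the dead-entry flag bit */
} DemoItem;

typedef struct DemoPage
{
    DemoItem    items[DEMO_MAX_ITEMS];
    int         nitems;
    size_t      free_space;
} DemoPage;

typedef enum
{
    DEMO_INSERTED,
    DEMO_INSERTED_AFTER_CLEANUP,
    DEMO_NEEDS_SPLIT
} DemoInsertResult;

/* Compact the page by dropping dead items; returns bytes reclaimed. */
size_t
demo_remove_dead(DemoPage *page)
{
    size_t      reclaimed = 0;
    int         keep = 0;

    for (int i = 0; i < page->nitems; i++)
    {
        if (page->items[i].dead)
            reclaimed += page->items[i].size;
        else
            page->items[keep++] = page->items[i];
    }
    page->nitems = keep;
    page->free_space += reclaimed;
    return reclaimed;
}

static bool
demo_has_room(const DemoPage *page, size_t size)
{
    return page->nitems < DEMO_MAX_ITEMS && page->free_space >= size;
}

/* Try a plain insert, then a dead-entry cleanup, before asking for a split. */
DemoInsertResult
demo_insert(DemoPage *page, int key, size_t size)
{
    bool        cleaned = false;

    assert(size > 0);
    if (!demo_has_room(page, size))
    {
        if (demo_remove_dead(page) == 0 || !demo_has_room(page, size))
            return DEMO_NEEDS_SPLIT;
        cleaned = true;
    }
    page->items[page->nitems].key = key;
    page->items[page->nitems].size = size;
    page->items[page->nitems].dead = false;
    page->nitems++;
    page->free_space -= size;
    return cleaned ? DEMO_INSERTED_AFTER_CLEANUP : DEMO_INSERTED;
}
```

The cleanup only runs on the path that would otherwise split, so an insertion that already fits pays nothing extra.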

12 Jul, 2006 1 commit

Tweak fillfactor code as per my recent proposal. Fix nbtsort.c so that · d29b6688

Committed by Tom Lane on Jul 11, 2006

it can handle small fillfactors for ordinary-sized index entries without
failing on large ones; fix nbtinsert.c to distinguish leaf and nonleaf
pages; change the minimum fillfactor to 10% for all index types.

04 Jul, 2006 1 commit

Code review for FILLFACTOR patch. Change WITH grammar as per earlier · b7b78d24

Committed by Tom Lane on Jul 03, 2006

discussion (including making def_arg allow reserved words), add missed
opt_definition for UNIQUE case. Put the reloptions support code in a less
random place (I chose to make a new file access/common/reloptions.c).
Eliminate header inclusion creep. Make the index options functions safely
user-callable (seems like client apps might like to be able to test validity
of options before trying to make an index). Reduce overhead for normal case
with no options by allowing rd_options to be NULL. Fix some unmaintainably
klugy code, including getting rid of Natts_pg_class_fixed at long last.
Some stylistic cleanup too, and pay attention to keeping comments in sync
with code.

Documentation still needs work, though I did fix the omissions in
catalogs.sgml and indexam.sgml.

02 Jul, 2006 1 commit

Add FILLFACTOR to CREATE INDEX. · 277807bd

Committed by Bruce Momjian on Jul 02, 2006

ITAGAKI Takahiro

08 May, 2006 1 commit

Rewrite btree vacuuming to fold the former bulkdelete and cleanup operations · 5749f6ef

Committed by Tom Lane on May 08, 2006

into a single mostly-physical-order scan of the index. This requires some
ticklish interlocking considerations, but should create no material
performance impact on normal index operations (at least given the
already-committed changes to make scans work a page at a time). VACUUM
itself should get significantly faster in any index that's degenerated to a
very nonlinear page order. Also, we save one pass over the index entirely,
except in the case where there were no deletions to do and so only one pass
happened anyway.

Original patch by Heikki Linnakangas, rework by Tom Lane.

07 May, 2006 1 commit

Rewrite btree index scans to work a page at a time in all cases (both · 09cb5c0e

Committed by Tom Lane on May 07, 2006

btgettuple and btgetmulti).  This eliminates the problem of "re-finding" the
exact stopping point, since the stopping point is effectively always a page
boundary, and index items are never moved across pre-existing page boundaries.
A small penalty is that the keys_are_unique optimization is effectively
disabled (and, therefore, is removed in this patch), causing us to apply
_bt_checkkeys() to at least one more tuple than necessary when looking up a
unique key.  However, the advantages for non-unique cases seem great enough to
accept this tradeoff.  Aside from simplifying and (sometimes) speeding up the
indexscan code, this will allow us to reimplement btbulkdelete as a largely
sequential scan instead of index-order traversal, thereby significantly
reducing the cost of VACUUM.  Those changes will come in a separate patch.

Original patch by Heikki Linnakangas, rework by Tom Lane.

13 Apr, 2006 1 commit

Fix an ancient oversight in btree xlog replay. When trying to determine if an · 49a7610c

Committed by Tom Lane on Apr 13, 2006

upper-level insertion completes a previously-seen split, we cannot simply grab
the downlink block number out of the buffer, because the buffer could contain
a later state of the page --- or perhaps the page doesn't even exist at all
any more, due to relation truncation. These possibilities have been masked up
to now because the use of full_page_writes effectively ensured that no xlog
replay routine ever actually saw a page state newer than its own change.
Since we're deprecating full_page_writes in 8.1.*, there's no need to fix this
in existing release branches, but we need a fix in HEAD if we want to have any
hope of re-allowing full_page_writes. Accordingly, adjust the contents of
btree WAL records so that we can always get the downlink block number from the
WAL record rather than having to depend on buffer contents. Per report from
Kevin Grittner and Peter Brant.

Improve a few comments in related code while at it.

01 Apr, 2006 2 commits

Remove the 'slow' path for btree index build, which built the btree · 89bda95d

Committed by Tom Lane on Apr 01, 2006

incrementally by successive inserts rather than by sorting the data.
We were only using the slow path during bootstrap, apparently because
when first written it failed during bootstrap --- but it works fine now
AFAICT. Removing it saves a hundred or so lines of code and produces
noticeably (~10%) smaller initial states of the system catalog indexes.
While that won't make much difference for heavily-modified catalogs,
for the more static ones there may be a useful long-term performance
improvement.

Clean up WAL/buffer interactions as per my recent proposal. Get rid of the · a8b8f4db

Committed by Tom Lane on Mar 31, 2006

misleadingly-named WriteBuffer routine, and instead require routines that
change buffer pages to call MarkBufferDirty (which does exactly what it says).
We also require that they do so before calling XLogInsert; this takes care of
the synchronization requirement documented in SyncOneBuffer. Note that
because bufmgr takes the buffer content lock (in shared mode) while writing
out any buffer, it doesn't matter whether MarkBufferDirty is executed before
the buffer content change is complete, so long as the content change is
completed before releasing exclusive lock on the buffer. So it's OK to set
the dirtybit before we fill in the LSN.
This eliminates the former kluge of needing to set the dirtybit in LockBuffer.
Aside from making the code more transparent, we can also add some new
debugging assertions, in particular that the caller of MarkBufferDirty must
hold the buffer content lock, not merely a pin.

24 Mar, 2006 1 commit

Arrange to emit a description of the current XLOG record as error context · 0a202070

Committed by Tom Lane on Mar 24, 2006

when an error occurs during xlog replay.  Also, replace the former risky
'write into a fixed-size buffer with no overflow detection' API for XLOG
record description routines; use an expansible StringInfo instead.  (The
latter accounts for most of the patch bulk.)

Qingqing Zhou
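
The move from a fixed-size buffer to an expansible one can be sketched with a small growable string buffer. This is not PostgreSQL's StringInfo; `DemoStringBuf` and its functions are hypothetical, and the sketch uses `malloc`/`realloc` and `abort` where the real code would use the backend's memory contexts and error reporting.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>

typedef struct DemoStringBuf
{
    char       *data;       /* always NUL-terminated */
    size_t      len;        /* length excluding the terminator */
    size_t      cap;        /* allocated bytes */
} DemoStringBuf;

void
demo_buf_init(DemoStringBuf *buf)
{
    buf->cap = 16;
    buf->len = 0;
    buf->data = malloc(buf->cap);
    if (buf->data == NULL)
        abort();
    buf->data[0] = '\0';
}

/* Append formatted text, growing the buffer instead of overflowing it. */
void
demo_buf_appendf(DemoStringBuf *buf, const char *fmt, ...)
{
    va_list     args;
    va_list     args_copy;
    int         needed;

    va_start(args, fmt);
    va_copy(args_copy, args);
    needed = vsnprintf(NULL, 0, fmt, args_copy);    /* measure first */
    va_end(args_copy);
    if (needed < 0)
        abort();

    if (buf->len + (size_t) needed + 1 > buf->cap)
    {
        size_t      new_cap = buf->cap;
        char       *new_data;

        while (new_cap < buf->len + (size_t) needed + 1)
            new_cap *= 2;
        new_data = realloc(buf->data, new_cap);
        if (new_data == NULL)
            abort();
        buf->data = new_data;
        buf->cap = new_cap;
    }

    vsnprintf(buf->data + buf->len, buf->cap - buf->len, fmt, args);
    va_end(args);
    buf->len += (size_t) needed;
}

void
demo_buf_free(DemoStringBuf *buf)
{
    free(buf->data);
    buf->data = NULL;
    buf->len = 0;
    buf->cap = 0;
}
```

A record-description routine written against this kind of interface never needs to know how large the caller's buffer is, which is the risk the commit removes.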

05 Mar, 2006 1 commit

Update copyright for 2006. Update scripts. · f2f5b056

Committed by Bruce Momjian on Mar 05, 2006

26 Jan, 2006 1 commit

Remove the no-longer-useful BTItem/BTItemData level of structure, and · c389760c

Committed by Tom Lane on Jan 25, 2006

just refer to btree index entries as plain IndexTuples, which is what
they have been for a very long time.  This is mostly just an exercise
in removing extraneous notation, but it does save a palloc/pfree cycle
per index insertion.

24 Jan, 2006 1 commit

Instead of using a numberOfRequiredKeys count to distinguish required · 7ccaf13a

Committed by Tom Lane on Jan 23, 2006

and non-required keys in a btree index scan, mark the required scankeys
with private flag bits SK_BT_REQFWD and/or SK_BT_REQBKWD. This seems
at least marginally clearer to me, and it eliminates a wired-into-the-
data-structure assumption that required keys are consecutive. Even though
that assumption will remain true for the foreseeable future, having it
in there makes the code seem more complex than necessary.
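
Marking required keys with per-key flag bits, rather than a count of leading keys, can be sketched as below. The flag values, `DemoScanKey`, and `demo_failure_ends_scan` are hypothetical stand-ins. The sketch assumes the usual meaning of "required": once a key that is required for the current scan direction fails, no later tuple in that direction can match, so the scan may stop.

```c
#include <assert.h>
#include <stdbool.h>

/* Illustrative flag bits; the real values in the source tree may differ. */
#define DEMO_SK_REQFWD   0x0001     /* required to continue a forward scan */
#define DEMO_SK_REQBKWD  0x0002     /* required to continue a backward scan */

typedef enum
{
    DEMO_FORWARD,
    DEMO_BACKWARD
} DemoScanDir;

typedef struct DemoScanKey
{
    int         attno;
    int         flags;
} DemoScanKey;

/* Given that this key just failed, can the scan stop in this direction? */
bool
demo_failure_ends_scan(const DemoScanKey *key, DemoScanDir dir)
{
    if (dir == DEMO_FORWARD)
        return (key->flags & DEMO_SK_REQFWD) != 0;
    return (key->flags & DEMO_SK_REQBKWD) != 0;
}
```

Because each key carries its own marking, nothing in this structure assumes that the required keys are consecutive.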

08 Dec, 2005 1 commit

Push the responsibility for handling ignore_killed_tuples down into · cefcbbf1

Committed by Tom Lane on Dec 07, 2005

_bt_checkkeys(), instead of checking it in the top-level nbtree.c routines
as formerly.  This saves a little bit of loop overhead, but more importantly
it lets us skip performing the index key comparisons for dead tuples.

07 Nov, 2005 1 commit

Add defenses to btree and hash index AMs to do simple sanity checks · 766dc45d

Committed by Tom Lane on Nov 06, 2005

on every index page they read; in particular to catch the case of an
all-zero page, which PageHeaderIsValid allows to pass. It turns out
hash already had this idea, but it was just Assert()ing things rather
than doing a straight error check, and the Asserts were partially
redundant with PageHeaderIsValid anyway. Per recent failure example
from Jim Nasby. (gist still needs the same treatment.)
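
A check of the kind described here, written as a real test rather than an assertion, can be sketched as follows. The names (`DemoPageStatus`, `demo_page_is_zero`, `demo_check_index_page`) and the simplified notion of "special space size" are assumptions for illustration. The actual code reports an error through the backend's error machinery rather than returning a status code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

typedef enum
{
    DEMO_PAGE_OK,
    DEMO_PAGE_ALL_ZERO,
    DEMO_PAGE_BAD_SPECIAL
} DemoPageStatus;

/* True if every byte of the page is zero. */
bool
demo_page_is_zero(const unsigned char *page, size_t page_size)
{
    for (size_t i = 0; i < page_size; i++)
    {
        if (page[i] != 0)
            return false;
    }
    return true;
}

/* Always-on sanity check: unlike an assertion, it is not compiled out. */
DemoPageStatus
demo_check_index_page(const unsigned char *page, size_t page_size,
                      size_t special_offset, size_t expected_special_size)
{
    if (demo_page_is_zero(page, page_size))
        return DEMO_PAGE_ALL_ZERO;
    if (special_offset > page_size ||
        page_size - special_offset != expected_special_size)
        return DEMO_PAGE_BAD_SPECIAL;
    return DEMO_PAGE_OK;
}
```

The all-zero case gets its own result because, as the commit notes, a generic header validity check lets such a page through.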

15 Oct, 2005 1 commit

Standard pgindent run for 8.1. · 1dc34982

Committed by Bruce Momjian on Oct 15, 2005

07 Jun, 2005 1 commit

Remove the mostly-stubbed-out-anyway support routines for WAL UNDO. · 4c8495a1

Committed by Tom Lane on Jun 06, 2005

That code is never going to be used in the foreseeable future, and
where it's more than a stub it's making the redo routines harder to
read.
