提交 · 3637c5d4e70cd06c1220c1b9280ba4d7aea8d9bc · Greenplum / Gpdb

27 6月, 2018 1 次提交
- A
  Remove beginLoc from rm_desc routines signature. · 14b79227
  由 Ashwin Agrawal 提交于 6月 21, 2018
```
Upstream doesn't have it and not used anymore in Greenplum, so loose it.
```
  14b79227
16 3月, 2018 1 次提交

Revert "XLOG_HINT integration with persistent table (PT)" · 8c8b5c39

由 Taylor Vesely 提交于 3月 14, 2018

This reverts commit a21da89e. These
changes are obsolete now that persistent tables have been removed.
Co-authored-by: NAsim R P <apraveen@pivotal.io>

8c8b5c39

07 3月, 2018 1 次提交

Don't access relcache entry after closing it. · 64cbb6b2

由 Heikki Linnakangas 提交于 3月 07, 2018

This is essentially a use-after-free bug. Found by running regression
tests with CLOBBER_CACHE_ALWAYS and RELCACHE_FORCE_RELEASE.

64cbb6b2

13 1月, 2018 2 次提交

Fix masking of sequence values for consistency checks · d3b9043f

由 Daniel Gustafsson 提交于 1月 08, 2018

SEQ_LOG_VALS are pre-logged to avoid logging every fetch from a
sequence, so we need to mask out last_value and log_cnt for sequence
pages before running consistency checks.

d3b9043f

Remove a lot of persistent table and mirroring stuff. · 5c158ff3

由 Heikki Linnakangas 提交于 12月 14, 2017

* Revert almost all the changes in smgr.c / md.c, to not go through
  the Mirrored* APIs.

* Remove mmxlog stuff. Use upstream "pending relation deletion" code
  instead.

* Get rid of multiple startup passes. Now it's just a single pass like
  in the upstream.

* Revert the way database drop/create are handled to the way it is in
  upstream. Doesn't use PT anymore, but accesses file system directly,
  and WAL-logs a single CREATE/DROP DATABASE WAL record.

* Get rid of MirroredLock

* Remove a few tests that were specific to persistent tables.

* Plus a lot of little removals and reverts to upstream code.

5c158ff3

14 11月, 2017 1 次提交

Cherry-pick 'Avoid transient bogus page contents when creating a sequence.' · 5ebd8cdf

由 Jimmy Yih 提交于 11月 10, 2017

The sequence xlog record generated at the end of DefineSequence
records the local tuple->t_data when the actual page item is
available. Not using the actual page item caused gp_replica_check
extension to fail for sequences because the tuple->t_data ctid would
be replicated as (0,0) instead of the actual page item ctid
(0,1). This is because tuple->t_data is used in the xlog record
creation but its t_ctid is not set even if tuple->t_self is fine.

To fix this, we cherry-pick and modify upstream fix.

Reference commit message:
commit 8d34f686
Author: Heikki Linnakangas <heikki.linnakangas@iki.fi>
Date:   Tue Apr 22 09:50:47 2014 +0300

    Avoid transient bogus page contents when creating a sequence.

    Don't use simple_heap_insert to insert the tuple to a sequence relation.
    simple_heap_insert creates a heap insertion WAL record, and replaying that
    will create a regular heap page without the special area containing the
    sequence magic constant, which is wrong for a sequence. That was not a bug
    because we always created a sequence WAL record after that, and replaying
    that overwrote the bogus heap page, and the transient state could never be
    seen by another backend because it was only done when creating a new
    sequence relation. But it's simpler and cleaner to avoid that in the first
    place.

5ebd8cdf

02 9月, 2017 1 次提交

Add WAL consistency checking facility. · aa459523

由 Ashwin Agrawal 提交于 7月 14, 2017

This is majorly backport of Postgres commit.
------------------------------------------------
commit a507b869
Author: Robert Haas <rhaas@postgresql.org>
Date:   Wed Feb 8 15:45:30 2017 -0500

    Add WAL consistency checking facility.

    When the new GUC wal_consistency_checking is set to a non-empty value,
    it triggers recording of additional full-page images, which are
    compared on the standby against the results of applying the WAL record
    (without regard to those full-page images).  Allowable differences
    such as hints are masked out, and the resulting pages are compared;
    any difference results in a FATAL error on the standby.

    Kuntal Ghosh, based on earlier patches by Michael Paquier and Heikki
    Linnakangas.  Extensively reviewed and revised by Michael Paquier and
    by me, with additional reviews and comments from Amit Kapila, Álvaro
    Herrera, Simon Riggs, and Peter Eisentraut.
------------------------------------------------

Its modified to work with current xlog format of Greenplum, which is different
from Postgres code when this patch was committed. Main changes are to fit
current backup block format in xlog records. Also, some masking routines differ
compared to upstream as some of the masked flags would come once Greenplum
catches up to latest versions of Postgres.

aa459523

01 9月, 2017 1 次提交

Fix Copyright and file headers across the tree · ed7414ee

由 Daniel Gustafsson 提交于 9月 01, 2017

This bumps the copyright years to the appropriate years after not
having been updated for some time. Also reformats existing code
headers to match the upstream style to ensure consistency.

ed7414ee

24 8月, 2017 1 次提交

Remove obsolete FIXME comments. · a37c6612

由 Heikki Linnakangas 提交于 8月 24, 2017

When the new OID-dispatching mechanism was introduced (commit f9016da2),
a lot of FIXME comments were left in calls to CdbDispatchUtilityStatement.
These were places where I wasn't sure if the command that was executed
might create new objects, with new OIDs assigned to them, that would need
to be dispatched to the QE. I've now skimmed through the call sites, and
checked that they are all for ALTER or DROP commands that should not create
new OIDs, so remove the FIXME comments.

a37c6612

05 8月, 2017 1 次提交

XLOG_HINT integration with persistent table (PT) · a21da89e

由 Ashwin Agrawal 提交于 8月 03, 2017

We add the PT information in addition to the backup block for proper recovery.
In this case, during XLog redo, the blocks of the dropped tables will not be
restored after consulting PT.

In order to fetch the PT information, we have to pass Relation to
MarkBufferDirtyHint(). For that interface change, we refactored
MarkBufferDirtyHint() with additional Relation parameter. The relation
information is eventually passed to XLogSaveBufferForHint() to fetch the PT
information when preparing the XLOG_HINT record.

a21da89e

03 8月, 2017 2 次提交

Allow I/O reliability checks using 16-bit checksums · ed0efd2a

由 Simon Riggs 提交于 3月 22, 2013

Checksums are set immediately prior to flush out of shared buffers
and checked when pages are read in again. Hint bit setting will
require full page write when block is dirtied, which causes various
infrastructure changes. Extensive comments, docs and README.

WARNING message thrown if checksum fails on non-all zeroes page;
ERROR thrown but can be disabled with ignore_checksum_failure = on.

Feature enabled by an initdb option, since transition from option off
to option on is long and complex and has not yet been implemented.
Default is not to use checksums.

Checksum used is WAL CRC-32 truncated to 16-bits.

Simon Riggs, Jeff Davis, Greg Smith
Wide input and assistance from many community members. Thank you.

(cherry picked from commit 96ef3b8f)

ed0efd2a

Remove PageSetTLI and rename pd_tli to pd_checksum · 626df6b4

由 Simon Riggs 提交于 3月 18, 2013

Remove use of PageSetTLI() from all page manipulation functions
and adjust README to indicate change in the way we make changes
to pages. Repurpose those bytes into the pd_checksum field and
explain how that works in comments about page header.

Refactoring ahead of actual feature patch which would make use
of the checksum field, arriving later.

Jeff Davis, with comments and doc changes by Simon Riggs
Direction suggested by Robert Haas; many others providing
review comments.

(cherry picked from bb7cc262)

626df6b4

23 5月, 2017 1 次提交
- K
  Renamed `typname` or other variant to `typeName`. · 29bdd6ef
  由 Karthikeyan Jambu Rajaraman 提交于 5月 22, 2017
```
This way we are consistent now with the upstream Postgres.
```
  29bdd6ef
27 1月, 2017 1 次提交

Locking on sequence relation is not required for sequence server · 957d93af

由 Abhijit Subramanya 提交于 1月 25, 2017

Postgres merge introduced init_sequence() which can optionally lock sequence
relation.

In GPDB, single sequence server instance is used to generate sequence values
for all requests coming from segments, hence it doesn't require a lock on
sequence relation.

There are three concurrent scenarios when using sequence:
Scenario A: concurrent requests from segments:
create table t1 (c int, d serial) distributed by (c);
insert into t1 select i from generate_series(1, 100) i;

Scenario B: concurrent requests from master:
tx1: select nextval('t1_c_seq'::regclass);
tx2: select nextval('t1_c_seq'::regclass);

Scenario C: concurrent requests from both master and segments
tx1: select nextval('t1_c_seq'::regclass);
tx2: insert into t1 values (200, default);

Scenario A is protected by the single instance of sequence server.
Scenario B and C are protected by the BUFFER_LOCK_EXCLUSIVE on shared
buffer of the sequence relation.

With that said, we don't need to hold additional lock on sequence relation.
Signed-off-by: NXin Zhang <xzhang@pivotal.io>

957d93af

07 11月, 2016 1 次提交

Revamp the way OIDs are dispatched to segments on CREATE statements. · f9016da2

由 Heikki Linnakangas 提交于 11月 07, 2016

Instead of carrying a "new OID" field in all the structs that represent
CREATE statements, introduce a generic mechanism for capturing the OIDs
of all created objects, dispatching them to the QEs, and using those same
OIDs when the corresponding objects are created in the QEs. This allows
removing a lot of scattered changes in DDL command handling, that was
previously needed to ensure that objects are assigned the same OIDs in
all the nodes.

This also provides the groundwork for pg_upgrade to dictate the OIDs to use
for upgraded objects. The upstream has mechanisms for pg_upgrade to dictate
the OIDs for a few objects (relations and types, at least), but in GPDB,
we need to preserve the OIDs of almost all object types.

f9016da2

01 11月, 2016 1 次提交

Fix gcc warnings on misleading indentation. · cdfe1917

由 Heikki Linnakangas 提交于 11月 01, 2016

In many places where we had used a mixture of spaces and tabs for
indentation, new versions of gcc complained about misleading indentation,
because gcc doesn't know we're using tab width of 4. To fix, make the
indentation consistent in all the places where gcc gave a warning. Would
be nice to fix it all around, but that's a lot of work, so let's do it
in a piecemeal fashion whenever we run into issues or need to modify a
piece of code anyway.

For some files, especially the GPDB-specific ones, I ran pgindent over
the whole file. I used the pgindent from PostgreSQL master, which is
slightly different from what was used back 8.3 days, but that's what I had
easily available, and that's what we're heading to in the future anyway.
In some cases, I didn't commit the pgindented result if there were
funnily formatted code or comments that would need special treatment.

For other places, I fixed the indentation locally, just enough to make the
warnings go away.

I also did a tiny bit of other trivial cleanup, that I happened to spot
while working on this, although I tried to refrain from anything more
extensive.

cdfe1917

04 10月, 2016 1 次提交

Remove pg_aoseg and pg_aocsseg index references · 6580341a

由 Jimmy Yih 提交于 9月 19, 2016

The pg_aoseg and pg_aocsseg index scans were replaced with heap
scans. Therefore, we no longer need the pg_aoseg and pg_aocsseg
indexes. This commit removes all references to them in the code and
catalog.

Authors: Jimmy Yih, Asim Praveen, and Abhijit Subramanya

6580341a

01 9月, 2016 1 次提交

Check for zero persistentTID in xlog record · 9b2eb416

由 Ashwin Agrawal 提交于 8月 22, 2016

Refactor code to use common routine to fetch PT info for xlogging. Check can
be easliy added at this common place to validate persistent info is
available. Plus still add check during recovery for persistentTID zero. As
with postgres upstream merges possible the function to populate persistent info is
not called at all, so this check will not hit during xlog record construction
but atleast gives clear clue during recovery.

9b2eb416

16 8月, 2016 1 次提交
- H
  Revert a bunch of CaQL calls with upstream code. · 6d7f09f3
  由 Heikki Linnakangas 提交于 8月 16, 2016
```
To reduce our diff vs upstream, to make diffing and merging easier.
```
  6d7f09f3
25 7月, 2016 1 次提交

Refactor utility statement dispatch interfaces · 01769ada

由 Pengzhou Tang 提交于 7月 08, 2016

refactor CdbDispatchUtilityStatement() to make it flexible for cdbCopyStart(),
dispatchVacuum() to call directly. Introduce flags like DF_NEED_TWO_SNAPSHOT,
DF_WITH_SNAPSHOT, DF_CANCEL_ON_ERROR to make function call much clearer

01769ada

28 5月, 2016 1 次提交

Remove Debug_check_for_invalid_persistent_tid option. · b3ec6f00

由 Heikki Linnakangas 提交于 5月 28, 2016

The checks for invalid TIDs are very cheap, a few CPU instructions. It's
better to catch bugs involving invalid TID early, so let's always check
for them.

The LOGs in storeAttrDefault() that were also tied to this GUC seemed
oddly specific. They were probably added long time ago to hunt for some
particular bug, and don't seem generally useful, so I just removed them.

b3ec6f00

19 5月, 2016 1 次提交

Split cdbdisp.c into several files, and put them into a new · 895b7d50

由 Pengzhou Tang 提交于 5月 12, 2016

dispatcher/ directory

This commit has no logic change, it just contains movement of code across
files, to make dispatcher code clearer, and easier for unit testing.

Signed-off-by: Kenan Yao

895b7d50

12 2月, 2016 2 次提交

H
Misc header file cleanup · 442c105e
由 Heikki Linnakangas 提交于 2月 11, 2016
```
Remove unnecessary #includes, add #includes that are actually needed by
some headers.
```
442c105e

Remove unnecessary #includes. · 9aa7a22f

由 Heikki Linnakangas 提交于 2月 11, 2016

In cdbcat.h, include only the header files that are actually needed for
the single function prototype in that file. And don't include cdbcat.h
unnecessarily. A couple of .c files were including cdbcat.h to get
GpPolicy, but that's actually defined in catalog/gp_policy.h, so #include
that directly instead where needed.

9aa7a22f

20 11月, 2015 1 次提交

Error out when a sequence with NO CYCLE specified reaches maxvalue. · fa6cbd47

由 Nikos Armenatzoglou 提交于 11月 19, 2015

In a sequence, if NO CYCLE is specified, any calls to nextval after the sequence has reached its maximum value will return an error.

fa6cbd47

28 10月, 2015 1 次提交
- I
  
  Import Greenplum source code. · 6b0e52be
  由 Initial Greenplum code dump 提交于 10月 23, 2015
  
  6b0e52be
26 7月, 2012 1 次提交

Fix longstanding crash-safety bug with newly-created-or-reset sequences. · 8468bcc8

由 Tom Lane 提交于 7月 25, 2012

If a crash occurred immediately after the first nextval() call for a serial
column, WAL replay would restore the sequence to a state in which it
appeared that no nextval() had been done, thus allowing the first sequence
value to be returned again by the next nextval() call; as reported in
bug #6748 from Xiangming Mei.

More generally, the problem would occur if an ALTER SEQUENCE was executed
on a freshly created or reset sequence. (The manifestation with serial
columns was introduced in 8.2 when we added an ALTER SEQUENCE OWNED BY step
to serial column creation.) The cause is that sequence creation attempted
to save one WAL entry by writing out a WAL record that made it appear that
the first nextval() had already happened (viz, with is_called = true),
while marking the sequence's in-database state with log_cnt = 1 to show
that the first nextval() need not emit a WAL record. However, ALTER
SEQUENCE would emit a new WAL entry reflecting the actual in-database state
(with is_called = false). Then, nextval would allocate the first sequence
value and set is_called = true, but it would trust the log_cnt value and
not emit any WAL record. A crash at this point would thus restore the
sequence to its post-ALTER state, causing the next nextval() call to return
the first sequence value again.

To fix, get rid of the idea of logging an is_called status different from
reality. This means that the first nextval-driven WAL record will happen
at the first nextval call not the second, but the marginal cost of that is
pretty negligible. In addition, make sure that ALTER SEQUENCE resets
log_cnt to zero in any case where it touches sequence parameters that
affect future nextval results. This will result in some user-visible
changes in the contents of a sequence's log_cnt column, as reflected in the
patch's regression test changes; but no application should be depending on
that anyway, since it was already true that log_cnt changes rather
unpredictably depending on checkpoint timing.

In addition, make some basically-cosmetic improvements to get rid of
sequence.c's undesirable intimacy with page layout details. It was always
really trying to WAL-log the contents of the sequence tuple, so we should
have it do that directly using a HeapTuple's t_data and t_len, rather than
backing into it with some magic assumptions about where the tuple would be
on the sequence's page.

Back-patch to all supported branches.

8468bcc8

10 6月, 2011 1 次提交
- B
  
  Pgindent run before 9.1 beta2. · 6560407c
  由 Bruce Momjian 提交于 6月 09, 2011
  
  6560407c
03 6月, 2011 2 次提交

Clean up after erroneous SELECT FOR UPDATE/SHARE on a sequence. · ea6eda64

由 Tom Lane 提交于 6月 02, 2011

My previous commit disallowed this operation, but did nothing about
cleaning up the damage if one had already been done. With the operation
disallowed, it's okay to just forcibly clear xmax in a sequence's tuple,
since any value seen there could not represent a live transaction's lock.
So, any sequence-specific operation will repair the problem automatically,
whether or not the user has already seen "could not access status of
transaction" failures.

ea6eda64

Clean up after erroneous SELECT FOR UPDATE/SHARE on a sequence. · 19fed642

由 Tom Lane 提交于 6月 02, 2011

19fed642

10 4月, 2011 1 次提交
- B
  
  pgindent run before PG 9.1 beta 1. · bf50caf1
  由 Bruce Momjian 提交于 4月 10, 2011
  
  bf50caf1
27 3月, 2011 1 次提交

Clean up cruft around collation initialization for tupdescs and scankeys. · 7208fae1

由 Tom Lane 提交于 3月 26, 2011

I found actual bugs in GiST and plpgsql; the rest of this is cosmetic
but meant to decrease the odds of future bugs of omission.

7208fae1

10 3月, 2011 1 次提交

Remove collation information from TypeName, where it does not belong. · a051ef69

由 Tom Lane 提交于 3月 09, 2011

The initial collations patch treated a COLLATE spec as part of a TypeName,
following what can only be described as brain fade on the part of the SQL
committee. It's a lot more reasonable to treat COLLATE as a syntactically
separate object, so that it can be added in only the productions where it
actually belongs, rather than needing to reject it in a boatload of places
where it doesn't belong (something the original patch mostly failed to do).
In addition this change lets us meet the spec's requirement to allow
COLLATE anywhere in the clauses of a ColumnDef, and it avoids unfriendly
behavior for constructs such as "foo::type COLLATE collation".

To do this, pull collation information out of TypeName and put it in
ColumnDef instead, thus reverting most of the collation-related changes in
parse_type.c's API. I made one additional structural change, which was to
use a ColumnDef as an intermediate node in AT_AlterColumnType AlterTableCmd
nodes. This provides enough room to get rid of the "transform" wart in
AlterTableCmd too, since the ColumnDef can carry the USING expression
easily enough.

Also fix some other minor bugs that have crept in in the same areas,
like failure to copy recently-added fields of ColumnDef in copyfuncs.c.

While at it, document the formerly secret ability to specify a collation
in ALTER TABLE ALTER COLUMN TYPE, ALTER TYPE ADD ATTRIBUTE, and
ALTER TYPE ALTER ATTRIBUTE TYPE; and correct some misstatements about
what the default collation selection will be when COLLATE is omitted.

BTW, the three-parameter form of format_type() should go away too,
since it just contributes to the confusion in this area; but I'll do
that in a separate patch.

a051ef69

23 2月, 2011 1 次提交

Fix a couple of unlogged tables goofs. · 3e6b305d

由 Robert Haas 提交于 2月 22, 2011

"SELECT ... INTO UNLOGGED tabname" works, but wasn't documented; CREATE
UNLOGGED SEQUENCE and CREATE UNLOGGED VIEW failed an assertion, instead
of throwing a sensible error.

Latter issue reported by Itagaki Takahiro; patch review by Tom Lane.

3e6b305d

09 2月, 2011 1 次提交

Per-column collation support · 414c5a2e

由 Peter Eisentraut 提交于 2月 08, 2011

This adds collation support for columns and domains, a COLLATE clause
to override it per expression, and B-tree index support.

Peter Eisentraut
reviewed by Pavel Stehule, Itagaki Takahiro, Robert Haas, Noah Misch

414c5a2e

02 1月, 2011 2 次提交

Implement remaining fields of information_schema.sequences view · 39b88432

由 Peter Eisentraut 提交于 1月 02, 2011

Add new function pg_sequence_parameters that returns a sequence's start,
minimum, maximum, increment, and cycle values, and use that in the view.
(bug #5662; design suggestion by Tom Lane)

Also slightly adjust the view's column order and permissions after review of
SQL standard.

39b88432

B

Stamp copyrights for year 2011. · 5d950e3b
由 Bruce Momjian 提交于 1月 01, 2011

5d950e3b

14 12月, 2010 1 次提交

Generalize concept of temporary relations to "relation persistence". · 5f7b58fa

由 Robert Haas 提交于 12月 13, 2010

This commit replaces pg_class.relistemp with pg_class.relpersistence;
and also modifies the RangeVar node type to carry relpersistence rather
than istemp. It also removes removes rd_istemp from RelationData and
instead performs the correct computation based on relpersistence.

For clarity, we add three new macros: RelationNeedsWAL(),
RelationUsesLocalBuffers(), and RelationUsesTempNamespace(), so that we
can clarify the purpose of each check that previous depended on
rd_istemp.

This is intended as infrastructure for the upcoming unlogged tables
patch, as well as for future possible work on global temporary tables.

5f7b58fa

18 11月, 2010 1 次提交

Make TRUNCATE ... RESTART IDENTITY restart sequences transactionally. · 511e902b

由 Tom Lane 提交于 11月 17, 2010

In the previous coding, we simply issued ALTER SEQUENCE RESTART commands,
which do not roll back on error. This meant that an error between
truncating and committing left the sequences out of sync with the table
contents, with potentially bad consequences as were noted in a Warning on
the TRUNCATE man page.

To fix, create a new storage file (relfilenode) for a sequence that is to
be reset due to RESTART IDENTITY. If the transaction aborts, we'll
automatically revert to the old storage file. This acts just like a
rewriting ALTER TABLE operation. A penalty is that we have to take
exclusive lock on the sequence, but since we've already got exclusive lock
on its owning table, that seems unlikely to be much of a problem.

The interaction of this with usual nontransactional behaviors of sequence
operations is a bit weird, but it's hard to see what would be completely
consistent. Our choice is to discard cached-but-unissued sequence values
both when the RESTART is executed, and at rollback if any; but to not touch
the currval() state either time.

In passing, move the sequence reset operations to happen before not after
any AFTER TRUNCATE triggers are fired. The previous ordering was not
logically sensible, but was forced by the need to minimize inconsistency
if the triggers caused an error. Transactional rollback is a much better
solution to that.

Patch by Steve Singer, rather heavily adjusted by me.

511e902b

15 11月, 2010 1 次提交
- R
  Cleanup various comparisons with the constant "true". · 5aa446c9
  由 Robert Haas 提交于 11月 14, 2010
```
Itagaki Takahiro, with slight modifications.
```
  5aa446c9