1. 19 May 2016, 7 commits
  2. 18 May 2016, 5 commits
    • Revert changes related to backend shutdown. · af7b1b51
      Committed by Heikki Linnakangas
      There were a bunch of changes vs. upstream in the way the PGPROC free list
      was managed, and the way backend exit was handled. They seemed largely
      unnecessary, and somewhat buggy, so I reverted them. Avoiding unnecessary
      differences makes merging with upstream easier too.
      
      * The freelist was protected by atomic operations instead of a spinlock.
      There was an ABA problem in the implementation, however. In Prepend(), if
      another backend grabbed the PGPROC we were just about to grab for ourselves,
      and returned it to the freelist before we iterate and notice, we might
      set the head of the free list to a PGPROC that's actually already in use.
      It's a tight window, and backend startup is quite heavy, so that's unlikely
      to happen in practice. Still, it's a bug. Because backend start up is such
      a heavy operation, this codepath is not so performance-critical that you
      would gain anything from using atomic operations instead of a spinlock, so
      just switch back to using a spinlock like in the upstream.
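      The ABA hazard described above can be sketched as follows. This is a
      simplified illustration with hypothetical names, not the actual PGPROC
      code; GCC __atomic builtins stand in for whatever atomics the reverted
      code used:

```c
#include <assert.h>
#include <stddef.h>

/* Sketch of a lock-free freelist pop/push (hypothetical names).
 * The ABA hazard: between reading 'head' and the compare-and-swap,
 * another thread may pop head, pop head->next, and push head back.
 * The CAS still succeeds because head matches, but the 'next' we
 * read is stale, so the list head can end up pointing at an entry
 * that is already in use. */
typedef struct Proc { struct Proc *next; int in_use; } Proc;

static Proc *freelist_head;

static Proc *freelist_pop(void)
{
    Proc *head, *next;
    do {
        head = freelist_head;
        if (head == NULL)
            return NULL;
        next = head->next;      /* may be stale by the time CAS runs */
    } while (!__atomic_compare_exchange_n(&freelist_head, &head, next,
                                          0, __ATOMIC_SEQ_CST,
                                          __ATOMIC_SEQ_CST));
    head->in_use = 1;
    return head;
}

static void freelist_push(Proc *p)
{
    p->in_use = 0;
    do {
        p->next = freelist_head;
    } while (!__atomic_compare_exchange_n(&freelist_head, &p->next, p,
                                          0, __ATOMIC_SEQ_CST,
                                          __ATOMIC_SEQ_CST));
}
```

      The window: thread 1 reads head and head->next; thread 2 pops head,
      pops next, and pushes head back; thread 1's CAS then succeeds and
      installs the stale next pointer.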
      
      * When a backend exited, the responsibility to recycle the PGPROC entry
      to the free list was moved to the postmaster, from the backend itself.
      That's not broken per se, AFAICS, but it violates the general principle of
      avoiding shared memory access in postmaster.
      
      * There was a dead-man's switch, in the form of the postmasterResetRequired
      flag in the PGPROC entry. If a backend died unexpectedly, and the flag
      was set, postmaster would restart the whole server. If the flag was not
      set, it would clean up only the PGPROC entry that was left behind and
      let the system run normally. However, the flag was in fact always set,
      except after ProcKill had already run, i.e. when the process had exited
      normally. So I don't see the point of that, we might as well rely on the
      exit status to signal normal/abnormal exit, like we do in the upstream. That
      has worked fine for PostgreSQL.
      
      * There was one more case where the dead-man's switch was activated, even
      though the backend exited normally: In AuxiliaryProcKill(), if a filerep
      subprocess died, and it didn't have a parent process anymore. That means
      that the master filerep process had already died unexpectedly (filerep
      subprocesses are children of the master filerep process, not direct
      children of postmaster).
      That seems unnecessary, however: if the filerep process had died
      unexpectedly, the postmaster should wake up to that, and would restart
      the server. To play it safe, though, make the subprocess exit with non-zero
      exit status in that case, so that the postmaster will wake up to that, if
      it didn't notice the master filerep process dying for some reason.
      
      * HaveNFreeProcs() was rewritten by maintaining the number of entries
      in the free list in a variable, instead of walking the list to count them.
      Presumably to make backend startup cheaper, when max_connections is high.
      I kept that, but it's slightly simpler now that we use a spinlock to protect
      the free list again: no need to use atomic ops for the variable anymore.
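      The arrangement reverted to can be sketched like this (hypothetical
      names; a GCC test-and-set flag stands in for PostgreSQL's
      SpinLockAcquire/SpinLockRelease). The maintained counter is what keeps
      the HaveNFreeProcs() check O(1):

```c
#include <assert.h>
#include <stddef.h>

/* Sketch of a spinlock-protected freelist with a maintained count
 * (hypothetical names, not the actual PGPROC code). */
typedef struct Proc { struct Proc *next; } Proc;

static char  proc_lock;     /* spinlock flag */
static Proc *free_head;
static int   nfree;         /* maintained count of free entries */

static void spin_acquire(char *l)
{
    while (__atomic_test_and_set(l, __ATOMIC_ACQUIRE)) { /* spin */ }
}

static void spin_release(char *l)
{
    __atomic_clear(l, __ATOMIC_RELEASE);
}

static Proc *get_free_proc(void)
{
    spin_acquire(&proc_lock);
    Proc *p = free_head;
    if (p)
    {
        free_head = p->next;
        nfree--;
    }
    spin_release(&proc_lock);
    return p;
}

static void put_free_proc(Proc *p)
{
    spin_acquire(&proc_lock);
    p->next = free_head;
    free_head = p;
    nfree++;
    spin_release(&proc_lock);
}

/* O(1) thanks to the counter: no need to walk the list. */
static int have_n_free_procs(int n)
{
    spin_acquire(&proc_lock);
    int ok = (nfree >= n);
    spin_release(&proc_lock);
    return ok;
}
```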
      
      * The autovacFreeProcs list was not used. Autovacuum workers got their
      PGPROC entry from the regular free list. Fix that, and also add
      missing InitSharedLatch() call to the initialization of the autovacuum
      workers list.
    • c5737234
    • Add support for array types in codegen_utils. · a5cfefd9
      Committed by Shreedhar Hardikar
      This is especially useful for getting a pointer to a member of a
      structure when that member is an embedded array.
    • Fix static_assert call · 2b883384
      Committed by Shreedhar Hardikar
  3. 17 May 2016, 3 commits
  4. 16 May 2016, 1 commit
    • Update gparray.py · 308f0c8c
      Committed by water32
      The old code queried each instance's database data directory separately,
      one query per instance, which generated many log entries (for example,
      when running gpstate). The new code fetches all data directory
      information in a single query, maps the data into a key -> array
      structure, and then sets it on the segments object, replacing many
      queries with one.
  5. 13 May 2016, 16 commits
    • Add a rudimentary regression test suite for Distributed Transactions. · 4229f800
      Committed by Heikki Linnakangas
      These tests use the existing fault injection mechanism built into the
      server, to cause errors to happen at strategic places, and checks the
      results.
      
      This is almost just a placeholder; there are very few actual tests for
      now. But it's a start.
      
      The suite uses plain old pg_regress to run the tests and check the
      results. That's enough for the tests included here, but in the future
      we'll probably want to do server restarts, crashes, etc. as part of
      the suite, and will have to refactor this to something that can do those
      things more easily. But let's cross that bridge when we get there.
      
      Also, the test actually leaves the connections to the segments in a
      funny state, which shouldn't really happen. The test fails currently
      because of that; let's fix it together with the state issue. But
      even in this state, this has been useful to me right now, to reproduce
      an issue on the merge_8_3_22 branch that I'm working on at the same
      time (this test currently causes a PANIC there).
      
      This also isn't hooked up to any top-level targets yet; you have to
      run the suite manually from the src/test/dtm directory.
    • Avoid infinite loop on error. · 4624dcc5
      Committed by Heikki Linnakangas
      While hacking, I ran into the "Expected a CREATE for file-system object
      name" error. But instead of printing the error, it got into an infinite
      loop. smgrSortDeletesList() elog(ERROR)'d out of the function while it
      was in the middle of putting the linked list back together, leaving the
      pendingDeletes list corrupt, with a loop. AbortTransaction() processing
      called smgrIsPendingFileSysWork(), which traversed the list, and it got
      stuck in the loop.
      
      To avoid that, don't leave the list in an invalid state on error. I don't
      know why I ran into the error in the first place, but that's a different
      story.
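      The general pattern for avoiding this kind of corruption can be
      sketched as follows (simplified, hypothetical names; not the actual
      smgr code): do everything that can fail before touching any links, so
      an error can never leave the list half-relinked:

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Sketch: sort a singly-linked list without ever leaving it in an
 * invalid state.  Fallible work (allocation, validation) happens
 * first; the in-place relinking is a single pass that cannot fail. */
typedef struct Node { int key; struct Node *next; } Node;

static int cmp_node(const void *a, const void *b)
{
    return (*(Node *const *)a)->key - (*(Node *const *)b)->key;
}

/* Returns the new head; never leaves the list half-linked. */
static Node *sort_list(Node *head)
{
    int n = 0;
    for (Node *p = head; p; p = p->next)
        n++;
    if (n < 2)
        return head;

    /* Fallible step first: if allocation fails, the original
     * list is still intact. */
    Node **arr = malloc(n * sizeof(Node *));
    if (arr == NULL)
        return head;

    int i = 0;
    for (Node *p = head; p; p = p->next)
        arr[i++] = p;
    qsort(arr, n, sizeof(Node *), cmp_node);

    /* Infallible relink in one pass. */
    for (i = 0; i < n - 1; i++)
        arr[i]->next = arr[i + 1];
    arr[n - 1]->next = NULL;
    head = arr[0];
    free(arr);
    return head;
}
```

      The buggy code instead relinked as it went, so an error thrown midway
      left a cycle that the later traversal followed forever.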
    • Clean up the way the results array is allocated in cdbdisp_returnResults(). · 6a28c978
      Committed by Heikki Linnakangas
      I saw the "nresults < nslots" assertion fail, while hacking on something
      else. It happened when a Distributed Prepare command failed, and there were
      several error result sets from a segment. I'm not sure how normal it is to
      receive multiple ERROR responses to a single query, but the protocol
      certainly allows it, and I don't see any explanation for why the code used
      to assume that there can be at most 2 result sets from each segment.
      
      Remove that assumption, and make the code cope with more than two result
      sets from a segment, by calculating the required size of the array
      accurately.
      
      In passing, remove the NULL terminator from the array, and change the
      callers that depended on it to use the returned size variable instead.
      This makes the loops in the callers look less funky.
    • Fix memory leak in gp_read_error_log(). · c04d827d
      Committed by Heikki Linnakangas
      The code incorrectly called free() on the last+1 element of the array. The
      array returned by cdbdisp_dispatchRMCommand() always has a NULL element as
      terminator, and free(NULL) is a no-op, which is why this didn't outright
      crash. But clearly the intention here was to free() the array itself,
      otherwise it's leaked.
    • Update the bug report email listed in Perl scripts · 10583d9f
      Committed by Daniel Gustafsson
      This is a follow-up to commit b7365f58 which replaced the PostgreSQL
      bug report email with the Greenplum one.
    • Minor touchups on documentation and comments · 3c5e9684
      Committed by Daniel Gustafsson
      Removes unused CVS $Header$ tags, moves comments closer to where they
      make sense, and updates a few comments to match reality.
    • Compare the tuple value in question rather than saving full output · f71358a9
      Committed by Daniel Gustafsson
      Rather than storing the full 100kb string in the outfile (which, with
      the header, adds 200kb) and passing it to diff, compare the tuple with
      the expected value and store the boolean result in the outfile.
      
      This shaves 1.25 seconds off the testsuite on my laptop, but the
      primary win is to shrink the size of the outfiles. Tests on Pulse
      show consistently lower diff times.
    • Fix broken version printing in gpdiff.pl · ce4e7376
      Committed by Daniel Gustafsson
      Invoking gpdiff -version was broken since it relied on an old CVS
      $Revision$ tag in the source code being replaced with an actual value.
      Since this clearly isn't the most important part, for now I copied in
      the contents of VERSION, which seems like enough attention to spend on
      this.
    • Remove unused variables · b2e5fce2
      Committed by Daniel Gustafsson
      These variables are not used since the split into a program and a
      module.
    • Use Getopt::Long module for parsing commandline options · 4827c951
      Committed by Daniel Gustafsson
      Rather than our own bespoke code, use the Getopt::Long core module
      for parsing the command line options. By specifying pass_through in
      the Getopt configuration we can preserve the options to pass down to
      the diff command while extracting the gpdiff specific options. The
      variants that were previously allowed are added as aliases to the
      primary option names.
    • Replace tmpnam() based filename generation with race free version · 7b8cdd9c
      Committed by Daniel Gustafsson
      The tempfile() interface in File::Temp is race free and has been
      available as a core module since Perl 5.6.1 (released in April 2001),
      so switch to it to simplify the code and avoid excessive looping around
      a solved problem.
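      The same race exists in C's tmpnam(): it returns a name that another
      process may claim before we open it. The C analogue of the fix is
      mkstemp(), which creates and opens the file atomically. A sketch,
      assuming a writable /tmp; this illustrates the technique and is not
      code from the commit:

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>

/* Create a unique temp file race-free and report its name.
 * mkstemp() replaces the XXXXXX suffix in place and returns an
 * already-open descriptor, so no other process can steal the name
 * between "pick name" and "open file". */
static int make_temp(char *name_out, size_t len)
{
    char tmpl[] = "/tmp/gpdiff.XXXXXX";   /* hypothetical prefix */
    int fd = mkstemp(tmpl);               /* atomic create + open */
    if (fd >= 0 && name_out != NULL)
        snprintf(name_out, len, "%s", tmpl);
    return fd;                            /* caller closes and unlinks */
}
```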
    • Ensure that filespace hostname is duplicated before freed · f0d5e314
      Committed by Daniel Gustafsson
      Before freeing the CdbComponentDatabase struct we need to copy the
      hostname member, since we are passing it to the QEs. Should the memory
      be reclaimed before the command is serialized to be passed down, the
      hostname part will contain rubbish and either not work or crash the
      backend.
    • Fix typo in get_parts() comment documentation · da66ec51
      Committed by Daniel Gustafsson
    • Handle incorrect relation OID passed to pg_partition_oid() · bc1035b4
      Committed by Daniel Gustafsson
      The PartitionNode tree returned from RelationBuildPartitionDescByOid
      will be NULL in case the OID passed isn't present in pg_partition, so
      we must abort with an error to avoid segfaulting on a NULL pointer
      dereference. Also add a test for this in the partition suite.
      
      Reported by Github user @liruto.
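      The defensive pattern can be sketched as follows, with simplified
      hypothetical names (lookup_partition stands in for
      RelationBuildPartitionDescByOid, and a sentinel return stands in for
      ereport(ERROR)):

```c
#include <assert.h>
#include <stddef.h>

/* Sketch: a lookup by OID may find nothing and return NULL; the
 * caller must check before dereferencing. */
typedef struct PartitionNode { int nlevels; } PartitionNode;

static PartitionNode sample_root = { 2 };

/* Hypothetical lookup: only OID 1 exists in this sketch. */
static PartitionNode *lookup_partition(unsigned int oid)
{
    return (oid == 1) ? &sample_root : NULL;
}

static int partition_levels(unsigned int oid)
{
    PartitionNode *pn = lookup_partition(oid);
    if (pn == NULL)
        return -1;   /* in the server this would be ereport(ERROR, ...) */
    return pn->nlevels;
}
```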
    • Removed lc_numeric icg test · 2d2e9d5d
      Committed by Karthikeyan Jambu Rajaraman
      Also add a comment in the lc_numeric GUC saying not to remove
      GUC_GPDB_ADDOPT.
    • Use the resolved path for gpstringsubs.pl instead of hardcoding · 376ef413
      Committed by Daniel Gustafsson
      We resolve the path for gpstringsubs.pl with find_other_exec(), so use
      the outcome of that rather than hardcoding it at invocation.
  6. 12 May 2016, 2 commits
    • Add two GUC controlled index validations · 17c4dd19
      Committed by Asim R P
      Validation 1: look up new tuple's heap tid in unique index before insert.
      
      The tuple being inserted is already inserted in heap.  Before its entry is
      added to a unique index, we want to see if the index already has an entry with
      this heap tid.  This should catch duplicate entries created in index but not in
      the heap relation.  The validation is enabled by GUC "gp_indexcheck_insert".
      
      Validation 2: index should point to all visible tuples after vacuum.
      
      For each entry in index after it was vacuumed, fetch the heap tuple
      and validate that it is visible.  For specific tables, validate that
      the key is the same.  The validation is controlled by GUC
      "gp_indexcheck_vacuum".
      
      Closes #673.
    • Explicitly initialize GPOPT and its dependencies. · efbc6186
      Committed by Shreedhar Hardikar
      These were initialized by constructors earlier. To pass any parameters
      when initializing GPOPT or any of its dependencies, we need to do that
      explicitly.
  7. 11 May 2016, 6 commits
    • Remove a lot of unnecessary setup steps from qp_misc regression test. · 8eedfe9f
      Committed by Heikki Linnakangas
      There were a lot of unused tables and functions, and chaff like comments
      that are not needed for the actual tests in the file. At first glance,
      some of the things seemed marginally useful to test in their own right,
      like loading data with non-ASCII characters in it, but all the setup
      stuff was in a large ignore-block, so any failures there would go unnoticed
      anyway.
      
      Removing unnecessary stuff is a virtue of its own, but this also speeds up
      the test nicely.
    • Convert test case to Unix line endings. · eb1db513
      Committed by Heikki Linnakangas
    • Split GPDB additions to create_table test to a separate test. · 35c3c579
      Committed by Heikki Linnakangas
      The stuff that's inherited from upstream stays in create_table, while the
      stuff that we've added in GPDB is split off to gp_create_table. Separating
      them makes merging and diffing with upstream easier.
    • Remove redundant test case from qp_functions test file. · ac989241
      Committed by Heikki Linnakangas
      The test with the stress_test() function (and accompanying tables) was created
      and executed once. Then it was dropped, and recreated, and then executed
      two times. Executing the same function twice might reveal bugs in plan
      caching, so I kept that (although TBH we have better coverage for that
      elsewhere). But I don't see the point of dropping and recreating it in
      between: surely it's good enough to just create the function once, and
      execute it twice.
      
      This reduces the runtime of qp_functions test by about 1/3 (from 3 minutes
      to 2 minutes on my laptop).
    • Avoid checking the integer exponent for infinity · 7fca740a
      Committed by Daniel Gustafsson
      The exponent in the pow calculation is an integer and thus cannot be
      infinity; remove the infinity test from the logical OR in the check.
      
      Andreas Scherbaum and Atri Sharma
    • Avoid deadlock on catchup interrupt. · 698603da
      Committed by Heikki Linnakangas
      An earlier attempt at this checked AmIInSIGUSR1Handler() to see if we
      are currently processing a catchup event. But that's not good enough:
      we also process catchup interrupts outside the signal handler, in
      EnableCatchupInterrupt(). I saw lockups during "make installcheck-good"
      with a stack trace that shows a backend waiting for lock on a temporary
      relation, while trying to truncate it when committing the transaction
      opened for processing a catchup event.
      
      For reference, the commit message for the commit that introduced the
      AmIInSIGUSR1Handler check said:
      
          Recent parallel installcheck-good revealed we have a chance to process
          catchup interrupt while waiting for commit-prepare, and if the prepared
          transaction has created a temporary table with on commit option, the
          newly opened transaction for the sake of AcceptInvalidationMessages()
          cannot see and fails before the commit-prepare.  It's even not clear if
          we are safe to open and commit another transaction between prepare and
          commit-prepare, but for now just skip the oncommit operation as it
          doesn't have any effect anyway.