提交 · c2f00f144b4d7c1ec755bf6dd2462dfcf4a063dc · Greenplum / Gpdb

01 5月, 2019 8 次提交

C

Reorganize flowchart for readability (#7561) · c2f00f14
由 Chuck Litzell 提交于 4月 30, 2019

c2f00f14

docs - pxf jdbc cfg supports connection- and session-level properties (#7571) · cc8782d1

由 Lisa Owen 提交于 4月 30, 2019

* docs - jdbc cfg supports connection- and session-level properties

* some edits requested by david

* reword jdbc server cfg opening paragraph

* clarify rejected session prop/value chars as requested by ivan

cc8782d1

L

docs - add content for pxf cluster status cmd (#7570) · 883c124f
由 Lisa Owen 提交于 4月 30, 2019

883c124f
L
docs - discuss pxf fragment metadata caching (#7548) · dec335df
由 Lisa Owen 提交于 4月 30, 2019
```
* docs - discuss pxf fragment metadata caching

* a large number of
```
dec335df
K
Behave: Remove unused step · 1fddefd1
由 Kalen Krempely 提交于 4月 23, 2019
```
Remove unused behave step:
"the user waits for "{process_name}" to finish running"
```
1fddefd1

gpaddmirrors behave tests: don't use pkill · 5252e7dc

由 Kalen Krempely 提交于 4月 23, 2019

Remove behave step "the database is killed on hosts mdw,sdw1,sdw2" in
favor of "the database is not running. This shuts down the databse more
cleanly, and avoids any potential race conditons.

5252e7dc

Behave: Rename step for accuracy · ecf4b123

由 Kalen Krempely 提交于 4月 29, 2019

Remove "user kills a primary postmaster process" in favor of the more
generic equivalent step "user stops all {segment_type} processes" which
can take either a primary or mirror.

For clarity and accuracy rename "user kills all {segment_type}
processes" to "user stops all {segment_type} processes".

ecf4b123

Behave: Use pg_ctl stop instead of killing processes · 525b91fa

由 Kalen Krempely 提交于 4月 23, 2019

Cleanly shutdown segments using pg_ctl to avoid potential race
conditions by grep'ing for the pid via ps. This is a much simplier
approach to help with maintainability and extensibility.

525b91fa

30 4月, 2019 3 次提交

Fix flaky gpcloud S3KeyReaderTest unit test (#7576) · 4ff071cf

由 Peifeng Qiu 提交于 4月 30, 2019

S3KeyReaderTest.MTReadWithUnexpectedFetchDataAtSecondRound is a
flaky case, related to multithread timing.

The case setup S3KeyReader and try to download in parallel with 2
chunks(threads). When any of them encounters an error, all thread
will abort with the shared error.

The case assumed that the first created thread will call fetchData()
twice before another thread fetch with error. But if the first
thread is never scheduled to run, the second thread will call
fetchData() first and sets the shared error. Then the first thread
continues and will exit at the first call to fetchData(), reporting
shared error.

Modify the second call to fetchData() to be at most once.

4ff071cf

Recursively create tablespace directories if they do not exist but we need... · 7a09e80d

由 Paul Guo 提交于 4月 18, 2019

Recursively create tablespace directories if they do not exist but we need them when re-redoing some tablespace related xlogs (e.g. database create with a tablespace) on mirror.

It is observed many time that gp_replica_check test fails because some mirror nodes
can not be brought up before testing recently. The related log looks like this:

2019-04-17 14:52:14.951 CST [23030] FATAL: could not create directory "pg_tblspc/65546/PG_12_201904072/65547": No such file or directory
2019-04-17 14:52:14.951 CST [23030] CONTEXT: WAL redo at 0/3011650 for Database/CREATE: copy dir 1663/1 to 65546/65547

That is because some mirror nodes can not be recovered after previous testing,
not due to gp_replica_check itself. The root cause is that tablespace recovery
related. Pengzhou Tang and Hao Wu digged that intially and kindly found a mini
repro as below.

run on shell:
rm -rf /tmp/some_isolation2_pg_basebackup_tablespace
mkdir -p /tmp/some_isolation2_pg_basebackup_tablespace

copy and run the below sql on psql client:
drop tablespace if exists some_isolation2_pg_basebackup_tablespace;
create tablespace some_isolation2_pg_basebackup_tablespace location '/tmp/some_isolation2_pg_basebackup_tablespace';
\!gpstop -ra -M fast;
drop database if exists some_database_with_tablespace;
create database some_database_with_tablespace tablespace some_isolation2_pg_basebackup_tablespace;
drop database some_database_with_tablespace;
drop tablespace some_isolation2_pg_basebackup_tablespace;
\!gpstop -ra -M immediate;

The root cause is on mirror after drop database & drop tablespace, 'immediate'
stop causes the pg_control file not up-to-date with latest redo start lsn (this
is allowed), when the node restarts, it re-redoes 'create database
some_database_with_tablespace tablespace
some_isolation2_pg_basebackup_tablespace' but the tablespace directories have
been deleted in previous redoing.

The 'could not create directory' error could happen on re-redoing create table
in a tablespace also. We've seen this case on the ci environment, but that is
because missing of a get_parent_directory() call in the 'create two parents'
code block in TablespaceCreateDbspace(). Changing it to a simpler call
pg_mkdir_p() instead.

Also it seems that the src_path could be missing also in dbase_redo() for the
example below. For example re-redoing at the alter step since tbs1 directory is
deleted in later 'drop tablespace tbs1'.
alter database db1 set tablespace tbs2;
drop tablespace tbs1;

There is discussion on upstream about this,
https://www.postgresql.org/message-id/flat/CAEET0ZGx9AvioViLf7nbR_8tH9-%3D27DN5xWJ2P9-ROH16e4JUA%40mail.gmail.com

In this patch I recreate those directories to avoid this error. Other solutions
include ignoring the directory-not-existing error or forcing a flush when
redoing those kind of checkpoint xlogs which are added normally in drop
database, etc.

Let's revert or update the code change after the solution is finalized on
upstream.

7a09e80d

docs - pxf init/sync support to master standby (#7540) · 433a6ebb

由 Lisa Owen 提交于 4月 29, 2019

* docs - pxf init/sync support to master standby

* edits requested by david

* edits requested by francisco and oliver

* pxf sync from master TO standby or seg host

* identify sync run on master in pxf sync option description

433a6ebb

29 4月, 2019 8 次提交

Silece stringop-overflow warning and protect against invalid read · 689959e2

由 Georgios Kokolatos 提交于 4月 29, 2019

strncat unfortunately takes a misleading size argument which means
at most size from src. It is a bit of an antipattern in the string
family of functions and for that compilers will emit a warning if
it happens that the size argument matches the size of src, since that
is not what usually users of strncat want to do.

The usage of the function in the code was correct. However instead
of silencing the compiler, strncat was replaced with the dynamic buffer
family of operations that postgres provides for the frontend.

Also protect against an invalid read in case that the size of the
result is zero.
Reviewed-by: NAsim R P <apraveen@pivotal.io>
Reviewed-by: NDaniel Gustafsson <dgustafsson@pivotal.io>

689959e2

Remove superfluous NULL check · 6f3ed938

由 Daniel Gustafsson 提交于 4月 29, 2019

The curl slist API properly handle NULLs so we can be less verbose
and skip the check before passing to the slist cleanup function.
Reviewed-by: NFrancisco Guerrero <aguerrero@pivotal.io>

6f3ed938

Skip storing the last response as it's unused · c50f7e5d

由 Daniel Gustafsson 提交于 4月 29, 2019

The header callback was storing the response in the context, but as
it was never used we might as well save the memory and just return
the required return of the number of bytes we would've saved should
we have allocated.
Reviewed-by: NFrancisco Guerrero <aguerrero@pivotal.io>

c50f7e5d

Avoid zeroing out memory when not required · b940863c

由 Daniel Gustafsson 提交于 4月 29, 2019

The only time the internal buffer cleanup code was called was just
before freeing the entire context, so individually zeroing out the
members is pointless. Remove the function entirely and inline the
buffer freeing into the context cleanup codepath.

For zeroing the error buffer, it's only called right after allocating
the error buffer with palloc0() in the first place so the memory will
always be zeroed out when reaching here.
Reviewed-by: NFrancisco Guerrero <aguerrero@pivotal.io>

b940863c

Fix potential pfree of NULL pointer · 43816bbc

由 Daniel Gustafsson 提交于 4月 29, 2019

If strlen(addr) is zero then based on how get_dest_address() works
addr will be NULL, and pfree() on NULL is not permitted. Also, we
know that addr will either be a non-empty string or NULL, so we can
just as well test for addr being NULL and avoid a strlen() call.

Fix by only pfreeing when addr is set. (this is in an elog(ERROR..)
context so freeing isn't terribly interesting but it also doesn't
hurt so I'm keeping the current codepath.)
Reviewed-by: NFrancisco Guerrero <aguerrero@pivotal.io>

43816bbc

Mark internal functions as static · 67825c03

由 Daniel Gustafsson 提交于 4月 29, 2019

The libchurl abstraction layer has many internal helper functions
which weren't marked static and thus exported. Fix by marking all
as static.
Reviewed-by: NFrancisco Guerrero <aguerrero@pivotal.io>

67825c03

D
Fix typos in SharedInputScan comments · 50144063
由 Daniel Gustafsson 提交于 4月 29, 2019
```
Spotted while reading code.
```
50144063

gpcloud: sleep 1s and retry if AWS returns "NoSuchKey" error · 0639ee8e

由 Adam Lee 提交于 4月 29, 2019

CI reported some read failures just after the write action, it's
probably because the write action is not flushed yet by AWS.

Sleep and retry if AWS returns "NoSuchKey" error. Update the workflow
but not the test cases because users might have the same issue.
Co-authored-by: NPeifeng Qiu <pqiu@pivotal.io>

0639ee8e

28 4月, 2019 3 次提交

A
Revert "gpcloud: sleep and retry if AWS returns "NoSuchKey" error" · b7155ab1
由 Adam Lee 提交于 4月 28, 2019
```
This reverts commit 668558dd.
```
b7155ab1

gpcloud: sleep and retry if AWS returns "NoSuchKey" error · 668558dd

由 Adam Lee 提交于 4月 28, 2019

CI reported some read failures just after the write action, it's
probably because the write action is not flushed yet by AWS.

Sleep and retry if AWS returns "NoSuchKey" error. Update the workflow
but not the test cases because users might have the same issue.

668558dd

Print more detailed message for gp_replica_check test failure due to bad cluster states. (#7556) · 730d31d5

由 Paul Guo 提交于 4月 28, 2019

Previously we error out in gp_replica_check test if the cluster is not in
synced, which is usually due to bugs revealed by previous tests, instead of
gp_replica_check itself.

Print more detailed message so that people are not confused with the status and
this failure reason of the gp_replica_check test.

730d31d5

27 4月, 2019 2 次提交
- C
  Docs updates to gucs (#7550) · 3b6fdd98
  由 Chuck Litzell 提交于 4月 26, 2019
```
* Updates to GUCs from Cyrille's reviews

* Revise description of temp_buffers to match 6.0 behavior

* Fix a couple typos

* For effective_cache_size GUC, show default in blocks and size.
```
  3b6fdd98
- C
  
  Remove implication that transactions restart themselves after segment failure. (#7534) · 1a31dccd
  由 Chuck Litzell 提交于 4月 26, 2019
  
  1a31dccd
26 4月, 2019 6 次提交

C

Correction to spread mirroring illustration in Best Practices Guide (#7538) · 685f45c0
由 Chuck Litzell 提交于 4月 25, 2019

685f45c0
C

Remove add_missing_from GUC from all docs (#7551) · e9df3e35
由 Chuck Litzell 提交于 4月 25, 2019

e9df3e35
C
Docs - update pg_class and pg_index catalog table references (#7504) · 733e25c7
由 Chuck Litzell 提交于 4月 25, 2019
```
* Docs - update pg_class and pg_index catalog table references

* dyozie review comments
```
733e25c7
L

docs - replace < in CREATE EXTERNAL TABLE syntax (#7536) · eb130b16
由 Lisa Owen 提交于 4月 25, 2019

eb130b16
A

Updates the docker file to use OpenSSL's version of libcurl. · 6e809afd
由 Adam Berlin 提交于 4月 25, 2019

6e809afd

Adding options to generate diff files to plan comparison script (#7349) · 9b1835c0

由 Hans Zeller 提交于 4月 25, 2019

The script used in pipelines that search for explain plan changes
lists those queries that have cost, row or plan changes. In many
cases the user will want to investigate those changes further.

A new set of options generates two directories that are easy to
compare, one contains the baseline plans, one file per plan, and
the other contains the changed plans of the test.

$ ~/workspace/gpdb/concourse/scripts/perfsummary.py --help
usage: perfsummary.py [-h] [--baseLog BASELOG] [--diffDir DIFFDIR]
                      [--diffThreshold DIFFTHRESHOLD] [--diffLevel DIFFLEVEL]
                      [log_file]

Summarize the test suite execute and explain log

positional arguments:
  log_file              log file with explain/execute output

optional arguments:
  -h, --help            show this help message and exit
  --baseLog BASELOG     specify a log file from a base version to compare to
  --diffDir DIFFDIR     request diff files to be created and specify a
                        directory to place diffs into
  --diffThreshold DIFFTHRESHOLD
                        specify a numerical threshold to record plan diffs
                        with a performance regression of more than n percent
  --diffLevel DIFFLEVEL
                        specify which diff files to generate: 1 = all diffs, 2
                        = ignore cost diffs, 3 = plan diffs only

9b1835c0

25 4月, 2019 10 次提交

P

Add README for building client tools on Windows · 24ea1af7
由 Peifeng Qiu 提交于 4月 23, 2019

24ea1af7

Fix windows client package script · a2571325

由 Peifeng Qiu 提交于 4月 22, 2019

Connectivity and loaders are no longer vendored. Loaders will be
merged into clients. Add script to create msi package.

a2571325

P
Compile pygresql with CMake · defab9ed
由 Peifeng Qiu 提交于 4月 22, 2019
```
Requires a working libpq.dll. Set CMAKE_PREFIX_PATH to a successful
client build.
```
defab9ed

Fix gpfdist pipe failure with latest windows build · f044d844

由 Peifeng Qiu 提交于 4月 22, 2019

Behavior of stat() is not stable across C Runtime versions.
Implementation of msvcrt.dll calls FindFirstFile(), while in normal
CRT that omes with Visual Studio or Redistribute packages, it calls
CreateFile().
CreateFile() is problematic here, if path is a pipe, it will open
the pipe once then close, causing the other side to connect to the
wrong pipe. Skip stat() if the name is pipe and pretend there is one.

f044d844

Compile gpfdist on windows with CMake · 4a4e4bbf

由 Peifeng Qiu 提交于 4月 22, 2019

- Don't include strings.h when build with MSVC
- C99 Syntax fix for struct initializer.
- Call event_set to initialize event struct. Later libevent version
expect this behavior.
- Use recv instead of read. Latest CRT implementation of read only
works on files, not sockets.

4a4e4bbf

P

Fix 64-bit kerberos lib path · 6b4621ec
由 Peifeng Qiu 提交于 4月 22, 2019

6b4621ec

Fix msvc build for client tools · 8a5ad4bf

由 Peifeng Qiu 提交于 1月 10, 2019

- Fix upstream build system for MSVC, add option to build.pl to
only build client tools.
- Add detection for MSVC SDK version. If this field is missing,
8.1 is assumed. The latest compiler will complain about this.
- Various small fixes

8a5ad4bf

C
Add a list of SQL keywords to the ref guide (#7449) · 052bdb22
由 Chuck Litzell 提交于 4月 24, 2019
```
* Add a list of SQL keywords to the ref guide

* Fix error from review

* Update from review comment
```
052bdb22

Add the gpdb clients release candidates (#7531) · 48bdaf2a

由 Tingfang Bao 提交于 4月 25, 2019

Story: https://www.pivotaltracker.com/story/show/164917628

   The gp-integration-testing pipeline needs the gp-clients RC
   package as a input, and then create the clients RPM base on it.
   The custom installer patch also need it to build client bin installer.
Co-authored-by: NXiaoran Wang <xiwang@pivotal.io>
Co-authored-by: NShaoqi Bai <sbai@pivotal.io>
Co-authored-by: NBob Bao <bbao@pivotal.io>

48bdaf2a

Rename instances of "PQO" to "Pivotal Optimizer (GPORCA)" (#7511) · 5212dba1

由 Chris Hajas 提交于 4月 24, 2019

ORCA Explain plans will now contain:
`Optimizer: Pivotal Optimizer (GPORCA) version 3.35.0`

instead of:
`Optimizer: PQO version 3.35.0`

Authored-by: Chris Hajas chajas@pivotal.io

5212dba1