提交 · 42b915a27e04ef8376ddca69f58ae5aa96a120b1 · Greenplum / Gpdb

08 5月, 2019 2 次提交

Introduce new tablespace directory layout. · 42b915a2

由 Taylor Vesely 提交于 3月 29, 2019

This commit includes changes to the server to ensure that the utilities:
pg_rewind and pg_basebackup can be changed to support recovery in a
multi-segment-singular-host setting. We link pg_tblspc to a <dbid>
subdirectory of the tablespace, rather than to the path of the
tablespace directly, and we remove the <dbid> from the tablespace
version directory. At the same time, we have designed towards preserving
the response to pg_tablespace_location(<tablespace_oid>) such that it
does not return the dbid suffix. The design is such that it is the
responsibility of the utilities to append the dbid as and when required.

Before this commit:
  * the symlink to the tablespace directory looks like:
      pg_tblspc/spcoid/ -> /<tablespace_location>/
  * Under the symlink target, we would have the following:
    GPDB_MAJORVER_CATVER_db<dbid>/dboid/relfilenode
  * pg_tablespace_location(tsoid) returns: <tablespace_location>
e.g.
  * pg_tblspc/20981/ -> /data1/tsp1
  * Under /data1/tsp1: GPDB_6_201902061_db1/19849/192814
  * pg_tablespace_location(20981) returns: /data1/tsp1

After this commit:
  * the symlink to the tablespace directory looks like:
      pg_tblspc/spcoid/ -> /<tablespace_location>/<dbid>
  * Under the symlink target, we would have the following:
      GPDB_MAJORVER_CATVER/dboid/relfilenode
  * pg_tablespace_location(tsoid) returns: <tablespace_location>

e.g.
  * pg_tblspc/20981/ -> /data1/tsp1/1
  * Under /data1/tsp1/1: GPDB_6_201902061/19849/192814
  * pg_tablespace_location(20981) returns: /data1/tsp1

Motivation:

When tablespaces were aligned to upstream postgres, while removing
filespaces, we added the `tablespace_version_directory()` function to
supply each segment with a unique tablespace directory name. This was
accomplished by appending the 'magic' `GpIdentity.dbid` global variable
to the `GP_TABLESPACE_VERSION_DIRECTORY` in `tablespace_version_directory()`.

This is problematic for several reasons- but perhaps most severely is
the fact that in order to use any code in libpgcommon.so that references
this value, you need to first set the `GpIdentity.dbid` global,
otherwise any functions that deal with tablespaces will be broken in
unpredictable ways.

An example is pg_rewind- where `GetRelationPath()` will not return a valid
relation unless you repeatedly toggle the `GpIdentity.dbid` between the
value of the source or target segment dependant on the context of which
relfiles are being examined.

This commit bumps the catalog version here we have made breaking changes
in the tablespace filesystem layout.
Co-authored-by: NAdam Berlin <aberlin@pivotal.io>
Co-authored-by: NTaylor Vesely <tvesely@pivotal.io>
Co-authored-by: NSoumyadeep Chakraborty <sochakraborty@pivotal.io>

42b915a2

concourse: Remove ubuntu16 build · 8af57a90

由 David Sharp 提交于 5月 03, 2019

Co-authored-by: NDavid Sharp <dsharp@pivotal.io>
Co-authored-by: NGoutam Tadi <gtadi@pivotal.io>

8af57a90

07 5月, 2019 8 次提交

L
docs - address lock exhaustion shared mem error msg (#7594) · 50918ec2
由 Lisa Owen 提交于 5月 07, 2019
```
* docs - address lock exhaustion shared mem error msg o

* capitalize Out in title
```
50918ec2
L
docs - enhance pxf jdbc partitioning content (#7588) · c1d97700
由 Lisa Owen 提交于 5月 07, 2019
```
* docs - enhance pxf jdbc partitioning content

* add missing comma

* simplify some content
```
c1d97700

Replace problematic modification of global variable with a constant argument. · 4ad8e668

由 Adam Berlin 提交于 4月 30, 2019

There was a concern that an exception during GetNonHistoricCatalogSnapshot would be problematic after setting the global variable and not resetting it back to its original value. This patch threads the desired distributed transaction context into GetNonHistoricCatalogSnapshot without modifying global state.

4ad8e668

A
Thread the desired distributed transaction context through for getting snapshot data. · fdf5a57a
由 Adam Berlin 提交于 4月 30, 2019
```
No longer rely on a global variable to determine the distributed snapshot context.
```
fdf5a57a

Fix motion hazard between outer and joinqual · fa762b69

由 Ning Yu 提交于 5月 07, 2019

A motion hazard is a deadlock between motions, a classic motion hazard
in a join executor is formed by its inner and outer motions, it can be
prevented by prefetching the inner plan, refer to motion_sanity_check()
for details.

A similar motion hazard can be formed by the outer motion and the join
qual motion.  A join executor fetches a outer tuple, filters it with the
join qual, then repeat the process on all the outer tuples.  When there
are motions in both outer plan and the join qual then below state is
possible:

0. processes A and B belong to the join slice, process C belongs to the
   outer slice, process D belongs to the JoinQual slice;
1. A has read the first outer tuple and is fetching tuples from D;
2. D is waiting for ACK from B;
3. B is fetching the first outer tuple from C;
4. C is waiting for ACK from A;

So a deadlock is formed A->D->B->C->A.  We can prevent it also by
prefetching the join qual.
Reviewed-by: NJesse Zhang <jzhang@pivotal.io>
Reviewed-by: NGang Xiong <gxiong@pivotal.io>
Reviewed-by: NZhenghua Lyu <zlv@pivotal.io>

fa762b69

concourse: Oops, set pr_pipeline concurrency back to 10 · 76e7b822

由 David Sharp 提交于 5月 06, 2019

Had set to 1 for a test pipeline and forgot to put it back.
Co-authored-by: NDavid Sharp <dsharp@pivotal.io>
Co-authored-by: NJason Vigil <jvigil@pivotal.io>

76e7b822

concourse pr_pipeline: Switch to building centos7; Only master PRs · 4f3d158e

由 David Sharp 提交于 5月 02, 2019

This will move us towards dropping Ubuntu 16.04 in preparation for
adding an Ubuntu 18.04 build, as well as removing remaining uses of
Conan.

5X_STABLE and master have diverged enough that this pipeline can't build
5X_STABLE PRs. Add "base_branch" filter (newly added to the
github-pr-resource) to build only PRs against master.
Authored-by: NDavid Sharp <dsharp@pivotal.io>

4f3d158e

B

Bump ORCA to v3.39.0 · ed2831d4
由 Bhuvnesh Chaudhary 提交于 5月 06, 2019

ed2831d4

06 5月, 2019 1 次提交

Fix auto explain init file · 1348afc0

由 Hao Wu 提交于 5月 06, 2019

Fix auto_explain init file for #7195

This patch includes

1. fix init_file
2. use float number for memory usage
3. Update the last SQL whose running time may be less than 1ms
4. Update test case: use newly created table other than pg_class
5. Add answer file for orca & enable nestloop

1348afc0

04 5月, 2019 4 次提交

Add test cases for auto_explain and enable it in GNUmakefile.in (#7195) · 73ca7e77

由 Hao Wu 提交于 5月 04, 2019

1. Add test cases for auto_explain
	* Add init_file to filter out random numbers
	* set CLIENT_MIN_MESSAGES to show `LOG` message to result file
2. Add auto_explain item in gpdb/GNUmakefile.in

73ca7e77

Remove gpseginstall · 64014685

由 Jamie McAtamney 提交于 4月 30, 2019

Now that the enterprise version of GPDB is only provided via RPM, including
gpseginstall in the distribution would cause conflicts if users try to install
GPDB with RPMs and with gpseginstall on the same cluster. While it could be
preserved for use by the OSS community, there are several standard tools for
copying GPDB files to segment hosts in a cluster, and recommendations for using
one or more of those tools will be included in the GPDB documentation.

In addition to removing gpseginstall itself, this commit removes references to
it in other utilities' documentation and removes code in gppylib that was only
called by gpseginstall.
Co-authored-by: NJamie McAtamney <jmcatamney@pivotal.io>
Co-authored-by: NKalen Krempely <kkrempely@pivotal.io>

64014685

Zero out padding bytes for memtuple. · 0b6f15b8

由 Ashwin Agrawal 提交于 4月 26, 2019

heaptuple_form_to() zero's out the buffer, so this commit does same
for memtuple_form_to(). This is done to have zeros in padding areas
and hence help get good compression for AO tables.

This also helps to eliminate the flakiness seen in appendonly test for
compression ratio. Now given same input the compression ratio will be
same.

Discussion:
https://groups.google.com/a/greenplum.org/d/msg/gpdb-dev/1N5Qi-4WPis/EAIdwvy9CAAJ

0b6f15b8

docs - clarify pxf filter partitioning support for hive (#7580) · 4a684535

由 Lisa Owen 提交于 5月 03, 2019

* docs - clarify pxf filter partitioning support for hive

* clarify the hadoop cfg update content

* remove cluster start

* edits requested by david

* remove disable statement per shivram

4a684535

03 5月, 2019 5 次提交
- D
  
  Fix typo in documentation · 6db0e2a9
  由 Daniel Gustafsson 提交于 5月 03, 2019
  
  6db0e2a9
- S
  Bump ORCA version to 3.38.0 (#7505) · dfde9618
  由 Shreedhar Hardikar 提交于 5月 02, 2019
```
Includes ICG changes for ORCA commit:
'Convert FULL OUTER JOIN to LEFT JOIN when possible'
```
  dfde9618
- C
  Fix reference leak when ORCA falls back to planner for queries involving foreign tables (#7595) · 1086ab7e
  由 Chris Hajas 提交于 5月 02, 2019
```
Authored-by: NChris Hajas <chajas@pivotal.io>
```
  1086ab7e
- H
  Bump ORCA version to 3.37 (from 3.35) (#7600) · e9625150
  由 Hans Zeller 提交于 5月 02, 2019
```
Also add a few more tests for unions to ICG.
```
  e9625150
- L
  
  docs - fixes/edits to gpbackup/restore email example (#7589) · a94962e7
  由 Lisa Owen 提交于 5月 02, 2019
  
  a94962e7
02 5月, 2019 5 次提交

Remove autoconf check for CURLOPT_MAIL_FROM · 2bfbb45e

由 Daniel Gustafsson 提交于 5月 02, 2019

The functionality to send alerts via email (and snmp was removed in
commit 65822b80 but I missed removing
the autoconf check for the libcurl feature required. Since there are
no more consumers, remove the check and feature macro as well.
Reviewed-by: NJimmy Yih <jyih@pivotal.io>

2bfbb45e

Make dtm_recovery_on_standby test more deterministic · c5eb8f25

由 Asim R P 提交于 5月 02, 2019

The test should wait for the transactions to be in the right state
before promoting standby.  This commit adds a wait step to ensure just
that.  One of the ICW jobs in CI failed because the test promoted the
standby before the transactions were preprared on master.  This should
no longer happen now.

c5eb8f25

L
docs - add pxf jdbc.connection.transactionIsolation server cfg property (#7584) · 72fac815
由 Lisa Owen 提交于 5月 01, 2019
```
* docs - add pxf jdbc.connection.transactionIsolation server cfg property

* use read uncommitted in example; add external
```
72fac815
J

gpconfig: update docs with valid options · 96f34a8c
由 Jacob Champion 提交于 4月 29, 2019

96f34a8c

gpconfig: Update message when file has no GUC value · b8bd3577

由 Shoaib Lari 提交于 4月 18, 2019

Because users may find it confusing when gpconfig prints '-' or 'None'
when no GUC value is found in the file, this commit updates gpconfig to
output a clearer message.

Apply the out-of-band messaging to failure reports as well, and fix up
the output of --file-compare. Add tests for each modified
implementation.
Co-Authored-By: NJamie McAtamney <jmcatamney@pivotal.io>
Co-Authored-By: NMark Sliva <msliva@pivotal.io>
Co-Authored-By: NShoaib Lari <slari@pivotal.io>

b8bd3577

01 5月, 2019 10 次提交

Test to verify that standby performs DTM recovery after promotion · be10a1bb

由 Asim R P 提交于 4月 19, 2019

Transactions that are in the middle of two phase commit are suspended on
master. Standby is promoted while they are suspended. Based on what XLOG
records are emitted by master, the standby is expected to perform DTM recovery
and complete the transactions upon promotion.

be10a1bb

Enable connections to standby from an isolation2 spec · 7d8a1504

由 Asim R P 提交于 4月 19, 2019

This feature enables tests to run SQL on standby after it is promoted.  Use
"-1S: <sql>" to run <sql> statement on standby.  It is assumed that the standby
is already promoted.

7d8a1504

C

Reorganize flowchart for readability (#7561) · c2f00f14
由 Chuck Litzell 提交于 4月 30, 2019

c2f00f14

docs - pxf jdbc cfg supports connection- and session-level properties (#7571) · cc8782d1

由 Lisa Owen 提交于 4月 30, 2019

* docs - jdbc cfg supports connection- and session-level properties

* some edits requested by david

* reword jdbc server cfg opening paragraph

* clarify rejected session prop/value chars as requested by ivan

cc8782d1

L

docs - add content for pxf cluster status cmd (#7570) · 883c124f
由 Lisa Owen 提交于 4月 30, 2019

883c124f
L
docs - discuss pxf fragment metadata caching (#7548) · dec335df
由 Lisa Owen 提交于 4月 30, 2019
```
* docs - discuss pxf fragment metadata caching

* a large number of
```
dec335df
K
Behave: Remove unused step · 1fddefd1
由 Kalen Krempely 提交于 4月 23, 2019
```
Remove unused behave step:
"the user waits for "{process_name}" to finish running"
```
1fddefd1

gpaddmirrors behave tests: don't use pkill · 5252e7dc

由 Kalen Krempely 提交于 4月 23, 2019

Remove behave step "the database is killed on hosts mdw,sdw1,sdw2" in
favor of "the database is not running. This shuts down the databse more
cleanly, and avoids any potential race conditons.

5252e7dc

Behave: Rename step for accuracy · ecf4b123

由 Kalen Krempely 提交于 4月 29, 2019

Remove "user kills a primary postmaster process" in favor of the more
generic equivalent step "user stops all {segment_type} processes" which
can take either a primary or mirror.

For clarity and accuracy rename "user kills all {segment_type}
processes" to "user stops all {segment_type} processes".

ecf4b123

Behave: Use pg_ctl stop instead of killing processes · 525b91fa

由 Kalen Krempely 提交于 4月 23, 2019

Cleanly shutdown segments using pg_ctl to avoid potential race
conditions by grep'ing for the pid via ps. This is a much simplier
approach to help with maintainability and extensibility.

525b91fa

30 4月, 2019 3 次提交

Fix flaky gpcloud S3KeyReaderTest unit test (#7576) · 4ff071cf

由 Peifeng Qiu 提交于 4月 30, 2019

S3KeyReaderTest.MTReadWithUnexpectedFetchDataAtSecondRound is a
flaky case, related to multithread timing.

The case setup S3KeyReader and try to download in parallel with 2
chunks(threads). When any of them encounters an error, all thread
will abort with the shared error.

The case assumed that the first created thread will call fetchData()
twice before another thread fetch with error. But if the first
thread is never scheduled to run, the second thread will call
fetchData() first and sets the shared error. Then the first thread
continues and will exit at the first call to fetchData(), reporting
shared error.

Modify the second call to fetchData() to be at most once.

4ff071cf

Recursively create tablespace directories if they do not exist but we need... · 7a09e80d

由 Paul Guo 提交于 4月 18, 2019

Recursively create tablespace directories if they do not exist but we need them when re-redoing some tablespace related xlogs (e.g. database create with a tablespace) on mirror.

It is observed many time that gp_replica_check test fails because some mirror nodes
can not be brought up before testing recently. The related log looks like this:

2019-04-17 14:52:14.951 CST [23030] FATAL: could not create directory "pg_tblspc/65546/PG_12_201904072/65547": No such file or directory
2019-04-17 14:52:14.951 CST [23030] CONTEXT: WAL redo at 0/3011650 for Database/CREATE: copy dir 1663/1 to 65546/65547

That is because some mirror nodes can not be recovered after previous testing,
not due to gp_replica_check itself. The root cause is that tablespace recovery
related. Pengzhou Tang and Hao Wu digged that intially and kindly found a mini
repro as below.

run on shell:
rm -rf /tmp/some_isolation2_pg_basebackup_tablespace
mkdir -p /tmp/some_isolation2_pg_basebackup_tablespace

copy and run the below sql on psql client:
drop tablespace if exists some_isolation2_pg_basebackup_tablespace;
create tablespace some_isolation2_pg_basebackup_tablespace location '/tmp/some_isolation2_pg_basebackup_tablespace';
\!gpstop -ra -M fast;
drop database if exists some_database_with_tablespace;
create database some_database_with_tablespace tablespace some_isolation2_pg_basebackup_tablespace;
drop database some_database_with_tablespace;
drop tablespace some_isolation2_pg_basebackup_tablespace;
\!gpstop -ra -M immediate;

The root cause is on mirror after drop database & drop tablespace, 'immediate'
stop causes the pg_control file not up-to-date with latest redo start lsn (this
is allowed), when the node restarts, it re-redoes 'create database
some_database_with_tablespace tablespace
some_isolation2_pg_basebackup_tablespace' but the tablespace directories have
been deleted in previous redoing.

The 'could not create directory' error could happen on re-redoing create table
in a tablespace also. We've seen this case on the ci environment, but that is
because missing of a get_parent_directory() call in the 'create two parents'
code block in TablespaceCreateDbspace(). Changing it to a simpler call
pg_mkdir_p() instead.

Also it seems that the src_path could be missing also in dbase_redo() for the
example below. For example re-redoing at the alter step since tbs1 directory is
deleted in later 'drop tablespace tbs1'.
alter database db1 set tablespace tbs2;
drop tablespace tbs1;

There is discussion on upstream about this,
https://www.postgresql.org/message-id/flat/CAEET0ZGx9AvioViLf7nbR_8tH9-%3D27DN5xWJ2P9-ROH16e4JUA%40mail.gmail.com

In this patch I recreate those directories to avoid this error. Other solutions
include ignoring the directory-not-existing error or forcing a flush when
redoing those kind of checkpoint xlogs which are added normally in drop
database, etc.

Let's revert or update the code change after the solution is finalized on
upstream.

7a09e80d

docs - pxf init/sync support to master standby (#7540) · 433a6ebb

由 Lisa Owen 提交于 4月 29, 2019

* docs - pxf init/sync support to master standby

* edits requested by david

* edits requested by francisco and oliver

* pxf sync from master TO standby or seg host

* identify sync run on master in pxf sync option description

433a6ebb

29 4月, 2019 2 次提交

Silece stringop-overflow warning and protect against invalid read · 689959e2

由 Georgios Kokolatos 提交于 4月 29, 2019

strncat unfortunately takes a misleading size argument which means
at most size from src. It is a bit of an antipattern in the string
family of functions and for that compilers will emit a warning if
it happens that the size argument matches the size of src, since that
is not what usually users of strncat want to do.

The usage of the function in the code was correct. However instead
of silencing the compiler, strncat was replaced with the dynamic buffer
family of operations that postgres provides for the frontend.

Also protect against an invalid read in case that the size of the
result is zero.
Reviewed-by: NAsim R P <apraveen@pivotal.io>
Reviewed-by: NDaniel Gustafsson <dgustafsson@pivotal.io>

689959e2

Remove superfluous NULL check · 6f3ed938

由 Daniel Gustafsson 提交于 4月 29, 2019

The curl slist API properly handle NULLs so we can be less verbose
and skip the check before passing to the slist cleanup function.
Reviewed-by: NFrancisco Guerrero <aguerrero@pivotal.io>

6f3ed938