- 14 Jul 2017, 4 commits
-
-
Committed by Alexandra Wang
Signed-off-by: John Gaskin <johntgaskin@gmail.com>
Signed-off-by: Todd Sedano <tsedano@pivotal.io>
-
Committed by Jimmy Yih
When a standby is shut down and restarted, WAL recovery starts from the last restartpoint. If we replay an AO write record which has a following drop record, the WAL replay of the AO write record will find that the segment file does not exist. To fix this, we piggyback on top of the heap solution of tracking invalid pages in the invalid_page_tab hash table. The hash table key struct uses a block number which, for AO's sake, we pretend is the segment file number for AO/AOCO tables. This solution will be revisited to possibly create a separate hash table for AO/AOCO tables with a proper key struct. Big thanks to Heikki for pointing out the issue.
-
Committed by Ashwin Agrawal
We generate AO XLOG records when --enable-segwalrep is configured. We should now replay those records on the mirror or during recovery. The replay is only performed for standby mode since promotion will not execute until after there are no more XLOG records to read from the WAL stream.
-
Committed by Heikki Linnakangas
As reported by @flochman. See github issue #2739.
-
- 13 Jul 2017, 12 commits
-
-
Committed by Heikki Linnakangas
Seems like a good thing to test. To avoid having to maintain separate ORCA and non-ORCA expected outputs, change the ORCA error message to match the one you get without ORCA.
-
Committed by Daniel Gustafsson
This removes code which is either unreachable, due to prior identical tests that break out of the codepath, or dead, due to always being true. Asserting that an unsigned integer is >= 0 will always succeed, so it's pointless. Per "logically dead code" gripes from Coverity.
-
Committed by Jimmy Yih
When running `gpsegwalrep.py start`, it would intermittently deadlock on the subprocess.check_output call. Apparently, concurrent subprocess.check_output calls can deadlock depending on what shell commands are run and how fast they execute. For now, fix the issue by only calling subprocess.check_output under a thread lock. Someone can revisit this later although it is assumed a proper tool will be created in the near future.
-
Committed by Abhijit Subramanya
If we try to inject certain faults when the system is initialized with filerep disabled, we get the following error:

```
gpfaultinjector error: Injection Failed: Failure: could not insert fault injection, segment not in primary or mirror role
Failure: could not insert fault injection, segment not in primary or mirror role
```

This patch removes the role check for non-filerep faults so that they don't fail on a cluster initialized without filerep.
-
Committed by Asim R P
The filerep resync logic that fetches changed blocks from the changetracking (CT) log is changed. The LSN is no longer used to filter out blocks from the CT log. If a relation's changed blocks exceed the threshold number of blocks that can be fetched at a time, the last fetched block number is remembered and used to form the subsequent batch.
-
Committed by Asim R P
Filerep resync works by obtaining, from the changetracking (CT) log, the blocks changed since a mirror went down. The changed blocks are obtained in fixed-size batches. Blocks of the same relation are ordered by block number. The bug occurs when a higher-numbered block of a relation is changed such that it has a lower LSN than the lower-numbered blocks, and that higher-numbered block is not included in the first batch of changed blocks for the relation. Such blocks miss being resynchronized to the mirror due to an incorrect filter based on the previously obtained changed blocks' LSN. That means the mirror is eventually declared in sync with the primary, but some changed blocks remain only on the primary. This loss of data manifests only when the mirror takes over as primary, upon rebalance or the primary going down.
-
Committed by Asim R P
The GUC gp_changetracking_max_rows replaces a compile-time constant. The resync worker obtains at most gp_changetracking_max_rows changed blocks from the changetracking log at one time. Controlling this with a GUC makes it possible to exercise, and thereby expose, bugs in the resync logic around this area.
-
Committed by mkiyama
-
Committed by mkiyama
-
Committed by mkiyama
-
Committed by mkiyama
-
Committed by mkiyama
- 12 Jul 2017, 3 commits
-
-
Committed by Adam Lee
-
Committed by Adam Lee
0.9.8 is EOL; the 1.0+ versions have many security and performance improvements.
-
Committed by Jesse Zhang
`enable-cassert` is your friend, yo
-
- 11 Jul 2017, 16 commits
-
-
Committed by Heikki Linnakangas
If you have a query like "SELECT COUNT(col1) FROM wide_table", where the table has dozens of columns, the overhead in aocs_getnext() just to figure out which columns need to be fetched becomes noticeable. Optimize it.
-
Committed by Heikki Linnakangas
There was a mixture of spaces and tabs being used for indentation in aocsam.c, and I finally got fed up with that while doing other changes in that file. I ran pgindent, and did a bunch of manual fixups of the formatting. All the changes in this commit are purely cosmetic. I did the same for appendonlyam.c, although I'm not changing it at the moment, to keep aocsam.c and appendonlyam.c in sync.
-
Committed by Heikki Linnakangas
In aocsam.c, there's a block of code that does:

```
if (...)
{
    AOTupleIdInit_rowNum(...);
}
else
{
    AOTupleIdInit_rowNum(...);
}
```

While hacking, I removed the seemingly unnecessary braces, turning that into just:

```
if (...)
    AOTupleIdInit_rowNum(...);
else
    AOTupleIdInit_rowNum(...);
```

But then I got a compiler error about 'else' without 'if'. I was baffled for a moment, until I looked at the definition of AOTupleIdInit_rowNum: the way it includes curly braces makes it not work in an if-else construct like the above. These macros also have double-evaluation hazards. To make this more robust, turn the macros into static inline functions. Inline functions generally behave more sanely and are more readable than macros.
-
Committed by Heikki Linnakangas
This does mean that we don't free the array quite as quickly as we used to, but it's a drop in the sea. The array is very small, there are much bigger data structures involved in every AOCS scan that are not freed as quickly, and it's freed at the end of the query in any case.
-
Committed by Heikki Linnakangas
Commit fa6c2d43 added two functions, but forgot to add prototypes for them.
-
Committed by Adam Lee
This is important for debugging customers' issues. (The log level still matters.)
-
Committed by Ming LI
1. Log the raw string if it can't be decoded as unicode.
2. If a similar exception occurs in log(), continue processing the remaining log output with a warning.
3. If another exception occurs in CatThread, log the thread exit without blocking the worker process, and report the warning "gpfdist log halt because Log Thread got an exception:".
-
Committed by Marbin Tan
Create a more extensive workload for the SQL to make it last longer. The previous SQL was completing too fast, so by the time the actual pid read happened, the pid no longer existed, causing the result to be 0.
-
Committed by Venkatesh Raghavan
-
Oops we broke the tests sorry :( This reverts commit 97db5bdd.
-
Committed by Kavinder Dhaliwal
Signed-off-by: Melanie Plageman <mplageman@pivotal.io>
-
Committed by Chuck Litzell
* Pivotal GSS name change to Pivotal Support
* Change the Greenplum Customer Support reference to a Warning, as in the user doc
-
Committed by John Gaskin
Signed-off-by: Shivram Mani <shivram.mani@gmail.com>
-
Committed by Nadeem Ghani
Work around a problem discovered by a client who noticed intermittent gpssh errors when some nodes became very CPU-bound. In particular, we override the way the ssh command prompt is validated on a remote machine within gpssh. The vendored module 'pexpect' tries to match 2 successive prompts from an interactive bash shell. However, if the target host is slow from CPU or network loading, these prompts may return late. In that case, the override retries several times, extending the timeout from the default 1 second to up to 125 times that duration. Experimentally, these added retries seem to tolerate about 1 second of delay, testing with a 'tc' command that slows network traffic artificially. The number of retries can be configured.
- Add unit tests to verify the happy path of ssh-ing to localhost
- Add a module for gpssh, for overriding pexpect (pxxssh)
- Add a readme describing the testing technique of using 'tc' to delay the network
Signed-off-by: Larry Hamel <lhamel@pivotal.io>
-
Committed by Larry Hamel
Also added a unit test.
Signed-off-by: Nadeem Ghani <nghani@pivotal.io>
-
Committed by Nadeem Ghani
Signed-off-by: Larry Hamel <lhamel@pivotal.io>
-
- 10 Jul 2017, 2 commits
-
-
Committed by xiong-gang

```
CREATE RESOURCE GROUP rg1 WITH (concurrency=1, cpu_rate_limit=10, memory_limit=10);
CREATE ROLE r1 RESOURCE GROUP rg1;

session 1: SET ROLE r1;
           BEGIN;
session 2: BEGIN;   <--- hang, and then cancel
           BEGIN;   <--- assertion failure
```

Signed-off-by: Ning Yu <nyu@pivotal.io>
-
Committed by Richard Guo
Memory usage statistics in resource groups are defined as unsigned integers. For a subtraction 'a - b' on memory usage, the atomic subtraction function 'pg_atomic_sub_fetch_*' will return the value of 'a' from before the subtraction. This value is then asserted to be no less than 'b'.
-
- 07 Jul 2017, 3 commits
-
-
Committed by Adam Lee
-
Committed by Adam Lee
-
Committed by Ning Yu
Change initial contents in pg_resgroupcapability:
* Remove memory_redzone_limit;
* Add memory_shared_quota, memory_spill_ratio.

Change the resgroup concurrency range to [1, 'max_connections']:
* The original range was [0, 'max_connections'], where -1 meant unlimited.
* Now the range is [1, 'max_connections'], and -1 is not supported.

Change resgroup limit types from float to int. The following resgroup resource limit types changed from float to int percentage values:
* cpu_rate_limit;
* memory_limit;
* memory_shared_quota;
* memory_spill_ratio.
-