Commits · 481cdcef6594e590560240f41b92b42d39b5b69b · Greenplum / Gpdb

24 Jun, 2017 14 commits

Restore statistics file in gpdbrestore --restore-stats with ddboost · 481cdcef

Committed by Chris Hajas on Jun 21, 2017

The gp_statistics prefix was not included in the list of files to
restore from ddboost, causing the restore to fail when gpdbrestore
--restore-stats was used.

481cdcef

Fix gpcrondump to correctly backup ordered aggregates · 7256defe

Committed by Chris Hajas on Jun 21, 2017

This functionality was included with pg_dump, but was missing from
gpcrondump.

7256defe

Update timestamps, correct example (#2682) · a638dfee

Committed by Jane Beckman on Jun 23, 2017

a638dfee

Fix compiler warning unused function 'register_unlink'. · 5c3be9a0

Committed by Ashwin Agrawal on Jun 21, 2017

5c3be9a0

Delete files on mirror for AO/CO tables. · 99af62c4

Committed by Ashwin Agrawal on Jun 20, 2017

With --enable-segwalrep, the mirror relies on replay of xl_mm_fs_obj records to
delete files. The code did not handle append-only tables correctly because it
called `smgrdounlink()`, which is meant for heap tables and indexes. For AO/CO
tables, the specific single file named in the xlog record must be dropped, which
is done by `MirroredAppendOnly_Drop()`. Without this change, files such as
<relfilenode>.127, <relfilenode>.129, etc. are left behind on the mirror. The
problem had not been seen so far because the master never stores data for AO/CO
tables, so these files are never created on the master. The fix only becomes
necessary now that WAL replication is being enabled for segments.

99af62c4

Change DEBUG1 to LOG for messages under Debug_print_qd_mirroring. · b938a87b

Committed by Ashwin Agrawal on Jun 20, 2017

Setting the GUC Debug_print_qd_mirroring is helpful for debugging, but the
messages were logged at DEBUG1. Enabling and disabling the GUC is enough to
control the logging; a second level is not needed.

b938a87b

Enable xlogging for create fs objects on segments. · 9efec6b2

Committed by Ashwin Agrawal on Jun 20, 2017

In case of --enable-segwalrep, write-ahead logging should not be skipped for
anything, because the mirror is constructed through that mechanism.
Write-ahead logging for these pieces was previously performed only on the
master; with this commit it is enabled for segments as well.

9efec6b2

Make get_filespaces_to_send() work for QEs for default filespaces. · a758cb32

Committed by Ashwin Agrawal on Jun 16, 2017

Currently, get_filespaces_to_send() only works for the QD. To enable
pg_basebackup and WAL replication for QEs, this function must also work on QEs.
The function relies on the pg_filespace_entry table for its information, which
is currently available only on the QD and therefore cannot be used on QEs. Hence
this commit enables only basic support for QEs with the default filespace.
Support for user-defined filespaces and a non-default transaction filespace
will be added incrementally later.

a758cb32

Add gpcheckcat behave test for dependency check · fbe35fe5

Committed by Jimmy Yih on Jun 16, 2017

In this behave test, we delete some entries in pg_depend and in some
related catalog tables to simulate corruption around pg_depend. The
gpcheckcat tool should then flag these.

fbe35fe5

Update gpcheckcat dependency check · 6375e9bc

Committed by Jimmy Yih on Jun 16, 2017

The current gpcheckcat dependency check only checked for extra
pg_depend entries, where a pg_depend entry's objid or refobjid did not
exist as an OID of any catalog table with hasoids set. We also need to
check the reverse scenario, where a catalog entry is missing an entry
in pg_depend. This scenario is difficult to flag because catalog
entries can have multiple unique pg_depend references, or their
dependencies are created later by a query that may add one (e.g.
granting ownership of a database to a certain user). Therefore, we add
a very basic check only against catalog tables whose entries get their
dependencies immediately from the query that creates them.
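
The two directions of the check can be pictured with plain sets. This is only a minimal sketch: the function names, the sample OIDs, and the choice to match on objid for the "missing" direction are assumptions made for illustration, not the queries gpcheckcat actually runs.

```python
# Hypothetical, simplified model: plain Python collections stand in for
# pg_depend rows and for the OIDs found in catalog tables.

def extra_depend_entries(pg_depend, catalog_oids):
    """pg_depend rows whose objid or refobjid matches no catalog OID."""
    return [
        (objid, refobjid)
        for objid, refobjid in pg_depend
        if objid not in catalog_oids or refobjid not in catalog_oids
    ]


def missing_depend_entries(pg_depend, must_have_dependency):
    """OIDs expected to appear as an objid in pg_depend that do not."""
    referenced = {objid for objid, _ in pg_depend}
    return sorted(must_have_dependency - referenced)


catalog_oids = {100, 200, 300}
pg_depend = [(100, 200), (999, 200)]   # 999 no longer exists in any catalog
must_have_dependency = {100, 300}      # objects that get a row at creation time

extra = extra_depend_entries(pg_depend, catalog_oids)
missing = missing_depend_entries(pg_depend, must_have_dependency)
```

Here `extra` holds the dangling row and `missing` holds the catalog OID that has no pg_depend entry.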

6375e9bc

Change gpcheckcat missing/extra check to include pg_depend. · 4bdc7e45

Committed by Jimmy Yih on Jun 09, 2017

We did not check for missing or extra pg_depend entries across the
cluster during gpcheckcat. We would be unaware of scenarios where a
pg_depend entry went missing and the object that used that dependency
was dropped. Those scenarios could lead to leftover catalog entries and
prevent some simple CREATE statements.

4bdc7e45

Fix gpcheckcat output · 843b2109

Committed by Jimmy Yih on Jun 08, 2017

As gpcheckcat builds its mapping of catalog issues, it can flag
objects whose parents no longer exist (e.g. a toast table left over
after dropping a table). When these get caught, gpcheckcat will
unfortunately error out on the reporting step. To prevent erroring
out, we just check for None in the RelationObject's vars during
reporting.

Another issue that is fixed is the repeated reporting of issues on
the database currently being checked after a different database had
been checked. The catalog issues reported were invalid for the current
database and were actually issues from the previous database that was
checked. This was caused by the improper resetting of the GPObjects
and GPObjectGraph global dictionaries. To fix the issue, we properly
use the clear() function to reuse the global variables.
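
The commit message does not say exactly how the dictionaries were reset before. One common way this kind of stale-state bug arises in Python is rebinding a global name to a new dict while other code still holds the old one. The sketch below illustrates that difference; the `Reporter` class and the sample entry are hypothetical, and only the name GPObjects comes from the message.

```python
# Hypothetical stand-in for module-level state in a checker script.
GPObjects = {}


class Reporter:
    """Hypothetical consumer that keeps a reference to the shared dict."""

    def __init__(self):
        self.objects = GPObjects  # same dict object, not a copy


def reset_by_rebinding():
    global GPObjects
    GPObjects = {}       # new dict; existing references keep the old contents


def reset_by_clearing():
    GPObjects.clear()    # empties the dict in place for every holder


reporter = Reporter()
GPObjects["db1.toast_123"] = "orphaned toast table"

reset_by_rebinding()
stale_after_rebind = dict(reporter.objects)  # the db1 issue is still visible

reporter.objects = GPObjects                 # re-attach to the live dict
GPObjects["db1.toast_123"] = "orphaned toast table"
reset_by_clearing()
stale_after_clear = dict(reporter.objects)   # nothing left over
```

With `clear()`, every holder of the dictionary sees the reset, so issues from a previously checked database cannot leak into the next report.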

843b2109

Stop enabling codegen in PR pipeline · fd8854bf

Committed by Foyzur Rahman and Jesse Zhang on Jun 23, 2017

[#147538353]

fd8854bf

Keep gpfdist logs when gptransfer run with gpfdist verbosity flags · 65fb3e09

Committed by Chris Hajas on Jun 16, 2017

When gptransfer is run with the gpfdist-verbose or gpfdist-very-verbose
flags, the gpfdist logs will be kept.
Signed-off-by: Jamie McAtamney <jmcatamney@pivotal.io>

65fb3e09

23 Jun, 2017 9 commits

Revert "Silence compiler warnings in pg_dump." · 8621392d

Committed by Karen Huddleston on Jun 22, 2017

This reverts commit 6a76c5d0.

This commit caused gp_dump_agent to hang during backup.
Signed-off-by: Tom Meyer <tmeyer@pivotal.io>

8621392d

DOCS: Adding extra concepts (#2586) · 52b5e817

Committed by Jane Beckman on Jun 22, 2017

* Clarify gpdb implementation

* Minor updates

* Add column storage info

* Fix typo

* Edits from Mel

* Fix minor typo

* Clarifying table compression

* More on AO storage

* Make append-optimized lower case

52b5e817

Disallow rescanning motion regardless of param change · c977d4de

Committed by Jesse Zhang on Jun 21, 2017

Context: unlike upstream Postgres, not every operator in Greenplum is
rescannable, notably motions. Semantically, a motion is not
rescannable except in one very limited case:

1. When the motion has been initialized but has not yet streamed out
any tuples. Rescanning is permitted here, because it is as good as
not rescanning.

Historically, we checked for exactly this condition, until in
4.2 we added a check to allow rescanning when parameters changed.
Correlated subqueries were cited as the intent of commit ea867177 (private),
which introduced the additional relaxed check. But come to think of it,
motions are not rescannable regardless of parameter change. In fact, if
execution reaches this point, the optimizer must have generated a wrong
plan.

This commit reinstates the original stricter check.
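
As a rough illustration of the stricter rule, the sketch below models a motion that may be rescanned only while it has streamed no tuples. The class, its fields, and the error text are hypothetical and do not mirror the executor code.

```python
class MotionState:
    """Hypothetical stand-in for a motion node's executor state."""

    def __init__(self):
        self.tuples_streamed = 0

    def next_tuple(self):
        self.tuples_streamed += 1

    def rescan(self, params_changed=False):
        # Strict check: a parameter change does not make a rescan legal.
        if self.tuples_streamed > 0:
            raise RuntimeError("illegal rescan of motion node: invalid plan")
        # Nothing has been streamed yet, so rescanning is a no-op.


motion = MotionState()
motion.rescan()                 # allowed: no tuples streamed so far
motion.next_tuple()
try:
    motion.rescan(params_changed=True)
    rescan_rejected = False
except RuntimeError:
    rescan_rejected = True
```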

c977d4de

DOCS: removing DDBoost from OSS build (#2677) · bde7f218

Committed by David Yozie on Jun 22, 2017

* DOCS: removing DDBoost info from OSS build

* DOCS: Removing RHEL 5 reference

bde7f218

Fix dumping of "COUNT(*) FILTER (...)" expressions. · 54d6b71b

Committed by Heikki Linnakangas on Jun 22, 2017

Without this fix, the FILTER expression would be left out of the deparsed
DDL of a view. Now it gets dumped as the CASE WHEN expression that we
transform the FILTER to at parse analysis. Ideally, we would dump it using
the original FILTER syntax, but that would be a much bigger patch. We'll
get that when we merge the upstream FILTER implementation, in PostgreSQL
9.4.

Fixes github issue #1854, reported by @water32.
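
To see why a CASE WHEN expression can stand in for FILTER, the sketch below counts rows with a CASE-based aggregate and compares the result with the filter semantics computed directly in Python. SQLite is used purely for illustration, the table and condition are made up, and the exact CASE expression Greenplum emits is not shown in the commit message.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (x INTEGER)")
conn.executemany("INSERT INTO t VALUES (?)", [(1,), (5,), (9,), (None,)])

# CASE-based form: rows failing the condition yield NULL, which count() skips.
case_count = conn.execute(
    "SELECT count(CASE WHEN x > 3 THEN 1 END) FROM t"
).fetchone()[0]

# Reference semantics of COUNT(*) FILTER (WHERE x > 3), computed in Python.
rows = [row[0] for row in conn.execute("SELECT x FROM t")]
filter_count = sum(1 for x in rows if x is not None and x > 3)

conn.close()
```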

54d6b71b

Adding SQL tests to check successful elimination of alien nodes. (#2663) · ce5aef56

Committed by foyzur on Jun 22, 2017

This PR adds a SQL test to verify that memory consumption of alien nodes drops to zero after we set the GUC execute_pruned_plan=on.
Signed-off-by: Foyzur Rahman <foyzur@gmail.com>

ce5aef56

Remove incorrect optimizations on IS NOT FALSE. · 60787a9e

Committed by Heikki Linnakangas on Jun 22, 2017

Fixes github issue #2130

60787a9e

Revert part of commit , which caused compiler warnings. · 4430c831

Committed by Heikki Linnakangas on Jun 22, 2017

This re-introduces a minor memory leak, per dumped operator. That is
not significant in practice, no-one has enough operators for that to
matter, and we're storing some information in memory for each dumped
operator anyway.

Moreover, this is a divergence from the upstream. In the upstream, this was
fixed in commit b1aebbb6, slightly differently, in a way that doesn't
introduce new compiler warnings. If we must fix this, we should cherry-pick
that commit, and fix it the same way in both pg_dump.c and cdb_dump_agent.c.
But I think this is not worth fixing, and we are better off just leaving
the code as it is in PostgreSQL 8.3.

4430c831

Silence compiler warnings in pg_dump. · 6a76c5d0

Committed by Heikki Linnakangas on Jun 22, 2017

Cherry-pick two upstream commits from PostgreSQL 9.2, to silence compiler
warnings from src/bin/pg_dump. I don't normally advocate for cherry-picking
random things from upstream, but I'm getting pretty annoyed by the
warnings. This will probably cause some minor merge conflicts between now
and 9.2, but nothing major, and the compiler warnings are annoying too.

Fixes github issue #447.

Upstream commits included in this:

commit d923125b
Author: Peter Eisentraut <peter_e@gmx.net>
Date:   Fri Mar 2 22:30:01 2012 +0200

    Fix incorrect uses of gzFile

    gzFile is already a pointer, so code like

    gzFile *handle = gzopen(...)

    is wrong.

    This used to pass silently because gzFile used to be defined as void*,
    and you can assign a void* to a void**.  But somewhere between zlib
    versions 1.2.3.4 and 1.2.6, the definition of gzFile was changed to
    struct gzFile_s *, and with that new definition this usage causes
    compiler warnings.

    So remove all those extra pointer decorations.

    There is a related issue in pg_backup_archiver.h, where

    FILE       *FH;             /* General purpose file handle */

    is used throughout pg_dump as sometimes a real FILE* and sometimes a
    gzFile handle, which also causes warnings now.  This is not yet fixed
    here, because it might need more code restructuring.

commit 19f45565
Author: Peter Eisentraut <peter_e@gmx.net>
Date:   Tue Mar 20 20:38:20 2012 +0200

    pg_dump: Remove undocumented "files" output format

    This was for demonstration only, and now it was creating compiler
    warnings from zlib without an obvious fix (see also
    d923125b), let's just remove it.  The
    "directory" format is presumably similar enough anyway.

6a76c5d0

22 Jun, 2017 17 commits

Don't generate invalid plans with LASJ merge joins. · c1d8e223

Committed by Heikki Linnakangas on Jun 22, 2017

Fixes github issue #2195, reported by @Toknowledge.

c1d8e223

Add library cleaning to gppc · 16be5c69

Committed by Daniel Gustafsson on Jun 22, 2017

Just removing the .o file on clean leaves .a and the shlib files
around which can cause problems when building. Add the clean-lib
targets from Makefile.shlib.

16be5c69

Restore PGREQUIRESSL recognition in libpq · 39170c1c

Committed by Daniel Gustafsson on Jun 22, 2017

This is a partial (documentation part left out) backport of upstream
commit aafbd1df96 which fixes a potential SSL downgrade in libpq.

  commit aafbd1df969135c185947c596c46608fc9f4a67c
  Author: Noah Misch <noah@leadboat.com>
  Date:   Mon May 8 07:24:24 2017 -0700

    Restore PGREQUIRESSL recognition in libpq.

    Commit 65c3bf19 moved handling of the,
    already then, deprecated requiressl parameter into conninfo_storeval().
    The default PGREQUIRESSL environment variable was however lost in the
    change resulting in a potentially silent accept of a non-SSL connection
    even when set.  Its documentation remained.  Restore its implementation.
    Also amend the documentation to mark PGREQUIRESSL as deprecated for
    those not following the link to requiressl.  Back-patch to 9.3, where
    commit 65c3bf19 first appeared.

    Behavior has been more complex when the user provides both deprecated
    and non-deprecated settings.  Before commit 65c3bf19, libpq operated
    according to the first of these found:

      requiressl=1
      PGREQUIRESSL=1
      sslmode=*
      PGSSLMODE=*

    (Note requiressl=0 didn't override sslmode=*; it would only suppress
    PGREQUIRESSL=1 or a previous requiressl=1.  PGREQUIRESSL=0 had no effect
    whatsoever.)  Starting with commit 65c3bf19, libpq ignored PGREQUIRESSL,
    and order of precedence changed to this:

      last of requiressl=* or sslmode=*
      PGSSLMODE=*

    Starting now, adopt the following order of precedence:

      last of requiressl=* or sslmode=*
      PGSSLMODE=*
      PGREQUIRESSL=1

    This retains the 65c3bf19 behavior for connection strings that contain
    both requiressl=* and sslmode=*.  It retains the 65c3bf19 change that
    either connection string option overrides both environment variables.
    For the first time, PGSSLMODE has precedence over PGREQUIRESSL; this
    avoids reducing security of "PGREQUIRESSL=1 PGSSLMODE=verify-full"
    configurations originating under v9.3 and later.

    Daniel Gustafsson

    Security: CVE-2017-7485
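
The final order of precedence described above can be modelled as a small resolver. This is a sketch rather than libpq code: the function name and argument shapes are invented, and the mapping of requiressl=1 to "require", of any other requiressl value to "prefer", and the "prefer" default are stated here as assumptions.

```python
def effective_sslmode(conn_options, env):
    """Resolve sslmode from connection-string options and the environment.

    conn_options: ordered list of (keyword, value) pairs.
    env: dict of environment variables.
    """
    mode = None
    # 1. The last of requiressl=* or sslmode=* in the connection string wins.
    for key, value in conn_options:
        if key == "sslmode":
            mode = value
        elif key == "requiressl":
            # Assumed mapping of the deprecated option onto sslmode values.
            mode = "require" if value.startswith("1") else "prefer"
    if mode is not None:
        return mode
    # 2. PGSSLMODE takes precedence over PGREQUIRESSL.
    if env.get("PGSSLMODE"):
        return env["PGSSLMODE"]
    # 3. PGREQUIRESSL=1 is honoured again; PGREQUIRESSL=0 has no effect.
    if env.get("PGREQUIRESSL", "").startswith("1"):
        return "require"
    return "prefer"  # assumed default


downgrade_blocked = effective_sslmode([], {"PGREQUIRESSL": "1"})
strict_kept = effective_sslmode(
    [], {"PGREQUIRESSL": "1", "PGSSLMODE": "verify-full"}
)
```

The first call shows the restored behavior (an SSL requirement from PGREQUIRESSL alone), and the second shows that PGSSLMODE=verify-full is not weakened by PGREQUIRESSL=1.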

39170c1c

Manipulate callback functions for resource group related operations. · 4ccf54c4

Committed by Richard Guo on Jun 22, 2017

A dedicated list is maintained for resource group related callbacks.
At transaction end, the callback functions are processed in FIFO
order on COMMIT, and in LIFO order on ABORT.
Signed-off-by: Pengzhou Tang <ptang@pivotal.io>
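
The ordering rule can be sketched with an ordinary list. The class and method names below are hypothetical and only illustrate the FIFO-on-commit, LIFO-on-abort behavior; they are not the GPDB API.

```python
class ResGroupCallbacks:
    """Hypothetical registry of transaction-end callbacks."""

    def __init__(self):
        self._callbacks = []

    def register(self, func):
        self._callbacks.append(func)

    def at_transaction_end(self, is_commit):
        # COMMIT runs in registration order (FIFO); ABORT runs in reverse (LIFO).
        ordered = list(self._callbacks) if is_commit else list(reversed(self._callbacks))
        self._callbacks.clear()
        for func in ordered:
            func(is_commit)


commit_log, abort_log = [], []

callbacks = ResGroupCallbacks()
callbacks.register(lambda is_commit: commit_log.append("first"))
callbacks.register(lambda is_commit: commit_log.append("second"))
callbacks.at_transaction_end(is_commit=True)

callbacks.register(lambda is_commit: abort_log.append("first"))
callbacks.register(lambda is_commit: abort_log.append("second"))
callbacks.at_transaction_end(is_commit=False)
```

Running callbacks in reverse on ABORT is the usual way to undo work in the opposite order from how it was set up.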

4ccf54c4

DOCS: removing/conditionalizing pivotal-specific download info; changing... · 4016b54b

Committed by David Yozie on Jun 21, 2017

DOCS: removing/conditionalizing pivotal-specific download info; changing PivotalR references to open source page (#2664)

4016b54b

docs: add --gpfdist-verbose and --gpfdist-very-verbose gptransfer options · 4997e0f0

Committed by Chuck Litzell on Jun 21, 2017

4997e0f0

Allow gptransfer to run on non-Linux machines · 54c82bb2

Committed by Chris Hajas on Jun 21, 2017

Signed-off-by: Jamie McAtamney <jmcatamney@pivotal.io>

54c82bb2

Add --gpfdist-verbose and --gpfdist-very-verbose options to gptransfer · a8aa31e7

Committed by Karen Huddleston on Jun 21, 2017

Signed-off-by: Todd Sedano <professor@gmail.com>

a8aa31e7

For Text/Varchar/Char/Bpchar columns, we should ignore generating StatsBuckets in DXL · 5447a83f

Committed by Ekta Khanna and Jemish Patel on Jun 19, 2017

Instead, we should maintain NDVRemain and NullFreq to do cardinality
estimation.

Adding a function to check whether we need to create a stats bucket in DXL.

Function `FCreateStatsBucket` returns true if the column data type is
not a text/varchar/char/bpchar type.
Signed-off-by: Ekta Khanna <ekhanna@pivotal.io>

5447a83f

Add statistics file to list of pipes in gpcrondump --list-backup-files (#2653) · 03bf5e9c

Committed by Chris Hajas on Jun 21, 2017

Signed-off-by: Chris Hajas <chajas@pivotal.io>

03bf5e9c

DOCS: Remove info about running gpfdist as a Windows service · 0ba485d9

Committed by dyozie on Jun 21, 2017

0ba485d9

Fix gptransfer distribution key quoting in fast mode (#2641) · efe7382e

Committed by Chris Hajas on Jun 21, 2017

Signed-off-by: Jamie McAtamney <jmcatamney@pivotal.io>

efe7382e

Fix typo in comment · 21a72702

Committed by Daniel Gustafsson on Jun 21, 2017

[ci skip]

21a72702

Docs - removing client & loader docs (#2656) · 5732e398

Committed by David Yozie on Jun 21, 2017

* adding psql example to kerberos linux client doc

* removing client/loader maps from main .ditamap

* removing client tool guides from repo

5732e398

Eliminating alien nodes before execution (#2588) · 9b8f5c0b

Committed by foyzur on Jun 21, 2017

In GPDB the dispatcher dispatches the entire plan tree to each query executor (QX). Each QX deserializes the entire plan tree and starts execution from the root of the plan tree. This begins by calling InitPlan on the QueryDesc, which blindly calls ExecInitNode on the root of the plan.

Unfortunately, this is wasteful, in terms of memory and CPU. Each QX is in charge of a single slice. There can be many slices. Looking into plan nodes that belong to other slices, and initializing (e.g., creating PlanState for such nodes) is clearly wasteful. For large plans, particularly planner plans, in the presence of partitions, this can add up to a significant waste.

This PR proposes a fix to solve this problem. The idea is to find the local root for each slice and start ExecInitNode there.

There are a few special cases:

SubPlans are special, as they appear as expression but the expression holds the root of the sub plan tree. All the subplans are bundled in the plannedstmt->subplans, but confusingly as Plan pointers (i.e., we save the root of the SubPlan expression's Plan tree). Therefore, to find the relevant sub plans, we need to first find the relevant expressions and extract their roots and then iterate the plannedstmt->subplans, but only ExecInitNode on the ones that we can reach from some expressions in current slice.

InitPlans are no better, as they can appear anywhere in the Plan tree. Walking from a local motion is not sufficient to find them. Therefore, we need to walk from the root of the plan tree and identify all the SubPlans. Note: unlike a regular subplan, an initplan may not appear in the expression as a subplan; rather, it will appear as a parameter generator in some other part of the tree. We need to find these InitPlans and obtain the SubPlan for each one. We can then use the SubPlan's setParam to copy precomputed parameter values from estate->es_param_list_info to estate->es_param_exec_vals.

We also found that the origSliceIdInPlan is highly unreliable and cannot be used as an indicator of a plan node's slice information. Therefore, we precompute each plan node's slice information to correctly determine if a Plan node is alien or not. This makes alien node identification more accurate. In successive PRs, we plan to use the alien memory account balance as a test to see if we successfully eliminated all aliens. We will also use the alien account balance to determine memory savings.
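
The core idea (precompute each node's slice, find the local root of the current slice, and initialize only from there) can be sketched on a toy plan tree. Everything below is a simplified assumption: the `PlanNode` class and helper names are hypothetical, each motion is treated as the root of its sending slice, and SubPlans and InitPlans are left out entirely.

```python
from dataclasses import dataclass, field
from itertools import count
from typing import List, Optional


@dataclass
class PlanNode:
    name: str
    is_motion: bool = False
    children: List["PlanNode"] = field(default_factory=list)
    slice_id: Optional[int] = None


def assign_slices(root):
    """Precompute a slice id per node; each motion opens a new slice."""
    ids = count(1)

    def walk(node, current):
        node.slice_id = next(ids) if node.is_motion else current
        for child in node.children:
            walk(child, node.slice_id)

    walk(root, 0)


def local_root(node, slice_id):
    """Topmost node of the given slice, where initialization would start."""
    if node.slice_id == slice_id:
        return node
    for child in node.children:
        found = local_root(child, slice_id)
        if found is not None:
            return found
    return None


def nodes_to_init(node, slice_id):
    """Walk down from the local root, skipping alien (other-slice) nodes."""
    if node.slice_id != slice_id:
        return []
    names = [node.name]
    for child in node.children:
        names += nodes_to_init(child, slice_id)
    return names


plan = PlanNode("Gather Motion", True, [
    PlanNode("Hash Join", children=[
        PlanNode("Seq Scan a"),
        PlanNode("Redistribute Motion", True, [PlanNode("Seq Scan b")]),
    ]),
])
assign_slices(plan)
slice_two_nodes = nodes_to_init(local_root(plan, 2), 2)
```

In this toy plan, the executor for slice 2 would initialize only the redistribute motion and the scan beneath it, never the hash join or the other scan.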

9b8f5c0b

Fix CTE related compiler warnings · e40e78fa

Committed by Haisheng Yuan and Kavinder Dhaliwal on Jun 20, 2017

e40e78fa

Assign Rollover as parent account if the parent account is obsolete (#2609) · 2cabdf9d

Committed by foyzur on Jun 21, 2017

Detecting dead parent account and replacing with Rollover during memory accounting array to tree conversion.

* Unit test to check if children of dead parents are serialized as children of Rollover account.
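
A minimal sketch of the re-parenting step during array-to-tree conversion follows. The record layout (dicts with `name`, `parent`, and `live` keys) and the sample accounts are assumptions for illustration, not the serialized format of the memory accounting framework.

```python
ROLLOVER = "Rollover"


def array_to_tree(accounts):
    """Build {parent_name: [child_names]} from a flat list of accounts.

    Children whose parent account is dead are attached to Rollover instead.
    """
    live = {acct["name"] for acct in accounts if acct["live"]}
    tree = {}
    for acct in accounts:
        if not acct["live"] or acct["parent"] is None:
            continue  # dead accounts and the root are not attached anywhere
        parent = acct["parent"] if acct["parent"] in live else ROLLOVER
        tree.setdefault(parent, []).append(acct["name"])
    return tree


accounts = [
    {"name": "Top", "parent": None, "live": True},
    {"name": "Rollover", "parent": "Top", "live": True},
    {"name": "OldQuery", "parent": "Top", "live": False},
    {"name": "HashJoin", "parent": "OldQuery", "live": True},
]
tree = array_to_tree(accounts)
```

Here `HashJoin` ends up under `Rollover` because its original parent `OldQuery` is no longer live.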

2cabdf9d