1. 23 Sep 2017: 5 commits
  2. 22 Sep 2017: 6 commits
    • D
      Merge amgetbitmap AM functions. · 1e39a91e
      Committed by Daniel Gustafsson
      This merges and backports the upstream commits which replace the
      amgetmulti AM function with amgetbitmap, which performs the whole
      indexscan in one call (this applies to HashBitmap; StreamBitmaps are
      not affected). GPDB was more or less already doing this, as the
      upstream patch was originally submitted from Greenplum. This commit
      refactors the AM function to mimic the upstream behavior, while
      keeping the GPDB API for the callsites.
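      
      As a reference for the shape of the new API, here is a minimal sketch
      of an amgetbitmap-style entry point for the hash AM (the actual scan
      logic is elided; _hash_first/_hash_next are the hash AM's scan
      helpers). The whole index scan runs in this one call, and matching
      TIDs are pushed into the caller-supplied TIDBitmap:
      
          Datum
          hashgetbitmap(PG_FUNCTION_ARGS)
          {
              IndexScanDesc scan = (IndexScanDesc) PG_GETARG_POINTER(0);
              TIDBitmap  *tbm = (TIDBitmap *) PG_GETARG_POINTER(1);
              int64       ntids = 0;
              bool        res = _hash_first(scan, ForwardScanDirection);
      
              while (res)
              {
                  /* add the current heap TID to the caller's bitmap */
                  tbm_add_tuples(tbm, &scan->xs_ctup.t_self, 1, false);
                  ntids++;
                  res = _hash_next(scan, ForwardScanDirection);
              }
      
              /* amgetbitmap reports the number of TIDs added */
              PG_RETURN_INT64(ntids);
          }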
      
      The commits below are included, either in full or in part:
      
        commit 4e82a954
        Author: Tom Lane <tgl@sss.pgh.pa.us>
        Date:   Thu Apr 10 22:25:26 2008 +0000
      
          Replace "amgetmulti" AM functions with "amgetbitmap", in which the whole
          indexscan always occurs in one call, and the results are returned in a
          TIDBitmap instead of a limited-size array of TIDs.  This should improve
          speed a little by reducing AM entry/exit overhead, and it is necessary
          infrastructure if we are ever to support bitmap indexes.
      
          In an only slightly related change, add support for TIDBitmaps to preserve
          (somewhat lossily) the knowledge that particular TIDs reported by an index
          need to have their quals rechecked when the heap is visited.  This facility
          is not really used yet; we'll need to extend the forced-recheck feature to
          plain indexscans before it's useful, and that hasn't been coded yet.
          The intent is to use it to clean up 8.3's horrid @@@ kluge for text search
          with weighted queries.  There might be other uses in future, but that one
          alone is sufficient reason.
      
          Heikki Linnakangas, with some adjustments by me.
      
        commit 1dcf6fdf
        Author: Teodor Sigaev <teodor@sigaev.ru>
        Date:   Sat Aug 23 10:37:24 2008 +0000
      
          Fix possible duplicate tuples while  GiST scan. Now page is processed
          at once and ItemPointers are collected in memory.
      
          Remove tuple's killing by killtuple() if tuple was moved to another
          page - it could produce unaceptable overhead.
      
          Backpatch up to 8.1 because the bug was introduced by GiST's concurrency support.
      
        commit b9856b67
        Author: Teodor Sigaev <teodor@sigaev.ru>
        Date:   Wed Oct 22 12:53:56 2008 +0000
      
          Fix GiST's killing tuple: GISTScanOpaque->curpos wasn't
          correctly set. As result, killtuple() marks as dead
          wrong tuple on page. Bug was introduced by me while fixing
          possible duplicates during GiST index scan.
      1e39a91e
    • K
      Enable ORCA to be tracked by Mem Accounting · 669dd279
      Committed by Kavinder Dhaliwal
      Before this commit, all memory allocations made by ORCA/GPOS were a
      black box to GPDB. However, the groundwork was already in place to
      allow GPDB's Memory Accounting Framework to track memory consumption
      by ORCA. This commit introduces two new functions,
      Ext_OptimizerAlloc and Ext_OptimizerFree, which
      pass their parameters through to gp_malloc and gp_free and do some
      bookkeeping against the Optimizer memory account. This introduces
      very little overhead to the GPOS memory management framework.
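      
      A rough sketch of the shape of these pass-through wrappers (the
      bookkeeping calls below are hypothetical placeholders, not the actual
      GPDB memory accounting API):
      
          void *
          Ext_OptimizerAlloc(size_t size)
          {
              /* hypothetical: charge the Optimizer memory account */
              OptimizerAccountIncrease(size);
              return gp_malloc(size);
          }
      
          void
          Ext_OptimizerFree(void *ptr, size_t size)
          {
              /* hypothetical: credit the account before releasing */
              OptimizerAccountDecrease(size);
              gp_free(ptr);
          }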
      Signed-off-by: Melanie Plageman <mplageman@pivotal.io>
      Signed-off-by: Sambitesh Dash <sdash@pivotal.io>
      669dd279
    • J
      resgroup isolation2: increase memory limits for 8.4 · 39bb8145
      Committed by Jacob Champion
      8.4 seems to use more memory during this test. To get master green
      again, we're checking in these changes to the memory limits for the
      resource group tests. Follow-up should be on issue #3345; there's a good
      chance this will not be our final solution to this test failure.
      Signed-off-by: Tom Meyer <tmeyer@pivotal.io>
      39bb8145
    • H
      Fix comment, rendered incorrect by commit f7101d98. · a133901a
      Committed by Heikki Linnakangas
      We can encounter tuples that belong to later batches even after the
      first pass. Revert the comment to the way it is in upstream; I forgot
      to update it in that commit.
      a133901a
    • H
      Merge with commit 'f260edb1', from PostgreSQL 8.4devel. · 3b4cd788
      Committed by Heikki Linnakangas
      Noteworthy changes that were not totally straightforward to merge:
      
      * Changes in the hash function. This replaces the contents of hashfunc.c
        directly with REL8_4_STABLE, not just changes otherwise included in the
        merge batch. That includes later changes to the hash algorithm used. I
        didn't feel like trying to fix it to an intermediate state that we would
        just rewrite again later.
      
        The hash function had been replaced in GPDB, too, but I couldn't quite
        figure out what the GPDB algorithm was, or whether and how it was
        better. In any case, I believe the new PostgreSQL algorithm is decent,
        so let's just use that. I'm not very impressed by the old code; there
        was weird stuff going on in the little- and big-endian handling. And
        at the top, WORDS_BIGENDIAN was misspelled as WORS_BIGENDIAN, so it
        never worked as intended on big-endian systems.
      
        Note that GPDB uses a completely different set of hash functions for
        calculating the DISTRIBUTED BY key, so this doesn't affect pg_upgrade.
        This does invalidate hash indexes, but they're not supported on GPDB
        anyway. And we don't support hash partitioning either.
      
      * Pattern selectivity functions had been heavily modified in GPDB, but
        this replaces them with the upstream version. It was not clear to us
        what the purpose of the GPDB changes was. That ought to be revisited,
        and there's a GPDB_84_MERGE_FIXME comment about it.
      
      * Commit 95c238d9, to make COPY of CSV files faster, was not merged.
        The function had been heavily modified in GPDB, and it was not
        immediately clear how to resolve the conflicts. That commit was just a
        performance enhancement, so we can revisit that later. Added a
        GPDB_84_MERGE_FIXME comment about that too.
      
      * Resurrect the MyXactAccessedTempRel global variable. It's not used for
        anything in GPDB, as noted in the comment in PrepareTransaction. We had
        #ifdef'd out the variable, and all the places that set the variable. To
        reduce future merge conflicts, it seems better to have the variable and
        keep all the places where it's set unmodified from the upstream, and only
        comment out the place where it's checked in PrepareTransaction.
      
      * heap_release_fetch was removed in upstream, because it was unused.
        However, it was still used in one GPDB-specific function, in nbtree.c.
        Replace the call in nbtree.c with a ReleaseBuffer() + heap_fetch(),
        and add a GPDB_84_MERGE_FIXME to revisit (see the sketch after this
        list).
      
      * This merge included an upstream change to add a USE_SEGMENTED_FILES
        flag, but it was removed later in the 8.4 dev cycle. Cherry-pick the
        change to remove it now, to avoid having to make it work just to
        remove it later. (commit 3c6248a8)
      
      * This adds support for enum-type GUCs, but we do not yet take advantage
        of that in the GPDB-specific GUCs, except for a few that shared code
        with client_min_messages and log_min_messages.
      
      * Reshuffle a few OIDs to avoid collision. We had reserved OID 1980 for
        the int8_ops opclass. But that is now used for the numeric_div_trunc()
        function, which we just merged in. In the upstream, we have reserved
        OID 3124 for the opclass, but only since version 9.2. Before that, we
        used whatever was free at initdb time. But we have been using OID 3124
        for the GPDB-specific pg_proc_callback system table.
      
        To resolve this mess, change the OID of pg_proc_callback from 3124 to
        7176, to make 3124 available. And then use 3124 for int8_ops. That
        leaves 1980 for the numeric_div_trunc() function, like in upstream.
      
      * TRUNCATE triggers now work, and to make that work, I made some changes
        to the way statement-level triggers are fired in general. The goal
        with statement-level triggers is to always execute them on the
        dispatcher, but they've been broken and unsupported before. At first,
        I thought these changes would be enough to do that for all
        statement-level triggers, but testing shows that they are not quite
        sufficient. So statement-level triggers are broken, like they were
        before, even though we pass the truncate-trigger tests now.
      
      This has been a joint effort between Heikki Linnakangas, Daniel Gustafsson,
      Jacob Champion and Tom Meyer.
      3b4cd788
    • L
      docs - add suse11 swapaccount req to resgroup cgroup cfg (#3323) · 430e7343
      Committed by Lisa Owen
      * docs - add suse11 swapaccount req to resgroup cgroup cfg
      
      * must reboot after setting boot parameters
      430e7343
  3. 21 Sep 2017: 14 commits
    • H
      Mask out differences in plperl.c line numbers in errors. · 8b153171
      Committed by Heikki Linnakangas
      Ideally, we would use proper error codes, or find some other way to
      prevent the useless "(plperl.c:2118)" from appearing in PL/perl errors.
      Later versions of PostgreSQL do that, so we'll get that eventually. In
      the meantime, silence errors caused by code movement in that file, the
      same as we had already done for plperl's own tests.
      8b153171
    • D
      Use autoconf for resolving PXF library dependency · 6f1ca717
      Committed by Daniel Gustafsson
      Leverage the core autoconf scaffolding for resolving the dependency
      on libcurl. Enabling PXF in autoconf now automatically adds libcurl
      as a dependency. Coupled with the recent commit which relaxes the
      curl version requirement on macOS, we can remove the library copying
      from the PXF makefile as well.
      6f1ca717
    • H
      Fix bug in handling re-scan of a hash join. · f7101d98
      Committed by Heikki Linnakangas
      The WITH RECURSIVE test case in 'join_gp' would miss some rows if the
      hash algorithm (src/backend/access/hash/hashfunc.c) was replaced with
      the one from PostgreSQL 8.4, or if statement_mem was lowered from
      1000 kB to 700 kB. This is what happened:
      
      1. A tuple belongs to batch 0, and is kept in memory during processing
         batch 0.
      
      2. The outer scan finishes, and we spill the inner batch 0 from memory
         to a file, with SpillFirstBatch, and start processing batch 1.
      
      3. While processing batch 1, the number of batches is increased, and
         the tuple that belonged to batch 0, and was already written to
         batch 0's file, is moved to a later batch.
      
      4. After the first scan is complete, the hash join is re-scanned
      
      5. We reload batch file 0 into memory. While reloading, we encounter
         the tuple that now doesn't seem to belong to batch 0, and throw it
         away.
      
      6. We perform the rest of the re-scan. We have missed any matches to the
         tuple that was thrown away. It was not part of the later batch files,
         because in the first pass, it was handled as part of batch 0. But in
         the re-scan, it was not handled as part of batch 0, because nbatch was
         now larger, so it didn't belong there.
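      
      To see why step 3 moves tuples at all, recall how hashjoin assigns a
      batch from a tuple's hash value. This small standalone program mirrors
      the upstream ExecHashGetBucketAndBatch() logic (the hash value and
      sizes are made up for illustration):
      
          #include <stdio.h>
          #include <stdint.h>
      
          /* batch number = hash bits above the bucket bits, masked by nbatch */
          static int
          batch_for(uint32_t hashvalue, int log2_nbuckets, int nbatch)
          {
              return (int) ((hashvalue >> log2_nbuckets) & (nbatch - 1));
          }
      
          int
          main(void)
          {
              uint32_t h = 0xDEADBEEF;
      
              /* first pass: the tuple hashes to batch 3 of 4 */
              printf("nbatch=4 -> batch %d\n", batch_for(h, 10, 4));
              /* after nbatch doubles, the same tuple maps to batch 7 of 8 */
              printf("nbatch=8 -> batch %d\n", batch_for(h, 10, 8));
              return 0;
          }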
      
      To fix, when we encounter a tuple while reloading a batch file that
      actually belongs to a later batch file, we write it to that later file.
      To avoid adding it there multiple times if the hash join is re-scanned
      multiple times: whenever any tuples are moved while reloading a batch
      file, destroy the batch file and re-create it with just the remaining
      tuples.
      
      This is made a bit complicated by the fact that BFZ temp files don't
      support appending to a file that's already been rewound for reading. So
      what we actually do is always re-create the batch file, even if there
      have been no changes to it. I left comments about that. Ideally, we
      would either support re-appending to BFZ files, or stop using BFZ
      workfiles for this altogether (I'm not convinced they're any better
      than plain BufFiles). But that can be done later.
      
      Fixes github issue #3284
      f7101d98
    • H
      Don't double-count inner tuples reloaded from file. · 429ff8c4
      Committed by Heikki Linnakangas
      ExecHashTableInsert also increments the counter, so we don't need to do
      it here. This is harmless AFAICS; the counter isn't used for anything
      but instrumentation at the moment, but it confused me while debugging.
      429ff8c4
    • H
      Fix CURRENT OF to work with PL/pgSQL cursors. · 91411ac4
      Committed by Heikki Linnakangas
      Before, it only worked for cursors declared with DECLARE CURSOR. You
      got a "there is no parameter $0" error if you tried. This moves the
      decision on whether a plan is "simply updatable" from the parser to
      the planner. Doing it in the parser was awkward, because we only want
      to do it for queries that are used in a cursor, and for SPI queries we
      don't know that at parse time yet.
      
      For some reason, the copy, out, and read functions of CurrentOfExpr
      were missing the cursor_param field. While we're at it, reorder the
      code to match upstream.
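      
      For illustration, the copy-function part of that fix looks roughly like
      this (a sketch following the upstream copyfuncs.c conventions; the out
      and read functions gain the matching field):
      
          static CurrentOfExpr *
          _copyCurrentOfExpr(CurrentOfExpr *from)
          {
              CurrentOfExpr *newnode = makeNode(CurrentOfExpr);
      
              COPY_SCALAR_FIELD(cvarno);
              COPY_STRING_FIELD(cursor_name);
              COPY_SCALAR_FIELD(cursor_param);    /* was missing in GPDB */
      
              return newnode;
          }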
      
      This only makes the required changes to the Postgres planner. ORCA has
      never supported updatable cursors. In fact, it will fall back to the
      Postgres planner on any DECLARE CURSOR command, which is why the
      existing tests have passed even with optimizer=on.
      91411ac4
    • H
      Remove now-unnecessary code from gp_read_error_log to dispatch the call. · 4035881e
      Committed by Heikki Linnakangas
      There was code in gp_read_error_log() to "manually" dispatch the call
      to all the segments, if it was executed in the dispatcher. This was
      previously necessary because, even though the function was marked with
      prodataaccess='s', the planner did not guarantee that it would be
      executed on the segments when called in the targetlist, as in "SELECT
      gp_read_error_log('tab')". Now that we have the EXECUTE ON ALL SEGMENTS
      syntax, and are more rigorous about enforcing that in the planner, this
      hack is no longer required.
      4035881e
    • N
      Refactor resource group source code, part 2. · a2cf9bdf
      Committed by Ning Yu
      * resgroup: provide helper funcs for memory usage updates.
      
      We used to have complex and duplicated logic to update group and slot
      memory usage in different contexts; now we provide two helper functions
      to increase or decrease memory usage in the group and slot (a rough
      sketch follows at the end of this list).
      
      Two badly named functions, `attachToSlot()` and `detachFromSlot()`, are
      now retired.
      
      * resgroup: provide helper function to unassign a dropped resgroup.
      
      * resgroup: move complex checks into helper functions.
      
      Many helper functions were added with descriptive names, to increase
      the readability of a number of complex checks.
      
      Also added a pointer to the resource group slot in `self`.
      
      * resgroup: add helper functions for wait queue operations.
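      
      A hypothetical sketch of the memory-usage helpers described in the
      first item above (names and struct layouts are assumed for
      illustration, not the actual GPDB definitions):
      
          static void
          groupIncMemUsage(ResGroupData *group, ResGroupSlotData *slot,
                           int32 chunks)
          {
              slot->memUsage += chunks;       /* per-slot accounting */
              group->memUsage += chunks;      /* group-wide accounting */
          }
      
          static void
          groupDecMemUsage(ResGroupData *group, ResGroupSlotData *slot,
                           int32 chunks)
          {
              slot->memUsage -= chunks;
              group->memUsage -= chunks;
          }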
      a2cf9bdf
    • A
      Fix aix7_ppc_64 making script · 15c04803
      Committed by Adam Lee
          $ make -j -s install
          ...
          --- subprocess32, Linux only
          /bin/sh: line 3: [: =: unary operator expected
          --- stream
          ...
          Greenplum Database installation complete.
      
      When `$(BLD_ARCH)` is empty, the check becomes `[ = 'aix7_ppc_64' ]`,
      which produces the "unary operator expected" error.
      15c04803
    • A
      Make gp_replication.conf for USE_SEGWALREP only. · b7ce6930
      Committed by Ashwin Agrawal
      The intent of this extra configuration file is to control the
      synchronization between primary and mirror for WALREP.
      
      The gp_replication.conf is not designed to work with filerep; for
      example, scripts like gp_expand will fail, since they directly modify
      the configuration files instead of going through initdb.
      Signed-off-by: Xin Zhang <xzhang@pivotal.io>
      b7ce6930
    • A
      d60e2389
    • L
    • H
      Take advantage of the new EXECUTE ON syntax in gp_toolkit. · 9a039e4f
      Committed by Heikki Linnakangas
      Also change a few regression tests to use the new syntax, instead of
      gp_toolkit's __gp_localid and __gp_masterid functions.
      9a039e4f
    • H
      Add support for CREATE FUNCTION EXECUTE ON [MASTER | ALL SEGMENTS] · aa148d2a
      Committed by Heikki Linnakangas
      We already had a hack for the EXECUTE ON ALL SEGMENTS case, by setting
      prodataaccess='s'. This exposes the functionality to users via DDL, and adds
      support for the EXECUTE ON MASTER case.
      
      There was discussion on gpdb-dev about also supporting ON MASTER AND ALL
      SEGMENTS, but that is not implemented yet. There is no handy "locus" in the
      planner to represent that. There was also discussion about making a
      gp_segment_id column implicitly available for functions, but that is also
      not implemented yet.
      
      The old behavior was that if a function was marked as IMMUTABLE, it
      could be executed anywhere. Otherwise, it was always executed on the
      master. For backwards compatibility, this keeps that behavior for
      EXECUTE ON ANY (the default): even if a function is marked as EXECUTE
      ON ANY, it will always be executed on the master unless it's IMMUTABLE.
      
      There is no support for these new options in ORCA. Using any ON MASTER
      or ON ALL SEGMENTS function in a query causes ORCA to fall back. This
      is the same as with the prodataaccess='s' hack that this replaces, but
      now that it is more user-visible, it would be nice to teach ORCA about
      it.
      
      The new options are only supported for set-returning functions, because for
      a regular function marked as EXECUTE ON ALL SEGMENTS, it's not clear how
      the results should be combined. ON MASTER would probably be doable, but
      there's no need for that right now, so punt.
      
      Another restriction is that a function with ON ALL SEGMENTS or ON MASTER can
      only be used in the FROM clause, or in the target list of a simple SELECT
      with no FROM clause. So "SELECT func()" is accepted, but "SELECT func() FROM
      foo" is not. "SELECT * FROM func(), foo" works, however. EXECUTE ON ANY
      functions, which is the default, work the same as before.
      aa148d2a
    • B
      Fix multistage aggregation plan targetlists · 41640e69
      Committed by Bhuvnesh Chaudhary
      If an aggregation query uses a column alias that is the same as one of
      the table's actual columns, the alias is propagated up from subqueries,
      and grouping is applied on that alias, the plan's aggregation
      targetlists can become inconsistent, causing a crash. For example:
      
      	CREATE TABLE t1 (a int) DISTRIBUTED RANDOMLY;
      	SELECT substr(a, 2) as a
      	FROM
      		(SELECT ('-'||a)::varchar as a
      			FROM (SELECT a FROM t1) t2
      		) t3
      	GROUP BY a;
      41640e69
  4. 20 Sep 2017: 10 commits
  5. 19 Sep 2017: 5 commits