- 23 9月, 2017 5 次提交
-
-
由 Todd Sedano 提交于
-
由 Taylor Vesely 提交于
In order to view the primary segments' replication stream data from their pg_stat_replication view, we currently need to connect to the primary segment individually via utility mode. To make life easier, we introduce a function that will fetch each primary segment's replication stream data and wrap it with a view named gp_stat_replication. It will now be possible to view all the cluster replication information from the master in a regular psql session. Authors: Taylor Vesely and Jimmy Yih
-
由 Bhuvnesh Chaudhary 提交于
Signed-off-by: NBhuvnesh Chaudhary <bchaudhary@pivotal.io>
-
由 Alexander Denissov 提交于
* run pxf_automation as part of pxf regression test Signed-off-by: NAlexander Denissov <adenissov@pivotal.io> * added missing input Signed-off-by: NLav Jain <ljain@pivotal.io> * Fix symbolic links * create extension pxf before running automation tests * Hack python psi module by copying it from system to gpdb python * remove if exists for extension * Run pxf_automation before regression tests * Change owner to gpadmin before running tests * Generalize copying of PSI package * Generalize install path using GPHOME
-
由 David Yozie 提交于
* Edits to make 'resource management' using resource queues, resources groups, consistent throughout. This is to distinguish between resource management and general workload profiles, as well as to avoid conusion with Workload Manager product. * Edits from Lisa's review
-
- 22 9月, 2017 6 次提交
-
-
由 Daniel Gustafsson 提交于
This merges and backports the upstream commits which replaces the amgetmulti AM function with amgetbitmap, which performs the whole indexscan in one call (for HashBitmap, StreamBitmaps are not affected by this). GPDB was more or less already doing this, as the upstream patch was from the beginning submitted from Greenplum. This commit refactors the AM function to mimick the upstream behavior, while keeping the GPDB API for the callsites. The below commits are included either in full, or in part: commit 4e82a954 Author: Tom Lane <tgl@sss.pgh.pa.us> Date: Thu Apr 10 22:25:26 2008 +0000 Replace "amgetmulti" AM functions with "amgetbitmap", in which the whole indexscan always occurs in one call, and the results are returned in a TIDBitmap instead of a limited-size array of TIDs. This should improve speed a little by reducing AM entry/exit overhead, and it is necessary infrastructure if we are ever to support bitmap indexes. In an only slightly related change, add support for TIDBitmaps to preserve (somewhat lossily) the knowledge that particular TIDs reported by an index need to have their quals rechecked when the heap is visited. This facility is not really used yet; we'll need to extend the forced-recheck feature to plain indexscans before it's useful, and that hasn't been coded yet. The intent is to use it to clean up 8.3's horrid @@@ kluge for text search with weighted queries. There might be other uses in future, but that one alone is sufficient reason. Heikki Linnakangas, with some adjustments by me. commit 1dcf6fdf Author: Teodor Sigaev <teodor@sigaev.ru> Date: Sat Aug 23 10:37:24 2008 +0000 Fix possible duplicate tuples while GiST scan. Now page is processed at once and ItemPointers are collected in memory. Remove tuple's killing by killtuple() if tuple was moved to another page - it could produce unaceptable overhead. Backpatch up to 8.1 because the bug was introduced by GiST's concurrency support. commit b9856b67 Author: Teodor Sigaev <teodor@sigaev.ru> Date: Wed Oct 22 12:53:56 2008 +0000 Fix GiST's killing tuple: GISTScanOpaque->curpos wasn't correctly set. As result, killtuple() marks as dead wrong tuple on page. Bug was introduced by me while fixing possible duplicates during GiST index scan.
-
由 Kavinder Dhaliwal 提交于
Before this commit all memory allocations made by ORCA/GPOS were a blackbox to GPDB. However the ground work had been in place to allow GPDB's Memory Accounting Framework to track memory consumption by ORCA. This commit introduces two new functions Ext_OptimizerAlloc and Ext_OptimizerFree which pass through their parameters to gp_malloc and gp_free and do some bookeeping against the Optimizer Memory Account. This introduces very little overhead to the GPOS memory management framework. Signed-off-by: NMelanie Plageman <mplageman@pivotal.io> Signed-off-by: NSambitesh Dash <sdash@pivotal.io>
-
由 Jacob Champion 提交于
8.4 seems to use more memory during this test. To get master green again, we're checking in these changes to the memory limits for the resource group tests. Follow-up should be on issue #3345; there's a good chance this will not be our final solution to this test failure. Signed-off-by: NTom Meyer <tmeyer@pivotal.io>
-
由 Heikki Linnakangas 提交于
We can encounter tuples that belong to later batches even after the first pass. Revert the comment to the way it is in upstream. I forgot to update
-
由 Heikki Linnakangas 提交于
Noteworthy changes that were not totally straightforward to merge: * Changes in the hash function. This replaces the contents of hashfunc.c directly with REL8_4_STABLE, not just changes otherwise included in the merge batch. That includes later changes to the hash algorithm used. I didn't feel like trying to fix it to an intermediate state that we would just rewrite again later. The hash function had been replaced in GPDB, too, but I couldn't quite figure out what the GPDB algorithm was, and whether it was better or how. In any case, I believe the new PostgreSQL algorithm is decent, so let's just use that. I'm not very impressed by the old code, there was weird stuff going on with the little and big endianess stuff. And at the top, WORDS_BIGENDIAN was misspelled as WORS_BIGENDIAN, so it never worked as intended on big endian systems. Note that GPDB uses a completely different set of hash functions for calculating the DISTRIBUTED BY key, so this doesn't affect pg_upgrade. This does invalidate hash indexes, but they're not supported on GPDB anyway. And we don't support hash partitioning either. * Pattern selectivity functions had been heavily modified in GPDB, but this replaces it with the upstream version. It was not clear to us what the purpose of the GPDB changes were. That ought to be revisited, and there's a GPDB_84_MERGE_FIXME comment about it. * Commit 95c238d9, to make COPY of CSV files faster, was not merged. The function had been heavily modified in GPDB, and it was not immediately clear how to resolve the conflicts. That commit was just a performance enhancement, so we can revisit that later. Added a GPDB_84_MERGE_FIXME comment about that too. * Resurrect the MyXactAccessedTempRel global variable. It's not used for anything in GPDB, as noted in the comment in PrepareTransaction. We had #ifdef'd out the variable, and all the places that set the variable. To reduce future merge conflicts, it seems better to have the variable and keep all the places where it's set unmodified from the upstream, and only comment out the place where it's checked in PrepareTransaction. * heap_release_fetch was removed in upstream, because it was unused. However, it was still used in one GPDB-specific function, in nbtree.c. Replace the call in nbtree.c with a ReleaseBuffer() + heap_fetch(), and add a GPDB_84_MERGE_FIXME to revisit. * This merge included an upstream change to add USE_SEGMENTED_FILES flag, but it was later later in the 8.4 dev cycle. Cherry-pick the change to remove it now, to avoid having to make it work just to remove it later. (commit 3c6248a8) * This adds support for enum-type GUCs, but we do not yet take advantage of that in the GPDB-specific GUCs, except for a few that shared code with client_min_messages and log_min_messages. * Reshuffle a few OIDs to avoid collision. We had reserved OID 1980 for int8_ops opclass. But that is now used for numeric_div_trunc function(), which we just merged in. In the upstream, we have reserved OID 3124 for the opclass, but only since version 9.2. Before that, we used whatever was free at initdb time. But we have been using OID 3124 for the GPDB-specific pg_proc_callback system table To resolve this mess, change the OID of pg_proc_callback from 3124 to 7176, to make 3124 available. And then use 3124 for int8_ops. That leaves 1980 for numeric_div_trunc function(), like in upstream. * TRUNCATE triggers now work, and to make that work, I made some changes to the way statement-level triggers are fired in general. The goal with statement-level triggers is to always execute them on the dispatcher, but they've been broken and unsupported before. At first, I thought these changes would be enough to do that for all statement-level triggers, but testing shows that not quite. So statement-level triggers are broken, like they were before, even though we pass the truncate-trigger tests now. This has been a joint effort between Heikki Linnakangas, Daniel Gustafsson, Jacob Champion and Tom Meyer.
-
由 Lisa Owen 提交于
* docs - add suse11 swapaccount req to resgroup cgroup cfg * must reboot after setting boot parameters
-
- 21 9月, 2017 14 次提交
-
-
由 Heikki Linnakangas 提交于
Ideally, we would use proper error codes, or find some other way to prevent the useless "(plperl.c:2118)" from appearing in PL/perl errors. Later versions of PostgreSQL do that, so we'll get that eventually. In the meanwhile, silence errors caused by code movement in that file. Same as we had done for plperl's own tests already.
-
由 Daniel Gustafsson 提交于
Leverage the core autoconf scaffolding for resolving the dependency on libcurl. Enabling PXF in autoconf now automatically adds libcurl as a dependency. Coupled with the recent commit which relaxes the curl version requirement on macOS, we can remove the library copying from the PXF makefile as well.
-
由 Heikki Linnakangas 提交于
The WITH RECURSIVE test case in 'join_gp' would miss some rows, if the hash algorithm (src/backend/access/hash/hashfunc.c) was replaced with the one from PostgreSQL 8.4, or if statement_mem was lowered from 1000 kB to 700 kB. This is what happened: 1. A tuple belongs to batch 0, and is kept in memory during processing batch 0. 2. The outer scan finishes, and we spill the inner batch 0 from memory to a file, with SpillFirstBatch, and start processing tuple 1 3. While processing batch 1, the number of batches is increased, and the tuple that belonged to batch 0, and was already written to the batch 0's file, is moved, to a later batch. 4. After the first scan is complete, the hash join is re-scanned 5. We reload the batch file 0 into memory. While reloading, we encounter the tuple that now doesn't seem to belong to batch 0, and throw it away. 6. We perform the rest of the re-scan. We have missed any matches to the tuple that was thrown away. It was not part of the later batch files, because in the first pass, it was handled as part of batch 0. But in the re-scan, it was not handled as part of batch 0, because nbatch was now larger, so it didn't belong there. To fix, when reloading a batch file we see a tuple that actually belongs to a later batch file, we write it to that later file. To avoid adding it there multiple times, if the hash join is re-scanned multiple times, if any tuples are moved when reloading a batch file, destroy the batch file and re-create it with just the remaining tuples. This is made a bit complicated by the fact that BFZ temp files don't support appending to a file that's already been rewinded for reading. So what we actually do, is always re-create the batch file, even if there has been no changes to it. I left comments about that, Ideally, we would either support re-appending to BFZ files, or stopped using BFZ workfiles for this altogether (I'm not convinced they're any better than plain BufFiles). But that can be done later. Fixes github issue #3284
-
由 Heikki Linnakangas 提交于
ExecHashTableInsert also increments the counter, so we don't need to do it here. This is harmless AFAICS, the counter isn't used for anything but instrumentation at the moment, but it confused me while debugging.
-
由 Heikki Linnakangas 提交于
It only worked for cursors declared with DECLARE CURSOR, before. You got an "there is no parameter $0" error if you tried. This moves the decision on whether a plan is "simply updatable", from the parser to the planner. Doing it in the parser was awkward, because we only want to do it for queries that are used in a cursor, and for SPI queries, we don't know it at that time yet. For some reason, the copy, out, read-functions of CurrentOfExpr were missing the cursor_param field. While we're at it, reorder the code to match upstream. This only makes the required changes to the Postgres planner. ORCA has never supported updatable cursors. In fact, it will fall back to the Postgres planner on any DECLARE CURSOR command, so that's why the existing tests have passed even with optimizer=off.
-
由 Heikki Linnakangas 提交于
There was code in gp_read_error_log(), to "manually" dispatch the call to all the segments, if it was executed in the dispatcher. This was previously necessary, because even though the function was marked with prodataaccess='s', the planner did not guarantee that it's executed in the segments, when called in the targetlist like "SELECT gp_read_error_log('tab')". Now that we have the EXECUTE ON ALL SEGMENTS syntax, and are more rigorous about enforcing that in the planner, this hack is no longer required.
-
由 Ning Yu 提交于
* resgroup: provide helper funcs for memory usage updates. We used to have complex and duplicate logic to update group & slot memory usage under different context, now we provide two helper functions to increase or decrease memory usage in group and slot. Two bad named functions `attachToSlot()` and `detachFromSlot()` are retired now. * resgroup: provide helper function to unassign a dropped resgroup. * resgroup: move complex checks into helper functions. Many helper functions were added with descriptive names to increase readability of lots of complex checks. Also added a pointer to resource group slot in self. * resgroup: add helper functions for wait queue operations.
-
由 Adam Lee 提交于
$ make -j -s install ... --- subprocess32, Linux only /bin/sh: line 3: [: =: unary operator expected --- stream ... Greenplum Database installation complete. When `$(BLD_ARCH)` is empty, the check becomes `[ = 'aix7_ppc_64' ]`, and gets the unary operator expected error.
-
由 Ashwin Agrawal 提交于
The intend of this extra configuration file is to control the synchronization between primary and mirror for WALREP. The gp_replication.conf is not designed to work with filerep, for example, the scripts like gp_expand will fail since it directly modify the configuration files instead of going through initdb. Signed-off-by: NXin Zhang <xzhang@pivotal.io>
-
-
由 Lav Jain 提交于
-
由 Heikki Linnakangas 提交于
Also change a few regression tests to use the new syntax, instead of gp_toolkit's __gp_localid and __gp_masterid functions.
-
由 Heikki Linnakangas 提交于
We already had a hack for the EXECUTE ON ALL SEGMENTS case, by setting prodataaccess='s'. This exposes the functionality to users via DDL, and adds support for the EXECUTE ON MASTER case. There was discussion on gpdb-dev about also supporting ON MASTER AND ALL SEGMENTS, but that is not implemented yet. There is no handy "locus" in the planner to represent that. There was also discussion about making a gp_segment_id column implicitly available for functions, but that is also not implemented yet. The old behavior was that a function that if a function was marked as IMMUTABLE, it could be executed anywhere. Otherwise it was always executed on the master. For backwards-compatibility, this keeps that behavior for EXECUTE ON ANY (the default), so even if a function is marked as EXECUTE ON ANY, it will always be executed on the master unless it's IMMUTABLE. There is no support for these new options in ORCA. Using any ON MASTER or ON ALL SEGMENTS functions in a query cause ORCA to fall back. This is the same as with the prodataaccess='s' hack that this replaces, but now that it is more user-visible, it would be nice to teach ORCA about it. The new options are only supported for set-returning functions, because for a regular function marked as EXECUTE ON ALL SEGMENTS, it's not clear how the results should be combined. ON MASTER would probably be doable, but there's no need for that right now, so punt. Another restriction is that a function with ON ALL SEGMENTS or ON MASTER can only be used in the FROM clause, or in the target list of a simple SELECT with no FROM clause. So "SELECT func()" is accepted, but "SELECT func() FROM foo" is not. "SELECT * FROM func(), foo" works, however. EXECUTE ON ANY functions, which is the default, work the same as before.
-
由 Bhuvnesh Chaudhary 提交于
If there are aggregation queries with aliases same as the table actual columns and they are propagated further from subqueries and grouping is applied on the column alias it may result in inconsistent targetlists for aggregation plan causing crash. CREATE TABLE t1 (a int) DISTRIBUTED RANDOMLY; SELECT substr(a, 2) as a FROM (SELECT ('-'||a)::varchar as a FROM (SELECT a FROM t1) t2 ) t3 GROUP BY a;
-
- 20 9月, 2017 10 次提交
-
-
由 Pengzhou Tang 提交于
In this commit, we add more detailed memory metrics to the 'memory_usage' column of gp_resgroup_status include current/available memory usage in a group, current/available memory usage for a slot, current/available memory usage for the shared part.
-
由 Gang Xiong 提交于
Previously, waiters waiting on a dropped resource group need to be reassigned to a new group, to achieve it, ResGroupSlotAcquire is modified to be complicated and not easy to understand, this commit refines it. Author: Gang Xiong <gxiong@pivotal.io>
-
由 Pengzhou Tang 提交于
Allow CREATE RESOURCE GROUP and ALTER RESOURCE GROUP to set concurrency to 0, so there will eventually be no running queries after some time, so the resource group can be dropped. On drop all pending queries will be moved to the new resource group assigned to the role; but if the role is also dropped the pending queries will all be canceled. Another thing is we do not allow setting concurrency of admin group to zero, superuser is under admin group and only superuser can alter resource group, so once concurrency of admin group is set to zero, there will be no chance to set it again. Signed-off-by: NNing Yu <nyu@pivotal.io>
-
由 Ming LI 提交于
Because we don't know the data location of the result of SELECT query, ON SEGMENT is forbidden.
-
由 Richard Guo 提交于
This commit does two changes: 1. Remove the restriction that sum of memory_spill_ratio and memory_shared_quota must be no larger than 100. 2. Change the range of memory_spill_ratio to be [0, 100].
-
由 Hubert Zhang 提交于
Function FaultInjectorIdentifierStringToEnum(faultName) pass a const string to a non-const parameter, which cause a build warnig. But on the second thought, we have supported injecting fault by fault name without corresponding fault identifier, so it's better to use faultname instead of fault enum identifier in the ereport.
-
由 Chuck Litzell 提交于
-
由 Taylor Vesely 提交于
Adds a clusterstart command to gpsegwalrep.py allow a user to start a cluster with WALRep configured. This is a developer utility that assumes all cluster replicas are present on localhost, and thus is not intended for production use.
-
由 Lisa Owen 提交于
* docs - memory_spill_ratio guc and related content * operator -> transaction
-
由 Lav Jain 提交于
* Remove dependency on system curl; Fix bug with OSX * Add ifdef for CURLOPT_RESOLVE * Incorporate feedback * brew curl not needed anymore
-
- 19 9月, 2017 5 次提交
-
-
由 Bhuvnesh Chaudhary 提交于
GPOS raises exception with different severity level, but they were being logged to GPDB logs at LOG severity level. This disabled users to not turn off logging for GPOS exceptions, unless GPDB log setting was changed higher than LOG severity level. This is the initial commit which introduces the functionality. If an exception is created without the GPDB severity level, it will default to LOG severity level in GPDB. Signed-off-by: NJemish Patel <jpatel@pivotal.io>
-
由 Bhuvnesh Chaudhary 提交于
-
由 Xin Zhang 提交于
Signed-off-by: NAbhijit Subramanya <asubramanya@pivotal.io>
-
由 Abhijit Subramanya 提交于
Signed-off-by: NXin Zhang <xzhang@pivotal.io>
-
由 Xin Zhang 提交于
New API: void set_gp_replication_config(const char *name, const char *value) This function is inspired by the upstream ALTER SYSTEM command AlterSystemSetConfigFile() from commit 7dfab04a. Once we merged the upstream changes, we can remove this function and directly use the AlterSystemSetConfigFile(). Signed-off-by: NAbhijit Subramanya <asubramanya@pivotal.io>
-