1. 17 Mar 2017 (1 commit)
    • H
      Fix bug that can't generate equality partition filters for multilevel partition table · e99b9fef
      Committed by Haisheng Yuan
      The equality partition filter of the partition selector works well for
      single-level partition tables, but not for multilevel ones. For example,
      with 2 levels using pk1 and pk2 as the partition keys for levels 1 and 2,
      if the query contains an equality predicate such as pk1 = 2, the level-2
      equality filter is null, so the function `FEqPartFiltersAllLevels` returns
      false, causing the equality predicate to be put into PartFilters instead
      of PartEqFilters. This patch fixes that bug.
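A minimal sketch of the fixed check (hypothetical Python; the function name mirrors `FEqPartFiltersAllLevels`, but the filter representation is illustrative, not ORCA's actual code): a level with no equality filter at all should not disqualify the predicate.

```python
def eq_part_filters_all_levels(level_filters):
    """Sketch of the fixed all-levels check: a level whose equality filter
    is None (no predicate on that partitioning key) is acceptable; only a
    non-equality filter disqualifies the predicate from PartEqFilters."""
    has_eq = False
    for f in level_filters:
        if f is None:
            continue                      # no filter at this level: fine
        if not f.get("is_equality", False):
            return False                  # non-equality filter: disqualify
        has_eq = True
    return has_eq

# Two-level table, equality predicate only on the level-1 key (pk1 = 2):
filters = [{"is_equality": True, "expr": "pk1 = 2"}, None]
print(eq_part_filters_all_levels(filters))  # True with the fix
```

Before the fix, by this reading, the null level-2 filter made the check return false even though the level-1 predicate was a pure equality.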
      
      [#141826453]
      e99b9fef
  2. 16 Mar 2017 (2 commits)
  3. 11 Mar 2017 (1 commit)
    • O
      [#141511349] Improve HashMap iterator implementation · 4d9e03a8
      Committed by Omer Arap
      Currently, the HashMapIter implementation scans through all hash map
      buckets to get the next existing hash chain. This degrades performance
      significantly.
      
      This commit improves the iterator implementation by maintaining a
      dynamic key array which holds the existing keys in HashMap. The
      iteration is done using this array.
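The idea can be sketched as follows (a hypothetical Python analogue; ORCA's actual `HashMapIter` is C++ and differs in detail):

```python
class HashMapWithKeyArray:
    """Hash map that also maintains a dynamic array of inserted keys,
    so iteration visits only existing entries instead of scanning
    every (possibly empty) bucket."""
    def __init__(self):
        self._buckets = {}
        self._keys = []   # dynamic key array maintained on insert

    def insert(self, key, value):
        if key not in self._buckets:
            self._keys.append(key)
        self._buckets[key] = value

    def __iter__(self):
        # Iterate via the key array: O(number of entries),
        # independent of the bucket count.
        for key in self._keys:
            yield key, self._buckets[key]

m = HashMapWithKeyArray()
m.insert("a", 1)
m.insert("b", 2)
print(list(m))  # [('a', 1), ('b', 2)]
```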
      Signed-off-by: Dhanashree Kashid <dkashid@pivotal.io>
      4d9e03a8
  4. 08 Mar 2017 (5 commits)
    • O
      Bump ORCA version to 2.10 · 8263c6b4
      Committed by Omer Arap
      8263c6b4
    • H
      Remove wrappers over standard C99 math functions. · c779beb7
      Committed by Heikki Linnakangas
      Makes the code simpler for humans, and also allows the compiler to "see"
      what the operations are and optimize accordingly.
      
      I hoped that the GPDB compiler warnings about failed inlining with
      -Winline would go away with commit 5f774da5. It turns out that commit
      was not enough, unfortunately, but this commit does the trick.
      c779beb7
    • B
      Bump ORCA version to 2.9 · a8865d7d
      Committed by Bhuvnesh Chaudhary
      Signed-off-by: Omer Arap <oarap@pivotal.io>
      a8865d7d
    • B
      Added test cases for equivalence classes · 83aac2cb
      Committed by Bhuvnesh Chaudhary
      Added a utility function to compare two Equivalence Class
      arrays.
      Signed-off-by: Omer Arap <oarap@pivotal.io>
      83aac2cb
    • O
      [#140601033] Optimize equivalence classes intersection · 66c3c843
      Committed by Omer Arap
      This commit introduces a better algorithm for computing the intersection
      of two sets of equivalence classes. The input sets are arrays of column
      reference sets, each of which denotes an individual equivalence class.
      
      Since equivalence classes are disjoint by definition, the running time
      can be reduced from quadratic to linear by using a HashMap.
      
      We build a hash map using the columns as keys and the equivalence
      classes as the values.
      
      Below is a sample input and output scenario.
      
      `Classes1`: `[(a,b),(c,d,e),(f,g)]`
      `Classes2`: `[(a),(b,c,d),(e),(f,g)]`
      
      The hashmap after first iteration on `Classes1`:
      `a->(a,b)`
      `b->(a,b)`
      `c->(c,d,e)`
      `d->(c,d,e)`
      `e->(c,d,e)`
      `f->(f,g)`
      `g->(f,g)`
      
      In the probe iteration, once we detect an intersection, we replace the
      intersected columns' entries in the HashMap with an empty set to avoid
      duplication.
      
      First, `(a)` is read and column `a` is probed: `map[a] = (a,b)`. The
      intersection of `(a,b)` and `(a)` is `(a)`, which is added to the result
      list. `a`'s entry in the map is emptied. The hash map after this step
      looks like this:
      `a->()`
      `b->(a,b)`
      `c->(c,d,e)`
      `d->(c,d,e)`
      `e->(c,d,e)`
      `f->(f,g)`
      `g->(f,g)`
      
      Then, `(b,c,d)` is processed. First, column `b` is used to probe the hash
      map, which returns `(a,b)`. The intersection is `(b)`, which is appended
      to the result list, and `b`'s entry in the map is emptied. Hash map after
      this step:
      `a->()`
      `b->()`
      `c->(c,d,e)`
      `d->(c,d,e)`
      `e->(c,d,e)`
      `f->(f,g)`
      `g->(f,g)`
      
      Then column `c` is looked up and the hash map returns `(c,d,e)`. The
      intersection of `(b,c,d)` and `(c,d,e)` is `(c,d)`, which is added to the
      result list. `c`'s and `d`'s entries are also emptied, leaving the hash
      map as below:
      `a->()`
      `b->()`
      `c->()`
      `d->()`
      `e->(c,d,e)`
      `f->(f,g)`
      `g->(f,g)`
      
      So, when we then look up column `d`, the last column of `(b,c,d)`, its
      entry in the hash map is empty, and the duplicate intersection result
      `(c,d)` is not produced again.
      
      The final result will be `[(a),(b),(c,d),(e),(f,g)]`
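The walkthrough above can be condensed into a short sketch (hypothetical Python, using lists of column names in place of ORCA's column reference sets; not the actual implementation):

```python
def intersect_equivalence_classes(classes1, classes2):
    """Linear-time intersection of two disjoint equivalence-class
    partitions. Build a hash map from each column to its class in
    classes1, then probe it with the columns of classes2, emptying
    intersected entries so no duplicate intersection is produced."""
    col_to_class = {}
    for ec in classes1:
        members = set(ec)
        for col in ec:
            col_to_class[col] = members

    result = []
    for ec in classes2:
        probe = set(ec)
        for col in ec:
            common = col_to_class.get(col, set()) & probe
            if common:
                result.append(sorted(common))
                for c in common:
                    col_to_class[c] = set()  # avoid duplicate results
    return result

classes1 = [["a", "b"], ["c", "d", "e"], ["f", "g"]]
classes2 = [["a"], ["b", "c", "d"], ["e"], ["f", "g"]]
print(intersect_equivalence_classes(classes1, classes2))
# [['a'], ['b'], ['c', 'd'], ['e'], ['f', 'g']]
```

Each column is inserted once and probed at most once, which is the linear running time the commit describes.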
      Signed-off-by: Taylor Vesely <tvesely@pivotal.io>
      66c3c843
  5. 07 Mar 2017 (6 commits)
  6. 23 Feb 2017 (2 commits)
    • B
      Bump ORCA version to 2.7 [#139042295] · aa466bfe
      Committed by Bhuvnesh Chaudhary
      Signed-off-by: Taylor Vesely <tvesely@pivotal.io>
      aa466bfe
    • O
      [#139042295] Dedup constraints in linear time (#143) · 319a6eb9
      Committed by Omer Arap
      * [#139042295] Dedup constraints in linear time
      
      GP Orca rearranges the constraints list so that, as much as possible,
      each constraint refers to only a single column reference. To do that,
      the previous implementation traversed the list for every single
      constraint, which results in O(n^2) (quadratic) running time.
      
      If the number of constraints in the list is higher than some threshold
      (hardcoded as 5 for now), this causes significant performance
      degradation.
      
      This commit uses a hash map from column references to the constraints
      that refer only to that single column. This reduces the deduplication
      process from quadratic to linear running time. However, creating a hash
      map for a short list of constraints also results in performance
      degradation.
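A minimal sketch of the approach (hypothetical Python; constraints are modeled as (columns, predicate) pairs, which is not ORCA's actual representation):

```python
def dedup_single_column_constraints(constraints):
    """Group single-column constraints by their column in one pass (O(n))
    instead of re-scanning the list for every constraint (O(n^2)).
    Constraints over more than one column are left alone."""
    by_column = {}
    others = []
    for cols, pred in constraints:
        if len(cols) == 1:
            by_column.setdefault(cols[0], []).append(pred)
        else:
            others.append((cols, pred))
    return by_column, others

constraints = [
    (["a"], "a > 1"),
    (["a"], "a < 10"),
    (["b"], "b = 3"),
    (["a", "b"], "a < b"),
]
grouped, rest = dedup_single_column_constraints(constraints)
print(grouped)  # {'a': ['a > 1', 'a < 10'], 'b': ['b = 3']}
print(rest)     # [(['a', 'b'], 'a < b')]
```

The single hash-map pass replaces the per-constraint list traversal; for very short lists the map's setup cost can outweigh the savings, as the commit notes.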
      Signed-off-by: Omer Arap <oarap@pivotal.io>
      319a6eb9
  7. 16 Feb 2017 (1 commit)
    • H
      Fix bug that ORCA cannot do DPE for array coerce predicate · f112264b
      Committed by Haisheng Yuan
      Before this patch, Orca could not extract the array elements inside a
      ScalarArrayCmp if the array was wrapped in an ArrayCoerceExpr. This patch
      extracts the array inside, letting Orca expand the array into
      disjunctions and do DPE for predicates with array coerce expressions.
      
      [#138085391]
      f112264b
  8. 15 Feb 2017 (1 commit)
    • H
      Remove unused support for dynamic min/max absolute values in CDouble. · 5f774da5
      Committed by Heikki Linnakangas
      It was possible to specify a minimum and maximum absolute value for a
      CDouble in the constructor. That would allow using different limits in
      different places. However, that facility was unused; no caller passed
      non-default min/max values. So remove the unnecessary flexibility, and
      just use the same default limits everywhere.
      
      I was seeing compiler warnings about failed inlining in GPDB from
      CDouble's CheckValidity function, with -Winline. As well as saving some
      memory and some cycles in all operations on CDoubles, and making the code
      simpler, I'm hoping that this will make those warnings go away.
      5f774da5
  9. 21 Jan 2017 (9 commits)
    • E
      Bump ORCA version to 2.5 [#138076347] · 51779e96
      Committed by Ekta Khanna and Jesse Zhang
      This version should bring much-needed fix that unblocks everybody on a
      Mac. Kudos to Heikki Linnakangas and Ekta Khanna.
      51779e96
    • E
      3a0921d8
    • H
      Fix populating config.h. · c60f5ea5
      Committed by Heikki Linnakangas
      Commit 76af8194, which generates a config.h with all the required GPOS_*
      flags that affect the API of the resulting binaries (most notably
      GPOS_DEBUG), was broken. It didn't derive the GPOS_DEBUG flag from
      CMAKE_BUILD_TYPE as it should have. It did the right thing if you set the
      "GPOS_DEBUG=1" property on the cmake command line, but that property was
      not set automatically with CMAKE_BUILD_TYPE=Debug. CMAKE_BUILD_TYPE=Debug
      added GPOS_DEBUG=1 to the COMPILE_DEFINITIONS property, but that's
      different from having a stand-alone GPOS_DEBUG property.
      
      Likewise, the GPOS_<arch> and GPOS_32BIT/64BIT variables were also not
      set correctly. The reason I didn't notice this while testing was that the
      flags in COMPILE_DEFINITIONS were still set, even though, had config.h
      been generated correctly, they would not have been necessary anymore.
      
      To fix, remove those flags from COMPILE_DEFINITIONS so that they are no
      longer passed on the compiler command line. The defines are now always
      read from the config.h file, where they are set correctly.
      
      Per report off-list by Jesse Zhang.
      c60f5ea5
    • V
      Orca should skip collapsing projects when it has multiple set returning... · 9aa15351
      Committed by Venkatesh Raghavan
      Orca should skip collapsing projects when it has multiple set returning functions and one of the project elements cannot be collapsed [#137642207]
      
      If a project element of a subquery has a set returning function, it should not be merged with its child, since that may generate wrong results under the following conditions:
      
      * The child's project list contains a set returning function (this was fixed in a prior fix).
      * The current project list has other set returning functions that may not be merged with the child because they use the child's output; see the example below. Splitting these project elements would change the semantics of the query (fixed by the current fix).
      
      Example SQL:
      
      select generate_series(0,2) rn, unnest(arr) cnt_div from (select ARRAY['0','1','2', '3'] as arr) arra;
      
      The SQL has two project nodes:
      * The top project node with two set returning functions.
      * The lower ARRAY function over a constant table get.
      
      ```
      
      +--CLogicalProject
         |--CLogicalProject
         |  |--CLogicalConstTableGet Columns: ["" (0)] Values: [(1)]
         |  +--CScalarProjectList
         |     +--CScalarProjectElement "arr" (1)
         |        +--CScalarArray: {eleMDId: (25,1.0), arrayMDId: (1009,1.0)}
         |           |--CScalarConst (161088044.000)
         |           |--CScalarConst (161096236.000)
         |           +--CScalarConst (161104428.000)
         +--CScalarProjectList
            |--CScalarProjectElement "rn" (2)
            |  +--CScalarFunc (generate_series)
            |     |--CScalarConst (0)
            |     +--CScalarConst (2)
            +--CScalarProjectElement "cnt_div" (3)
               +--CScalarFunc (unnest)
                  +--CScalarIdent "arr" (1)
      ```
      
      The top project node has two set returning functions:
      * generate_series - has no dependence on the child's project list, so it can be collapsed.
      * unnest(arr) - takes as input `arr`, which comes from the child's project list, so it cannot be collapsed.
      
      Before this fix, we collapsed the first function but not the second, like below.
      
      ```
      Algebrized preprocessed query:
      +--CLogicalProject
         |--CLogicalProject
         |  |--CLogicalConstTableGet Columns: ["" (0)] Values: [(1)]
         |  +--CScalarProjectList
         |     |--CScalarProjectElement "rn" (2)
         |     |  +--CScalarFunc (generate_series)
         |     |     |--CScalarConst (0)
         |     |     +--CScalarConst (2)
         |     +--CScalarProjectElement "arr" (1)
         |        +--CScalarArray: {eleMDId: (25,1.0), arrayMDId: (1009,1.0)}
         |           |--CScalarConst (161088044.000)
         |           |--CScalarConst (161096236.000)
         |           +--CScalarConst (161104428.000)
         +--CScalarProjectList
            +--CScalarProjectElement "cnt_div" (3)
               +--CScalarFunc (unnest)
                  +--CScalarIdent "arr" (1)
      ```
      
      This caused wrong results, since it changes the SQL semantics when there
      are multiple set returning functions in a single project list. It is all
      or nothing: either all project elements are collapsed, or none are.
      9aa15351
    • O
      Bump orca version 2.3.0 [#117186547] · 5ab1e794
      Committed by Omer Arap
      Signed-off-by: Haisheng Yuan <hyuan@pivotal.io>
      5ab1e794
    • E
    • E
    • B
      aad4e7fa
    • J
      Append `.0` to version string · 78b6ccd5
      Committed by Jesse Zhang
      This patch adds a `.0` (or as they call it over semver.org, the "patch"
      number) to our version. This makes our version semver-compliant. It also
      makes it abundantly clear that our version number (say `2.2.0`) is not a
      decimal floating point number. ("What is newer? 2.2 or 2.19?")
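The "2.2 or 2.19" point is easy to demonstrate (a small illustrative snippet, not part of the commit):

```python
# As decimal numbers, 2.2 > 2.19; as versions, 2.19.0 is newer than 2.2.0.
print(2.2 > 2.19)  # True

def version_tuple(v):
    """Compare versions component-wise, as semver does."""
    return tuple(int(part) for part in v.split("."))

print(version_tuple("2.19.0") > version_tuple("2.2.0"))  # True
```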
      
      N.B. our practice is to bump only the minor and major version numbers.
      This change does not signal a change in that practice.
      
      I'm also hiding this commit from CI so that it only impacts the next
      push (so it will be, say, `2.3.0` if we are currently on `2.2`).
      
      [ci skip]
      78b6ccd5
  10. 20 Jan 2017 (1 commit)
    • J
      Append `.0` to version string · 8bd624bd
      Committed by Jesse Zhang
      This patch adds a `.0` (or as they call it over semver.org, the "patch"
      number) to our version. This makes our version semver-compliant. It also
      makes it abundantly clear that our version number (say `2.2.0`) is not a
      decimal floating point number. ("What is newer? 2.2 or 2.19?")
      
      N.B. our practice is to bump only the minor and major version numbers.
      This change does not signal a change in that practice.
      
      I'm also hiding this commit from CI so that it only impacts the next
      push (so it will be, say, `2.3.0` if we are currently on `2.2`).
      
      [ci skip]
      8bd624bd
  11. 19 Jan 2017 (1 commit)
    • H
      Add config.h, for options that affect binary compatibility. · 76af8194
      Committed by Heikki Linnakangas
      Before this patch, consumers of ORCA had to know out-of-band which
      flags were used to compile ORCA, because e.g. ORCA compiled with
      GPOS_DEBUG would only work if the application using ORCA was also
      compiled with GPOS_DEBUG. This is because many of the structs differ
      depending on GPOS_DEBUG. Same for the architecture flags, like GPOS_i386.
      
      The new config.h file is #included from a few central header files, to
      make sure it gets included in any application that uses other gpos
      headers. We should probably include config.h from all other gpos header
      files, to be safe, but this seems to be enough for ORCA itself and GPDB,
      at least.
      
      Bump version number to 2.2.
      76af8194
  12. 07 Jan 2017 (2 commits)
  13. 06 Jan 2017 (5 commits)
  14. 05 Jan 2017 (3 commits)