提交 · f1e9f6714cf29d9192f844e754901e0995f7413e · 2dot5 / ClickHouse

29 11月, 2020 1 次提交
- R
  
  Backport #17145 to 20.12: Fix unmatched type comparison in KeyCondition · f1e9f671
  由 robot-clickhouse 提交于 11月 29, 2020
  
  f1e9f671
13 11月, 2020 1 次提交
- A
  
  Fix verbatim partition pruner · 9961182e
  由 Amos Bird 提交于 11月 09, 2020
  
  9961182e
10 11月, 2020 3 次提交
- N
  
  Add comments. Update ActionsDAG::Index · 1db8e773
  由 Nikolai Kochetov 提交于 11月 10, 2020
  
  1db8e773
- A
  
  remove other stringstreams · 5cdfcfb3
  由 Alexander Tokmakov 提交于 11月 09, 2020
  
  5cdfcfb3
- N
  
  Empty commit. · e41b1ae5
  由 Nikolai Kochetov 提交于 11月 09, 2020
  
  e41b1ae5
09 11月, 2020 1 次提交
- N
  
  Update after merge. · 8c4db34f
  由 Nikolai Kochetov 提交于 11月 09, 2020
  
  8c4db34f
07 11月, 2020 1 次提交
- A
  
  Fix "server failed to start" error · fd84d163
  由 Alexey Milovidov 提交于 11月 07, 2020
  
  fd84d163
06 11月, 2020 2 次提交
- A
  
  Pruning is different from counting · 2b0085c1
  由 Amos Bird 提交于 11月 06, 2020
  
  2b0085c1
- A
  
  Transform single point · aa436a3c
  由 Amos Bird 提交于 11月 06, 2020
  
  aa436a3c
03 11月, 2020 1 次提交
- N
  
  Refactor ExpressionActions [Part 3] · 07a7c46b
  由 Nikolai Kochetov 提交于 11月 03, 2020
  
  07a7c46b
20 10月, 2020 1 次提交
- N
  
  Fixing build. · bc58637e
  由 Nikolai Kochetov 提交于 10月 19, 2020
  
  bc58637e
09 10月, 2020 1 次提交
- N
  
  Use ColumnWithTypeAndName as function argument instead of Block. · a7fb2e38
  由 Nikolai Kochetov 提交于 10月 09, 2020
  
  a7fb2e38
08 10月, 2020 1 次提交
- A
  
  Extend trivial count optimization. · 86721610
  由 Amos Bird 提交于 9月 21, 2020
  
  86721610
13 9月, 2020 2 次提交
- A
  
  Fix empty key segfault · 5cc8fd39
  由 Amos Bird 提交于 9月 13, 2020
  
  5cc8fd39
- A
  
  Binary operator monotonicity · 34b9547c
  由 Amos Bird 提交于 9月 05, 2020
  
  34b9547c
03 8月, 2020 3 次提交
- A
  
  Better code · 4ed0bf3a
  由 Alexey Milovidov 提交于 8月 03, 2020
  
  4ed0bf3a
- A
  
  Fix assertion in KeyCondition · 3c489ce1
  由 Alexey Milovidov 提交于 8月 02, 2020
  
  3c489ce1
- A
  
  Fix bad code · 5f808aa5
  由 Alexey Milovidov 提交于 8月 02, 2020
  
  5f808aa5
30 7月, 2020 1 次提交
- A
  
  fix wrong index analysis with functions · 4c266d1e
  由 Anton Popov 提交于 7月 29, 2020
  
  4c266d1e
23 7月, 2020 1 次提交
- A
  
  Refactoring: extract TreeOptimizer from SyntaxAnalyzer (#12645) · 2afd123e
  由 Artem Zuikov 提交于 7月 22, 2020
  
  2afd123e
21 7月, 2020 1 次提交
- N
  
  Remove mutable from RPNElement. · 12c5e376
  由 Nikolai Kochetov 提交于 7月 21, 2020
  
  12c5e376
12 7月, 2020 2 次提交

Allow conditions outside of PK with exact range · 8784994d

由 Ivan Babrou 提交于 7月 11, 2020

Conditions that are outside of PK are marked as `unknown` in `KeyCondition`,
so it's safe to allow them, as long as they are always combined by `AND`.

8784994d

Optimize PK lookup for queries that match exact PK range · d9d8d024

由 Ivan Babrou 提交于 7月 07, 2020

Existing code that looks up marks that match the query has a pathological
case, when most of the part does in fact match the query.

The code works by recursively splitting a part into ranges and then discarding
the ranges that definitely do not match the query, based on primary key.

The problem is that it requires visiting every mark that matches the query,
making the complexity of this sort of look up O(n).

For queries that match exact range on the primary key, we can find
both left and right parts of the range with O(log 2) complexity.

This change implements exactly that.

To engage this optimization, the query must:

* Have a prefix list of the primary key.
* Have only range or single set element constraints for columns.
* Have only AND as a boolean operator.

Consider a table with `(service, timestamp)` as the primary key.

The following conditions will be optimized:

* `service = 'foo'`
* `service = 'foo' and timestamp >= now() - 3600`
* `service in ('foo')`
* `service in ('foo') and timestamp >= now() - 3600 and timestamp <= now`

The following will fall back to previous lookup algorithm:

* `timestamp >= now() - 3600`
* `service in ('foo', 'bar') and timestamp >= now() - 3600`
* `service = 'foo'`

Note that the optimization won't engage when PK has a range expression
followed by a point expression, since in that case the range is not continuous.

Trace query logging provides the following messages types of messages,
each representing a different kind of PK usage for a part:

```
Used optimized inclusion search over index for part 20200711_5710108_5710108_0 with 9 steps
Used generic exclusion search over index for part 20200711_5710118_5710228_5 with 1495 steps
Not using index on part 20200710_5710473_5710473_0
```

Number of steps translates to computational complexity.

Here's a comparison for before and after for a query over 24h of data:

```
Read 4562944 rows, 148.05 MiB in 45.19249672 sec., 100966 rows/sec., 3.28 MiB/sec.
Read 4183040 rows, 135.78 MiB in 0.196279627 sec., 21311636 rows/sec., 691.75 MiB/sec.
```

This is especially useful for queries that read data in order
and terminate early to return "last X things" matching a query.

See #11564 for more thoughts on this.

d9d8d024

10 7月, 2020 1 次提交
- A
  Avoid exception when negative or floating point constant is used in WHERE... · 276b3a02
  由 Alexey Milovidov 提交于 7月 10, 2020
```
Avoid exception when negative or floating point constant is used in WHERE condition for indexed tables #11905
```
  276b3a02
05 7月, 2020 1 次提交

ILIKE operator (#12125) · 8c3417fb

由 myrrc 提交于 7月 05, 2020

* Integrated CachingAllocator into MarkCache

* fixed build errors

* reset func hotfix

* upd: Fixing build

* updated submodules links

* fix 2

* updating grabber allocator proto

* updating lost work

* updating CMake to use concepts

* some other changes to get it building (integration into MarkCache)

* further integration into caches

* updated Async metrics, fixed some build errors

* and some other errors revealing

* added perfect forwarding to some functions

* fix: forward template

* fix: constexpr modifier

* fix: FakePODAllocator missing member func

* updated PODArray constructor taking alloc params

* fix: PODArray overload with n restored

* fix: FakePODAlloc duplicating alloc() func

* added constexpr variable for alloc_tag_t

* split cache values by allocators, provided updates

* fix: memcpy

* fix: constexpr modifier

* fix: noexcept modifier

* fix: alloc_tag_t for PODArray constructor

* fix: PODArray copy ctor with different alloc

* fix: resize() signature

* updating to lastest working master

* syncing with 273267

* first draft version

* fix: update Searcher to case-insensitive

* added ILIKE test

* fixed style errors, updated test, split like and ilike,  added notILike

* replaced inconsistent comments

* fixed show tables ilike

* updated missing test cases

* regenerated ya.make

* Update 01355_ilike.sql
Co-authored-by: Nmyrrc <me-clickhouse@myrrec.space>
Co-authored-by: Nalexey-milovidov <milovidov@yandex-team.ru>

8c3417fb

01 7月, 2020 1 次提交
- N
  
  Rewrite Set lookup to make it more readable · 3854ce6d
  由 Nicolae Vartolomei 提交于 7月 01, 2020
  
  3854ce6d
30 6月, 2020 1 次提交

Try fix pk in tuple performance · 8f184518

由 Nicolae Vartolomei 提交于 6月 30, 2020

Possible approach for fixing #10574

The problem is that prepared sets are built correctly, it is a hash map of key -> set
where key is a hash of AST and list of data types (when we a list of
tuples of literals).

However, when the key is built from the index to try and find if there
exists a prepared set that would match it looks for data types of the
primary key (see how data_types is populated) because the primary key
has only one field (v in my example) it can not find the prepared set.

The patch looks for any prepared indexes where data types match for the
subset of fields found in primary key, we are not interested in other
fields anyway for the purpose of primary key pruning.

8f184518

15 6月, 2020 2 次提交
- A
  
  Split file for better build times · 8dac30ae
  由 Alexey Milovidov 提交于 6月 14, 2020
  
  8dac30ae
- A
  
  Allow comparison with String in index analysis; simplify code #11630 · f6c52fe1
  由 Alexey Milovidov 提交于 6月 14, 2020
  
  f6c52fe1
08 6月, 2020 2 次提交
- A
  
  Remove log debug · 23549399
  由 alesapin 提交于 6月 08, 2020
  
  23549399
- A
  
  Fix some bugs · 2226f79f
  由 alesapin 提交于 6月 08, 2020
  
  2226f79f
02 6月, 2020 1 次提交

Fuzzing-related changes. · 0a5cc96b

由 Alexander Kuzmenkov 提交于 6月 02, 2020

* More LOGICAL_ERROR
* Proper cloning of some Asts
* Field::safeGet for user-supplied values

0a5cc96b

30 5月, 2020 1 次提交
- A
  
  Fix issue #11286; add a test · 8c882147
  由 Alexey Milovidov 提交于 5月 30, 2020
  
  8c882147
11 5月, 2020 1 次提交
- A
  
  Use src_type for convertion in KeyCondition · 330f0632
  由 Andrew Onyshchuk 提交于 5月 10, 2020
  
  330f0632
22 4月, 2020 1 次提交
- A
  
  Checkpoint · 1e325a9f
  由 Alexey Milovidov 提交于 4月 22, 2020
  
  1e325a9f
06 4月, 2020 1 次提交
- A
  
  improve performance of index analysis with monotonic functions · 79024d73
  由 Anton Popov 提交于 4月 02, 2020
  
  79024d73
03 4月, 2020 1 次提交
- I
  
  dbms/ → src/ · 06446b4f
  由 Ivan Lezhankin 提交于 4月 03, 2020
  
  06446b4f
02 4月, 2020 1 次提交
- I
  Move all folders inside /dbms one level up (#9974) · 97f2a221
  由 Ivan 提交于 4月 02, 2020
```
* Move some code outside dbms/src folder
* Fix paths
```
  97f2a221
19 3月, 2020 2 次提交
- A
  
  Added most of bugprone checks · c20853ee
  由 Alexey Milovidov 提交于 3月 18, 2020
  
  c20853ee
- A
  
  Added most of bugprone checks · bceb246d
  由 Alexey Milovidov 提交于 3月 18, 2020
  
  bceb246d