Commits · b3397fb7b58a81d82ef7b8753087b7506804353e · apache / pulsar

28 Aug, 2019 2 commits

Enforce checkstyle in the pulsar sql module (#4882) · b3397fb7

Authored by Sergii Zhevzhyk on Aug 05, 2019

The checkstyle plugin was added to the pulsar sql module to enforce the defined style. All violations were fixed:

- Ordering of imports.
- Formatting of the code.
- Absent Javadoc comments.
- Other small issues.
(cherry picked from commit f6fee1c6)


Reuse ManagedLedgerFactory instances across SQL queries (#4813) · c781e405

Authored by Matteo Merli on Jul 25, 2019

(cherry picked from commit f88ea9df)


14 Sep, 2018 1 commit

optimizing throughput in Pulsar Presto connector (#2564) · 6ef7acaf

Authored by Boyang Jerry Peng on Sep 13, 2018

### Motivation

1. Currently, the Presto Pulsar connector reads synchronously from BookKeeper once it has run out of entries to process. In other words, we process a batch of entries and only then read more. Ideally, reading and processing should happen in parallel to increase throughput.

2. Each split initializes its own ManagedLedgerFactory/BookKeeper client. We really need only one BookKeeper client, shared among threads.

### Modifications
1. Rewrote the logic in the Presto Pulsar connector to read asynchronously and process in parallel.

2. Cache the ManagedLedgerFactory so that it can be used across splits.
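
The two modifications can be illustrated with a minimal, self-contained sketch. It is not the connector's actual code: `ReadAheadSketch`, `SharedClient`, `clientFor`, and `processSplit` are hypothetical names, and the simulated `readBatch` merely stands in for a BookKeeper read. The sketch assumes the general pattern is (a) request the next batch before processing the current one, and (b) keep one heavyweight client per configuration in a process-wide cache.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.atomic.AtomicInteger;

final class ReadAheadSketch {

    /** Hypothetical stand-in for a heavyweight client such as a ManagedLedgerFactory. */
    static final class SharedClient {
        static final AtomicInteger CREATED = new AtomicInteger();

        SharedClient() {
            CREATED.incrementAndGet();
        }

        // Simulates reading one batch of entries; returns an empty list once the data is exhausted.
        List<Integer> readBatch(int offset, int batchSize, int total) {
            List<Integer> batch = new ArrayList<>();
            for (int i = offset; i < Math.min(offset + batchSize, total); i++) {
                batch.add(i);
            }
            return batch;
        }
    }

    // One client per configuration key, shared by every split in this process.
    private static final Map<String, SharedClient> CLIENTS = new ConcurrentHashMap<>();

    static SharedClient clientFor(String configKey) {
        // computeIfAbsent creates the client at most once per key, even under contention.
        return CLIENTS.computeIfAbsent(configKey, k -> new SharedClient());
    }

    /** Sums all entries of a split, requesting batch n+1 before processing batch n. */
    static long processSplit(String configKey, int total, int batchSize, ExecutorService io) {
        SharedClient client = clientFor(configKey);
        long sum = 0;
        int offset = 0;
        CompletableFuture<List<Integer>> pending =
                CompletableFuture.supplyAsync(() -> client.readBatch(0, batchSize, total), io);
        while (true) {
            List<Integer> batch = pending.join();   // wait for the read issued earlier
            if (batch.isEmpty()) {
                break;
            }
            offset += batch.size();
            final int next = offset;
            // Start the next read before doing any processing on the current batch.
            pending = CompletableFuture.supplyAsync(() -> client.readBatch(next, batchSize, total), io);
            for (int value : batch) {               // "processing" overlaps with the pending read
                sum += value;
            }
        }
        return sum;
    }

    public static void main(String[] args) {
        ExecutorService io = Executors.newFixedThreadPool(2);
        try {
            long first = processSplit("cluster-a", 10, 3, io);
            long second = processSplit("cluster-a", 5, 2, io);
            System.out.println(first + " " + second + " clients=" + SharedClient.CREATED.get());
        } finally {
            io.shutdown();
        }
    }
}
```

Both calls in `main` use the same key, so they share a single `SharedClient`; a real implementation would also need to bound the number of in-flight reads and close cached clients on shutdown, which this sketch leaves out.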

### Result

I see about a 2x throughput improvement on a single node as well as on a cluster (2 brokers, 3 bookies, 4 Presto workers including the coordinator) on AWS.


07 Aug, 2018 1 commit

PIP-19: Initial implementation of Pulsar SQL (#2265) · 461647a2

Authored by Boyang Jerry Peng on Aug 06, 2018

* adding module for Pulsar SQL

* renaming presto pulsar package

* fixing pom and configs

* fixing pom

* using project version variable in pom

* adding comments
