From b91a78a900e0415a7389070eded77936a26d8ff0 Mon Sep 17 00:00:00 2001 From: Lisa Owen Date: Tue, 11 Sep 2018 16:57:50 -0700 Subject: [PATCH] docs - updates for ext tbl filter pushdown default value on (#5734) --- gpdb-doc/dita/admin_guide/external/pxf-overview.xml | 2 +- gpdb-doc/markdown/pxf/hbase_pxf.html.md.erb | 4 +++- gpdb-doc/markdown/pxf/hive_pxf.html.md.erb | 2 +- gpdb-doc/markdown/pxf/using_pxf.html.md.erb | 5 +++-- 4 files changed, 8 insertions(+), 5 deletions(-) diff --git a/gpdb-doc/dita/admin_guide/external/pxf-overview.xml b/gpdb-doc/dita/admin_guide/external/pxf-overview.xml index 64693cc3ac..ac955f6388 100644 --- a/gpdb-doc/dita/admin_guide/external/pxf-overview.xml +++ b/gpdb-doc/dita/admin_guide/external/pxf-overview.xml @@ -5,7 +5,7 @@ Data managed by your organization may already reside in external sources. The Greenplum Platform Extension Framework (PXF) provides access to this external data via built-in connectors that map an external data source to a Greenplum Database table definition.

PXF is installed with HDFS, Hive, and HBase connectors. These connectors enable you to read external HDFS file system and Hive and HBase table data stored in text, Avro, JSON, RCFile, Parquet, SequenceFile, and ORC formats.

- PXF supports filter pushdown in the Hive connector only. + PXF supports filter pushdown in the Hive and HBase connectors only.

The Greenplum Platform Extension Framework includes a protocol C library and a Java service. After you configure and initialize PXF, you start a single PXF JVM process on each Greenplum Database segment host. This long-running process concurrently serves multiple query requests.

For detailed information about the architecture of and using PXF, refer to the Greenplum Platform Extension Framework (PXF) documentation.

diff --git a/gpdb-doc/markdown/pxf/hbase_pxf.html.md.erb b/gpdb-doc/markdown/pxf/hbase_pxf.html.md.erb index 1d8cedfa43..b652ac2961 100644 --- a/gpdb-doc/markdown/pxf/hbase_pxf.html.md.erb +++ b/gpdb-doc/markdown/pxf/hbase_pxf.html.md.erb @@ -4,7 +4,9 @@ title: Accessing HBase Table Data with PXF Apache HBase is a distributed, versioned, non-relational database on Hadoop. -The PXF HBase connector reads data stored in an HBase table. This section describes how to use the PXF HBase connector. +The PXF HBase connector reads data stored in an HBase table. The HBase connector supports filter pushdown. + +This section describes how to use the PXF HBase connector. ## Prerequisites diff --git a/gpdb-doc/markdown/pxf/hive_pxf.html.md.erb b/gpdb-doc/markdown/pxf/hive_pxf.html.md.erb index 7af2fe0eb0..29d9483367 100644 --- a/gpdb-doc/markdown/pxf/hive_pxf.html.md.erb +++ b/gpdb-doc/markdown/pxf/hive_pxf.html.md.erb @@ -668,7 +668,7 @@ To take advantage of PXF partition filtering pushdown, the Hive and PXF partitio **Note:** The Hive connector filters only on partition columns, not on other table attributes. -PXF filter pushdown is disabled by default. You configure PXF filter pushdown as described in [Configuring Filter Pushdown](using_pxf.html#filter-pushdown). +PXF filter pushdown is enabled by default. You configure PXF filter pushdown as described in [Configuring Filter Pushdown](using_pxf.html#filter-pushdown). ### Example: Using the Hive Profile to Access Partitioned Homogenous Data diff --git a/gpdb-doc/markdown/pxf/using_pxf.html.md.erb b/gpdb-doc/markdown/pxf/using_pxf.html.md.erb index 6be198f25e..33cd49f10e 100644 --- a/gpdb-doc/markdown/pxf/using_pxf.html.md.erb +++ b/gpdb-doc/markdown/pxf/using_pxf.html.md.erb @@ -92,7 +92,7 @@ GRANT INSERT ON PROTOCOL pxf TO bill; PXF supports filter pushdown. When filter pushdown is enabled, the constraints from the `WHERE` clause of a `SELECT` query can be extracted and passed to the external data source for filtering. This process can improve query performance, and can also reduce the amount of data that is transferred to Greenplum Database. -You enable or disable filter pushdown for all external table protocols, including `pxf`, by setting the `gp_external_enable_filter_pushdown` server configuration parameter. The default value of this configuration parameter is `off`; set it to `on` to enable filter pushdown. For example: +You enable or disable filter pushdown for all external table protocols, including `pxf`, by setting the `gp_external_enable_filter_pushdown` server configuration parameter. The default value of this configuration parameter is `on`; set it to `off` to disable filter pushdown. For example: ``` sql SHOW gp_external_enable_filter_pushdown; @@ -104,6 +104,7 @@ SET gp_external_enable_filter_pushdown TO 'on'; PXF accesses data sources using different connectors, and filter pushdown support is determined by the specific connector implementation. The following PXF connectors support filter pushdown: - Hive Connector +- HBase Connector PXF filter pushdown can be used with these data types: @@ -126,7 +127,7 @@ To summarize, all of the following criteria must be met for filter pushdown to o * You enable external table filter pushdown by setting the `gp_external_enable_filter_pushdown` server configuration parameter to `'on'`. * The Greenplum Database protocol that you use to access external data source must support filter pushdown. The `pxf` external table protocol supports pushdown. * The external data source that you are accessing must support pushdown. For example, HBase and Hive support pushdown. -* For queries on external tables that you create with the `pxf` protocol, the underlying PXF connector must also support filter pushdown. For example, only the PXF Hive connector supports pushdown. +* For queries on external tables that you create with the `pxf` protocol, the underlying PXF connector must also support filter pushdown. For example, the PXF Hive and HBase connectors support pushdown. ## PXF Profiles -- GitLab