Commit b1af1f5a · Author: Lisa Owen · Committer: David Yozie

docs - discuss pxf column projection (#7247)

* docs - discuss pxf column projection

* add note about filter serialization

* column projection is always enabled

* edits requested by david
Parent 0837cd6a
......@@ -10,6 +10,7 @@
<a href="/6-0/pxf/intro_pxf.html" format="markdown">Introduction to PXF</a>
<ul>
<li><a href="/6-0/pxf/filter_push.html" format="markdown">About PXF Filter Pushdown</a></li>
<li><a href="/6-0/pxf/col_project.html" format="markdown">About Column Projection in PXF</a></li>
</ul>
</li>
<li class="has_submenu">
......
---
title: About Column Projection in PXF
---
PXF supports column projection, and it is always enabled. With column projection, only the columns required by a `SELECT` query on an external table are returned from the external data source. Column projection can improve query performance and reduce the amount of data that is transferred to Greenplum Database.
**Note:** Some external data sources do not support column projection. If a query accesses a data source that does not support column projection, the query is instead executed without it, and the data is filtered after it is transferred to Greenplum Database.
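The difference between the two cases can be illustrated with a small conceptual model. This is only a sketch in Python, not PXF code: the sample rows and the `read_source` and `run_select` helpers are hypothetical names invented for this illustration. It shows that the query result is the same either way, while the number of values transferred differs.

```python
# Conceptual model only. Nothing here is part of PXF; all names are hypothetical.
ROWS = [
    {"id": 1, "name": "alice", "city": "Oslo", "total": 10.5},
    {"id": 2, "name": "bob", "city": "Lima", "total": 7.25},
]


def read_source(rows, required_columns, source_supports_projection):
    """Return the rows as they would be transferred from the external source."""
    if source_supports_projection:
        # The source returns only the columns that the query needs.
        return [{c: row[c] for c in required_columns} for row in rows]
    # The source returns every column of every row.
    return [dict(row) for row in rows]


def run_select(rows, required_columns, source_supports_projection):
    """Return (query result, number of column values transferred)."""
    transferred = read_source(rows, required_columns, source_supports_projection)
    values_transferred = sum(len(row) for row in transferred)
    # Any unneeded columns are dropped after the transfer.
    result = [{c: row[c] for c in required_columns} for row in transferred]
    return result, values_transferred


with_projection = run_select(ROWS, ["id", "total"], True)
without_projection = run_select(ROWS, ["id", "total"], False)
print(with_projection[1], without_projection[1])  # prints: 4 8
```

In this toy data set, projection halves the number of values transferred because the query needs two of the four columns.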
Column projection is automatically enabled for the `pxf` external table protocol. PXF accesses external data sources using different connectors, and column projection support is also determined by the specific connector implementation. The following PXF connector and profile combinations support column projection on read operations:
- PXF Hive Connector, `HiveORC` profile
- PXF JDBC Connector, `Jdbc` profile
- PXF Hadoop and Object Store Connectors, `hdfs:parquet`, `adl:parquet`, `gs:parquet`, `s3:parquet`, and `wasbs:parquet` profiles
**Note:** PXF may disable column projection in cases where it cannot successfully serialize a query filter; for example, when the `WHERE` clause resolves to a `boolean` type.
To summarize, all of the following criteria must be met for column projection to occur:
* The external data source that you are accessing must support column projection. For example, Hive supports column projection for ORC-format data, and certain SQL databases support column projection.
* The underlying PXF connector and profile implementation must also support column projection. For example, the PXF Hive and JDBC connector profiles identified above support column projection, as do the PXF connectors that support reading Parquet data.
* PXF must be able to serialize the query filter.
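The three criteria above combine as a simple logical AND. The following Python sketch restates them as a checklist. It is not how PXF implements the decision: the function name and its parameters are hypothetical, the profile names are copied from the list earlier in this topic, and matching the profile name exactly is an assumption of this sketch.

```python
# Illustrative checklist only. This is not PXF code; the names are hypothetical.
# Profile names are taken from the supported list in this topic.
PROJECTION_PROFILES = {
    "HiveORC",
    "Jdbc",
    "hdfs:parquet",
    "adl:parquet",
    "gs:parquet",
    "s3:parquet",
    "wasbs:parquet",
}


def column_projection_occurs(profile, source_supports_projection, filter_serializable):
    """Return True only when all three criteria are met."""
    return (
        source_supports_projection              # criterion 1: the data source
        and profile in PROJECTION_PROFILES      # criterion 2: connector and profile
        and filter_serializable                 # criterion 3: the query filter
    )


print(column_projection_occurs("hdfs:parquet", True, True))   # prints: True
print(column_projection_occurs("hdfs:parquet", True, False))  # prints: False
```

If any one criterion fails, the query still runs. It is executed without column projection, as described in the notes above.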
......@@ -58,3 +58,10 @@ PXF may require additional information to read or write certain data formats. Yo
**Note:** When you create a PXF external table, you cannot use the `HEADER` option in your formatter specification.
## <a id="other"></a> Other PXF Features
Certain PXF connectors and profiles support filter pushdown and column projection. Refer to the following topics for detailed information about this support:
- [About PXF Filter Pushdown](filter_push.html)
- [About Column Projection in PXF](col_project.html)