提交 · d26888bc4d1e539a82f21382b0000fe5bbf889d9 · Greenplum / Gpdb

18 7月, 2013 2 次提交

Move checking an explicit VARIADIC "any" argument into the parser. · d26888bc

由 Andrew Dunstan 提交于 7月 18, 2013

This is more efficient and simpler . It does mean that an untyped NULL
can no longer be used in such cases, which should be mentioned in
Release Notes, but doesn't seem a terrible loss. The workaround is to
cast the NULL to some array type.

Pavel Stehule, reviewed by Jeevan Chalke.

d26888bc

Fix end-of-loop optimization in pglz_find_match() function. · 3f2adace

由 Heikki Linnakangas 提交于 7月 17, 2013

After the recent pglz optimization patch, the next/prev pointers in the
hash table are never NULL, INVALID_ENTRY_PTR is used to represent invalid
entries instead. The end-of-loop check in pglz_find_match() function didn't
get the memo. The result was the same from a correctness point of view, but
because the NULL-check would never fail, the tiny optimization turned into
a pessimization.

Reported by Stephen Frost, using Coverity scanner.

3f2adace

17 7月, 2013 1 次提交

Implement the FILTER clause for aggregate function calls. · b560ec1b

由 Noah Misch 提交于 7月 16, 2013

This is SQL-standard with a few extensions, namely support for
subqueries and outer references in clause expressions.

catversion bump due to change in Aggref and WindowFunc.

David Fetter, reviewed by Dean Rasheed.

b560ec1b

09 7月, 2013 1 次提交

Fix bool abuse · 7888c612

由 Peter Eisentraut 提交于 7月 08, 2013

path_encode's "closed" argument used to take three values: TRUE, FALSE,
or -1, while being of type bool.  Replace that with a three-valued enum
for more clarity.

7888c612

05 7月, 2013 1 次提交

Expose the estimation of number of changed tuples since last analyze · c87ff71f

由 Magnus Hagander 提交于 7月 05, 2013

This value, now pg_stat_all_tables.n_mod_since_analyze, was already
tracked and used by autovacuum, but not exposed to the user.

Mark Kirkwood, review by Laurenz Albe

c87ff71f

04 7月, 2013 1 次提交

Get rid of pg_class.reltoastidxid. · 2ef085d0

由 Fujii Masao 提交于 7月 04, 2013

Treat TOAST index just the same as normal one and get the OID
of TOAST index from pg_index but not pg_class.reltoastidxid.
This change allows us to handle multiple TOAST indexes, and
which is required infrastructure for upcoming
REINDEX CONCURRENTLY feature.

Patch by Michael Paquier, reviewed by Andres Freund and me.

2ef085d0

02 7月, 2013 2 次提交

Use an MVCC snapshot, rather than SnapshotNow, for catalog scans. · 568d4138

由 Robert Haas 提交于 7月 02, 2013

SnapshotNow scans have the undesirable property that, in the face of
concurrent updates, the scan can fail to see either the old or the new
versions of the row. In many cases, we work around this by requiring
DDL operations to hold AccessExclusiveLock on the object being
modified; in some cases, the existing locking is inadequate and random
failures occur as a result. This commit doesn't change anything
related to locking, but will hopefully pave the way to allowing lock
strength reductions in the future.

The major issue has held us back from making this change in the past
is that taking an MVCC snapshot is significantly more expensive than
using a static special snapshot such as SnapshotNow. However, testing
of various worst-case scenarios reveals that this problem is not
severe except under fairly extreme workloads. To mitigate those
problems, we avoid retaking the MVCC snapshot for each new scan;
instead, we take a new snapshot only when invalidation messages have
been processed. The catcache machinery already requires that
invalidation messages be sent before releasing the related heavyweight
lock; else other backends might rely on locally-cached data rather
than scanning the catalog at all. Thus, making snapshot reuse
dependent on the same guarantees shouldn't break anything that wasn't
already subtly broken.

Patch by me. Review by Michael Paquier and Andres Freund.

568d4138

Add timezone offset output option to to_char() · 7408c5d2

由 Bruce Momjian 提交于 7月 01, 2013

Add ability for to_char() to output the timezone's UTC offset (OF).  We
already have the ability to return the timezone abbeviation (TZ/tz).
Per request from Andrew Dunstan

7408c5d2

01 7月, 2013 1 次提交

Optimize pglz compressor for small inputs. · 031cc55b

由 Heikki Linnakangas 提交于 7月 01, 2013

The pglz compressor has a significant startup cost, because it has to
initialize to zeros the history-tracking hash table. On a 64-bit system, the
hash table was 64kB in size. While clearing memory is pretty fast, for very
short inputs the relative cost of that was quite large.

This patch alleviates that in two ways. First, instead of storing pointers
in the hash table, store 16-bit indexes into the hist_entries array. That
slashes the size of the hash table to 1/2 or 1/4 of the original, depending
on the pointer width. Secondly, adjust the size of the hash table based on
input size. For very small inputs, you don't need a large hash table to
avoid collisions.

Review by Amit Kapila.

031cc55b

26 6月, 2013 1 次提交

Renovate display of non-ASCII messages on Windows. · 5f538ad0

由 Noah Misch 提交于 6月 26, 2013

GNU gettext selects a default encoding for the messages it emits in a
platform-specific manner; it uses the Windows ANSI code page on Windows
and follows LC_CTYPE on other platforms. This is inconvenient for
PostgreSQL server processes, so realize consistent cross-platform
behavior by calling bind_textdomain_codeset() on Windows each time we
permanently change LC_CTYPE. This primarily affects SQL_ASCII databases
and processes like the postmaster that do not attach to a database,
making their behavior consistent with PostgreSQL on non-Windows
platforms. Messages from SQL_ASCII databases use the encoding implied
by the database LC_CTYPE, and messages from non-database processes use
LC_CTYPE from the postmaster system environment. PlatformEncoding
becomes unused, so remove it.

Make write_console() prefer WriteConsoleW() to write() regardless of the
encodings in use. In this situation, write() will invariably mishandle
non-ASCII characters.

elog.c has assumed that messages conform to the database encoding.
While usually true, this does not hold for SQL_ASCII and MULE_INTERNAL.
Introduce MessageEncoding to track the actual encoding of message text.
The present consumers are Windows-specific code for converting messages
to UTF16 for use in system interfaces. This fixes the appearance in
Windows event logs and consoles of translated messages from SQL_ASCII
processes like the postmaster. Note that SQL_ASCII inherently disclaims
a strong notion of encoding, so non-ASCII byte sequences interpolated
into messages by %s may yet yield a nonsensical message. MULE_INTERNAL
has similar problems at present, albeit for a different reason: its lack
of libiconv support or a conversion to UTF8.

Consequently, one need no longer restart Windows with a different
Windows ANSI code page to broadly test backend logging under a given
language. Changing the user's locale ("Format") is enough. Several
accounts can simultaneously run postmasters under different locales, all
correctly logging localized messages to Windows event logs and consoles.

Alexander Law and Noah Misch

5f538ad0

16 6月, 2013 1 次提交

Use WaitLatch, not pg_usleep, for delaying in pg_sleep(). · a64ca63e

由 Tom Lane 提交于 6月 15, 2013

This avoids platform-dependent behavior wherein pg_sleep() might fail to be
interrupted by statement timeout, query cancel, SIGTERM, etc.  Also, since
there's no reason to wake up once a second any more, we can reduce the
power consumption of a sleeping backend a tad.

Back-patch to 9.3, since use of SA_RESTART for SIGALRM makes this a bigger
issue than it used to be.

a64ca63e

13 6月, 2013 3 次提交

Avoid reading past datum end when parsing JSON. · 66008564

由 Noah Misch 提交于 6月 12, 2013

Several loops in the JSON parser examined a byte in memory just before
checking whether its address was in-bounds, so they could read one byte
beyond the datum's allocation.  A SIGSEGV is possible.  New in 9.3, so
no back-patch.

66008564

Improve updatability checking for views and foreign tables. · dc3eb563

由 Tom Lane 提交于 6月 12, 2013

Extend the FDW API (which we already changed for 9.3) so that an FDW can
report whether specific foreign tables are insertable/updatable/deletable.
The default assumption continues to be that they're updatable if the
relevant executor callback function is supplied by the FDW, but finer
granularity is now possible.  As a test case, add an "updatable" option to
contrib/postgres_fdw.

This patch also fixes the information_schema views, which previously did
not think that foreign tables were ever updatable, and fixes
view_is_auto_updatable() so that a view on a foreign table can be
auto-updatable.

initdb forced due to changes in information_schema views and the functions
they rely on.  This is a bit unfortunate to do post-beta1, but if we don't
change this now then we'll have another API break for FDWs when we do
change it.

Dean Rasheed, somewhat editorialized on by Tom Lane

dc3eb563

Fix unescaping of JSON Unicode escapes, especially for non-UTF8. · 78ed8e03

由 Andrew Dunstan 提交于 6月 12, 2013

Per discussion on -hackers. We treat Unicode escapes when unescaping
them similarly to the way we treat them in PostgreSQL string literals.
Escapes in the ASCII range are always accepted, no matter what the
database encoding. Escapes for higher code points are only processed in
UTF8 databases, and attempts to process them in other databases will
result in an error. \u0000 is never unescaped, since it would result in
an impermissible null byte.

78ed8e03

08 6月, 2013 1 次提交

Handle Unicode surrogate pairs correctly when processing JSON. · 94e3311b

由 Andrew Dunstan 提交于 6月 08, 2013

In 9.2, Unicode escape sequences are not analysed at all other than
to make sure that they are in the form \uXXXX. But in 9.3 many of the
new operators and functions try to turn JSON text values into text in
the server encoding, and this includes de-escaping Unicode escape
sequences. This processing had not taken into account the possibility
that this might contain a surrogate pair to designate a character
outside the BMP. That is now handled correctly.

This also enforces correct use of surrogate pairs, something that is not
done by the type's input routines. This fact is noted in the docs.

94e3311b

03 6月, 2013 1 次提交
- S
  Additional spelling corrections · f129615f
  由 Stephen Frost 提交于 6月 03, 2013
```
A few more minor spelling corrections, no functional changes.

Thom Brown
```
  f129615f
01 6月, 2013 2 次提交
- S
  Minor spelling fixes · c9fc28a7
  由 Stephen Frost 提交于 6月 01, 2013
```
Fix a few spelling mistakes.

Per bug report #8193 from Lajos Veres.
```
  c9fc28a7
- N
  Don't emit non-canonical empty arrays in array_remove(). · 97c4d9b7
  由 Noah Misch 提交于 5月 31, 2013
```
Dean Rasheed
```
  97c4d9b7
30 5月, 2013 1 次提交

pgindent run for release 9.3 · 9af4159f

由 Bruce Momjian 提交于 5月 29, 2013

This is the first run of the Perl-based pgindent script.  Also update
pgindent instructions.

9af4159f

17 5月, 2013 1 次提交

Fix crash when trying to display a NOTIFY rule action. · 403bd6a1

由 Tom Lane 提交于 5月 16, 2013

Fixes oversight in commit 2ffa740b.
Per report from Josh Kupershmidt.

I think we've broken this case before, so let's add a regression test
this time.

403bd6a1

12 5月, 2013 1 次提交

Fix to_number() to correctly ignore thousands separator when it's '.'. · 35d50b52

由 Tom Lane 提交于 5月 11, 2013

The existing code in NUM_numpart_from_char has hard-wired logic to treat
'.' as decimal point, even when we're using a locale-aware format string
and the locale says that '.' is the thousands separator.  This results in
clearly wrong answers in FM mode (where we must be able to identify the
decimal point location), as per bug report from Patryk Kordylewski.

Since the initialization code in NUM_prepare_locale already sets up
Np->decimal as either the locale decimal-point string or "." depending
on which decimal-point format code was used, there's really no need to
have any extra logic at all in NUM_numpart_from_char: we only need to
test for a match to Np->decimal.

(Note: AFAICS there's nothing in here that explicitly checks for thousands
separators --- rather, any unmatched character is silently skipped over.
That's pretty bogus IMO but it's not the issue being complained of.)

This is a longstanding bug, but it's possible that some existing apps
are depending on '.' being recognized as decimal point even when using
a D format code.  Hence, no back-patch.  We should probably list this
as a potential incompatibility in the 9.3 release notes.

35d50b52

11 5月, 2013 1 次提交

Guard against input_rows == 0 in estimate_num_groups(). · 69cc60dc

由 Tom Lane 提交于 5月 10, 2013

This case doesn't normally happen, because the planner usually clamps
all row estimates to at least one row; but I found that it can arise
when dealing with relations excluded by constraints. Without a defense,
estimate_num_groups() can return zero, which leads to divisions by zero
inside the planner as well as assertion failures in the executor.

An alternative fix would be to change set_dummy_rel_pathlist() to make
the size estimate for a dummy relation 1 row instead of 0, but that seemed
pretty ugly; and probably someday we'll want to drop the convention that
the minimum rowcount estimate is 1 row.

Back-patch to 8.4, as the problem can be demonstrated that far back.

69cc60dc

07 5月, 2013 1 次提交

Move materialized views' is-populated status into their pg_class entries. · 1d6c72a5

由 Tom Lane 提交于 5月 06, 2013

Previously this state was represented by whether the view's disk file had
zero or nonzero size, which is problematic for numerous reasons, since it's
breaking a fundamental assumption about heap storage. This was done to
allow unlogged matviews to revert to unpopulated status after a crash
despite our lack of any ability to update catalog entries post-crash.
However, this poses enough risk of future problems that it seems better to
not support unlogged matviews until we can find another way. Accordingly,
revert that choice as well as a number of existing kluges forced by it
in favor of creating a pg_class.relispopulated flag column.

1d6c72a5

02 5月, 2013 1 次提交
- A
  Use correct length to convert json unicode escapes. · 5f8b4319
  由 Andrew Dunstan 提交于 5月 01, 2013
```
Bug reported on IRC - fix due to Andrew Gierth.
```
  5f8b4319
20 4月, 2013 1 次提交

Clean up references to SQL92 · cc26ea9f

由 Peter Eisentraut 提交于 4月 20, 2013

In most cases, these were just references to the SQL standard in
general.  In a few cases, a contrast was made between SQL92 and later
standards -- those have been kept unchanged.

cc26ea9f

16 4月, 2013 1 次提交
- A
  Correct handling of NULL arguments in json funcs. · 728ec973
  由 Andrew Dunstan 提交于 4月 15, 2013
```
Per gripe from Tom Lane.
```
  728ec973
10 4月, 2013 1 次提交

Create a distinction between a populated matview and a scannable one. · 52e6e33a

由 Kevin Grittner 提交于 4月 09, 2013

The intent was that being populated would, long term, be just one
of the conditions which could affect whether a matview was
scannable; being populated should be necessary but not always
sufficient to scan the relation.  Since only CREATE and REFRESH
currently determine the scannability, names and comments
accidentally conflated these concepts, leading to confusion.

Also add missing locking for the SQL function which allows a
test for scannability, and fix a modularity violatiion.

Per complaints from Tom Lane, although its not clear that these
will satisfy his concerns.  Hopefully this will at least better
frame the discussion.

52e6e33a

09 4月, 2013 1 次提交

Support indexing of regular-expression searches in contrib/pg_trgm. · 3ccae48f

由 Tom Lane 提交于 4月 09, 2013

This works by extracting trigrams from the given regular expression,
in generally the same spirit as the previously-existing support for
LIKE searches, though of course the details are far more complicated.

Currently, only GIN indexes are supported. We might be able to make
it work with GiST indexes later.

The implementation includes adding API functions to backend/regex/
to provide a view of the search NFA created from a regular expression.
These functions are meant to be generic enough to be supportable in
a standalone version of the regex library, should that ever happen.

Alexander Korotkov, reviewed by Heikki Linnakangas and Tom Lane

3ccae48f

05 4月, 2013 1 次提交
- A
  Fix off by one error in JSON extract path code. · e75feb28
  由 Andrew Dunstan 提交于 4月 04, 2013
```
Bug report by David Wheeler, diagnosis assistance from Tom Lane.
```
  e75feb28
04 4月, 2013 1 次提交

Avoid updating our PgBackendStatus entry when track_activities is off. · f7b0006f

由 Tom Lane 提交于 4月 03, 2013

The point of turning off track_activities is to avoid this reporting
overhead, but a thinko in commit 4f42b546
caused pgstat_report_activity() to perform half of its updates anyway.
Fix that, and also make sure that we clear all the now-disabled fields
when transitioning to the non-reporting state.

f7b0006f

30 3月, 2013 1 次提交

Add new JSON processing functions and parser API. · a570c98d

由 Andrew Dunstan 提交于 3月 29, 2013

The JSON parser is converted into a recursive descent parser, and
exposed for use by other modules such as extensions. The API provides
hooks for all the significant parser event such as the beginning and end
of objects and arrays, and providing functions to handle these hooks
allows for fairly simple construction of a wide variety of JSON
processing functions. A set of new basic processing functions and
operators is also added, which use this API, including operations to
extract array elements, object fields, get the length of arrays and the
set of keys of a field, deconstruct an object into a set of key/value
pairs, and create records from JSON objects and arrays of objects.

Catalog version bumped.

Andrew Dunstan, with some documentation assistance from Merlin Moncure.

a570c98d

29 3月, 2013 1 次提交

Add sql_drop event for event triggers · 473ab40c

由 Alvaro Herrera 提交于 3月 27, 2013

This event takes place just before ddl_command_end, and is fired if and
only if at least one object has been dropped by the command.  (For
instance, DROP TABLE IF EXISTS of a table that does not in fact exist
will not lead to such a trigger firing).  Commands that drop multiple
objects (such as DROP SCHEMA or DROP OWNED BY) will cause a single event
to fire.  Some firings might be surprising, such as
ALTER TABLE DROP COLUMN.

The trigger is fired after the drop has taken place, because that has
been deemed the safest design, to avoid exposing possibly-inconsistent
internal state (system catalogs as well as current transaction) to the
user function code.  This means that careful tracking of object
identification is required during the object removal phase.

Like other currently existing events, there is support for tag
filtering.

To support the new event, add a new pg_event_trigger_dropped_objects()
set-returning function, which returns a set of rows comprising the
objects affected by the command.  This is to be used within the user
function code, and is mostly modelled after the recently introduced
pg_identify_object() function.

Catalog version bumped due to the new function.

Dimitri Fontaine and Álvaro Herrera
Review by Robert Haas, Tom Lane

473ab40c

21 3月, 2013 2 次提交

Fix "element <@ range" cost estimation. · f897c474

由 Heikki Linnakangas 提交于 3月 21, 2013

The statistics-based cost estimation patch for range types broke that, by
incorrectly assuming that the left operand of all range oeprators is a
range. That lead to a "type x is not a range type" error. Because it took so
long for anyone to notice, add a regression test for that case.

We still don't do proper statistics-based cost estimation for that, so you
just get a default constant estimate. We should look into implementing that,
but this patch at least fixes the regression.

Spotted by Tom Lane, when testing query from Josh Berkus.

f897c474

Allow extracting machine-readable object identity · f8348ea3

由 Alvaro Herrera 提交于 3月 20, 2013

Introduce pg_identify_object(oid,oid,int4), which is similar in spirit
to pg_describe_object but instead produces a row of machine-readable
information to uniquely identify the given object, without resorting to
OIDs or other internal representation. This is intended to be used in
the event trigger implementation, to report objects being operated on;
but it has usefulness of its own.

Catalog version bumped because of the new function.

f8348ea3

15 3月, 2013 1 次提交

Extend format() to handle field width and left/right alignment. · 73e7025b

由 Tom Lane 提交于 3月 14, 2013

This change adds some more standard sprintf() functionality to format().

Pavel Stehule, reviewed by Dean Rasheed and Kyotaro Horiguchi

73e7025b

14 3月, 2013 1 次提交

Add cost estimation of range @> and <@ operators. · 59d0bf9d

由 Heikki Linnakangas 提交于 3月 14, 2013

The estimates are based on the existing lower bound histogram, and a new
histogram of range lengths.

Bump catversion, because the range length histogram now needs to be present
in statistic slot kind 6, or you get an error on @> and <@ queries. (A
re-ANALYZE would be enough to fix that, though)

Alexander Korotkov, with some refactoring by me.

59d0bf9d

11 3月, 2013 1 次提交

JSON generation improvements. · 38fb4d97

由 Andrew Dunstan 提交于 3月 10, 2013

This adds the following:

    json_agg(anyrecord) -> json
    to_json(any) -> json
    hstore_to_json(hstore) -> json (also used as a cast)
    hstore_to_json_loose(hstore) -> json

The last provides heuristic treatment of numbers and booleans.

Also, in json generation, if any non-builtin type has a cast to json,
that function is used instead of the type's output function.

Andrew Dunstan, reviewed by Steve Singer.

Catalog version bumped.

38fb4d97

08 3月, 2013 1 次提交
- H
  SP-GiST support of the range adjacent operator -|- · 23f10b64
  由 Heikki Linnakangas 提交于 3月 08, 2013
```
Alexander Korotkov, reviewed by Jeff Davis.
```
  23f10b64
06 3月, 2013 1 次提交

Fix to_char() to use ASCII-only case-folding rules where appropriate. · 80b011ef

由 Tom Lane 提交于 3月 05, 2013

formatting.c used locale-dependent case folding rules in some code paths
where the result isn't supposed to be locale-dependent, for example
to_char(timestamp, 'DAY'). Since the source data is always just ASCII
in these cases, that usually didn't matter ... but it does matter in
Turkish locales, which have unusual treatment of "i" and "I". To confuse
matters even more, the misbehavior was only visible in UTF8 encoding,
because in single-byte encodings we used pg_toupper/pg_tolower which
don't have locale-specific behavior for ASCII characters. Fix by providing
intentionally ASCII-only case-folding functions and using these where
appropriate. Per bug #7913 from Adnan Dursun. Back-patch to all active
branches, since it's been like this for a long time.

80b011ef

05 3月, 2013 1 次提交

Fix overflow check in tm2timestamp (this time for sure). · 542eeba2

由 Tom Lane 提交于 3月 04, 2013

I fixed this code back in commit 841b4a2d, but didn't think carefully
enough about the behavior near zero, which meant it improperly rejected
1999-12-31 24:00:00. Per report from Magnus Hagander.

542eeba2