create_operator.l 7.5 KB
Newer Older
1 2
.\" This is -*-nroff-*-
.\" XXX standard disclaimer belongs here....
3
.\" $Header: /cvsroot/pgsql/src/man/Attic/create_operator.l,v 1.9 1999/05/20 03:21:02 tgl Exp $
4
.TH "CREATE OPERATOR" SQL 11/05/95 PostgreSQL PostgreSQL
5
.SH NAME
B
Bruce Momjian 已提交
6
create operator - define a new user operator
7 8 9 10 11 12 13 14 15 16
.SH SYNOPSIS
.nf
\fBcreate operator\fR operator_name
	\fB(\fR[ \fBleftarg\fR \fB=\fR type-1 ]
	 [ \fB,\fR \fBrightarg\fR \fB=\fR type-2 ]
	 , \fBprocedure =\fR func_name
	 [\fB, commutator =\fR com_op ]
	 [\fB, negator =\fR neg_op ]
	 [\fB, restrict =\fR res_proc ]
	 [\fB, join =\fR join_proc ]
17 18 19
	 [\fB, hashes\fR]
	 [\fB, sort1 =\fR left_sort_op ]
	 [\fB, sort2 =\fR right_sort_op ]
20 21 22 23 24 25 26 27 28 29 30 31 32 33
	\fB)\fR
.\" \fB"arg is ("
.\" type [
.\" \fB,
.\" type ]
.\" \fB)
.fi
.SH DESCRIPTION
This command defines a new user operator,
.IR "operator_name" .
The user who defines an operator becomes its owner.
.PP
The
.IR "operator_name"
34
is a sequence of punctuation characters.  The following
35 36
characters are valid for single-character operator names:
.nf
B
Bruce Momjian 已提交
37 38

.ce 1
39
~ ! @ # % ^ & ` ?
B
Bruce Momjian 已提交
40

41 42 43 44 45
.fi
If the operator name is more than one character long, it may consist
of any combination of the above characters or the following additional
characters:
.nf
B
Bruce Momjian 已提交
46 47

.ce 1
48
| $ : + - * / < > =
B
Bruce Momjian 已提交
49

50
.fi
B
Bruce Momjian 已提交
51 52
The operator "!=" is mapped to "<>" on input, and they are
therefore equivalent.
53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69
.PP
At least one of
.IR leftarg
and
.IR rightarg
must be defined.  For binary operators, both should be defined. For
right unary operators, only
.IR arg1
should be defined, while for left unary operators only
.IR arg2
should be defined.
.PP
The name of the operator,
.IR operator_name ,
can be composed of symbols only.  Also, the
.IR func_name
procedure must have been previously defined using
B
cleanup  
Bruce Momjian 已提交
70
.IR create_function(l)
71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
and must have one or two arguments.
.PP
.\" that multiple instances of the 
.\" operator must be be evaluated
.\" For example, consider the area-intersection operator,
.\" .q A,
.\" and the following expression:
.\" .(l
.\" MYBOXES2.description A \*(lq0,0,1,1\*(rq A MYBOXES.description
.\" .)l
.\" .in .5i
.\" The associativity flag indicates that
.\" .(l
.\" (MYBOXES2.description A \*(lq0,0,1,1\*(rq) A MYBOXES.description
.\" .)l
.\" .in .5i
.\" is the same as
.\" .(l
.\" MYBOXES2.description A (\*(lq0,0,1,1\*(rq A MYBOXES.description).
.\" .)l
91 92 93 94
The commutator operator should be identified if one exists,
so that Postgres can reverse the order of the operands if it wishes.
For example, the operator
area-less-than, >>>, would probably have a commutator operator,
95
area-greater-than, <<<.  Hence, the query optimizer
96 97
could freely convert:
.nf
B
Bruce Momjian 已提交
98 99

.ce 1
100
"0,0,1,1"::box >>> MYBOXES.description
B
Bruce Momjian 已提交
101

102 103 104
.fi
to
.nf
B
Bruce Momjian 已提交
105 106

.ce 1
107
MYBOXES.description <<< "0,0,1,1"::box
B
Bruce Momjian 已提交
108

109 110 111 112
.fi
This allows the execution code to always use the latter representation
and simplifies the query optimizer somewhat.
.PP
113
Similarly, if there is a negator operator then it should be identified.
114 115
Suppose that an operator, area-equal, ===,
exists, as well as an area not equal, !==.
116
The negator link allows the query optimizer to simplify
117
.nf
B
Bruce Momjian 已提交
118 119 120 121

.ce 1
NOT MYBOXES.description === "0,0,1,1"::box

122 123 124
.fi
to
.nf
B
Bruce Momjian 已提交
125 126

.ce 1
127
MYBOXES.description !== "0,0,1,1"::box
B
Bruce Momjian 已提交
128

129 130 131
.fi
If a commutator operator name is supplied, Postgres searches for it in
the catalog.  If it is found and it does not yet have a commutator
132 133
itself, then the commutator's entry is updated to have the newly created
operator as its commutator.  This applies to the negator, as well.
134 135 136 137 138
.PP
This is to allow the definition of two operators that are the
commutators or the negators of each other.  The first operator should
be defined without a commutator or negator (as appropriate).  When the
second operator is defined, name the first as the commutator or
139 140
negator.  The first will be updated as a side effect.  (As of Postgres 6.5,
it also works to just have both operators refer to each other.)
141
.PP
142
The next three specifications are present to support the query optimizer
143 144 145
in performing joins.  Postgres can always evaluate a join (i.e.,
processing a clause with two tuple variables separated by an operator
that returns a boolean) by iterative substitution [WONG76].  In
146
addition, Postgres can use a hash-join algorithm
147
along the lines of [SHAP86]; however, it must know whether this
148 149 150 151 152 153 154
strategy is applicable.
The current hash-join algorithm
is only correct for operators that represent equality tests;
furthermore, equality of the datatype must mean bitwise equality
of the representation of the type.  (For example, a datatype that
contains unused bits that don't matter for equality tests could
not be hashjoined.)
155 156
The
.BR hashes
157 158
flag indicates to the query optimizer that a hash join may safely be
used with this operator.
159 160
.PP
Similarly, the two sort operators indicate to the query optimizer
161 162 163 164 165 166
whether merge-sort is a usable join strategy and which operators should
be used to sort the two operand classes.
Sort operators should only be provided for an equality
operator, and they should refer to less-than operators for the
left and right side data types respectively.
.PP
167 168 169 170 171 172 173 174 175 176
If other join strategies are found to be practical, Postgres will change
the optimizer and run-time system to use them and will require
additional specification when an operator is defined.  Fortunately,
the research community invents new join strategies infrequently, and
the added generality of user-defined join strategies was not felt to
be worth the complexity involved.
.PP
The last two pieces of the specification are present so the query
optimizer can estimate result sizes.  If a clause of the form:
.nf
B
Bruce Momjian 已提交
177 178

.ce 1
179
MYBOXES.description <<< "0,0,1,1"::box
B
Bruce Momjian 已提交
180

181 182 183 184 185
.fi
is present in the qualification, then Postgres may have to estimate the
fraction of the instances in MYBOXES that satisfy the clause.  The
function res_proc must be a registered function (meaning it is already
defined using
B
cleanup  
Bruce Momjian 已提交
186
.IR create_function(l))
187
which accepts arguments of the correct data types and returns a
188
floating point number.  The query optimizer simply calls this
B
Bruce Momjian 已提交
189
function, passing the parameter "0,0,1,1"
190 191 192 193 194 195 196 197 198 199 200
and multiplies the result by the relation size to get the desired
expected number of instances.
.PP
Similarly, when the operands of the operator both contain instance
variables, the query optimizer must estimate the size of the resulting
join.  The function join_proc will return another floating point
number which will be multiplied by the cardinalities of the two
classes involved to compute the desired expected result size.
.PP
The difference between the function
.nf
B
Bruce Momjian 已提交
201 202

.ce 1
203
my_procedure_1 (MYBOXES.description, "0,0,1,1"::box)
B
Bruce Momjian 已提交
204

205 206 207
.fi
and the operator
.nf
B
Bruce Momjian 已提交
208 209

.ce 1
210
MYBOXES.description === "0,0,1,1"::box
B
Bruce Momjian 已提交
211

212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230
.fi
is that Postgres attempts to optimize operators and can decide to use an
index to restrict the search space when operators are involved.
However, there is no attempt to optimize functions, and they are
performed by brute force.  Moreover, functions can have any number of
arguments while operators are restricted to one or two.
.SH EXAMPLE
.nf
--
--The following command defines a new operator,
--area-equality, for the BOX data type.
--
create operator === (
	leftarg = box,
	rightarg = box,
	procedure = area_equal_procedure,
	commutator = ===,
	negator = !==,
	restrict = area_restriction_procedure,
231
	join = area_join_procedure,
232
	hashes,
233 234
	sort1 = <<<,
	sort2 = <<<)
235 236 237
.\"	arg is (box, box)
.fi
.SH "SEE ALSO"
238 239
create_function(l),
drop_operator(l).
240 241 242 243
.SH BUGS
Operator names cannot be composed of alphabetic characters in 
Postgres.
.PP
244 245 246 247
If an operator is defined before its commuting operator has been defined,
a dummy entry for the commutator (with invalid oprproc field) will be placed
in the system catalogs.  This entry will be overridden when the commutator
is eventually defined.