create_operator.l 7.3 KB
Newer Older
1 2
.\" This is -*-nroff-*-
.\" XXX standard disclaimer belongs here....
3
.\" $Header: /cvsroot/pgsql/src/man/Attic/create_operator.l,v 1.5 1998/06/23 17:52:32 momjian Exp $
4
.TH "CREATE OPERATOR" SQL 11/05/95 PostgreSQL PostgreSQL
5
.SH NAME
B
Bruce Momjian 已提交
6
create operator - define a new user operator
7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35
.SH SYNOPSIS
.nf
\fBcreate operator\fR operator_name
	\fB(\fR[ \fBleftarg\fR \fB=\fR type-1 ]
	 [ \fB,\fR \fBrightarg\fR \fB=\fR type-2 ]
	 , \fBprocedure =\fR func_name
	 [\fB, commutator =\fR com_op ]
	 [\fB, negator =\fR neg_op ]
	 [\fB, restrict =\fR res_proc ]
	 [\fB, hashes\fR]
	 [\fB, join =\fR join_proc ]
	 [\fB, sort =\fR sor_op1 {\fB,\fR sor_op2 } ]
	\fB)\fR
.\" \fB"arg is ("
.\" type [
.\" \fB,
.\" type ]
.\" \fB)
.fi
.SH DESCRIPTION
This command defines a new user operator,
.IR "operator_name" .
The user who defines an operator becomes its owner.
.PP
The
.IR "operator_name"
is a sequence of up to sixteen punctuation characters.  The following
characters are valid for single-character operator names:
.nf
B
Bruce Momjian 已提交
36 37

.ce 1
38
~ ! @ # % ^ & ` ?
B
Bruce Momjian 已提交
39

40 41 42 43 44
.fi
If the operator name is more than one character long, it may consist
of any combination of the above characters or the following additional
characters:
.nf
B
Bruce Momjian 已提交
45 46

.ce 1
47
| $ : + - * / < > =
B
Bruce Momjian 已提交
48

49
.fi
B
Bruce Momjian 已提交
50 51
The operator "!=" is mapped to "<>" on input, and they are
therefore equivalent.
52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68
.PP
At least one of
.IR leftarg
and
.IR rightarg
must be defined.  For binary operators, both should be defined. For
right unary operators, only
.IR arg1
should be defined, while for left unary operators only
.IR arg2
should be defined.
.PP
The name of the operator,
.IR operator_name ,
can be composed of symbols only.  Also, the
.IR func_name
procedure must have been previously defined using
69
.IR "create_function" (l)
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96
and must have one or two arguments.
.PP
.\" that multiple instances of the 
.\" operator must be be evaluated
.\" For example, consider the area-intersection operator,
.\" .q A,
.\" and the following expression:
.\" .(l
.\" MYBOXES2.description A \*(lq0,0,1,1\*(rq A MYBOXES.description
.\" .)l
.\" .in .5i
.\" The associativity flag indicates that
.\" .(l
.\" (MYBOXES2.description A \*(lq0,0,1,1\*(rq) A MYBOXES.description
.\" .)l
.\" .in .5i
.\" is the same as
.\" .(l
.\" MYBOXES2.description A (\*(lq0,0,1,1\*(rq A MYBOXES.description).
.\" .)l
The commutator operator is present so that Postgres can reverse the order
of the operands if it wishes.  For example, the operator
area-less-than, >>>, would have a commutator operator,
area-greater-than, <<<.  Suppose that an operator, area-equal, ===,
exists, as well as an area not equal, !==.  Hence, the query optimizer
could freely convert:
.nf
B
Bruce Momjian 已提交
97 98

.ce 1
99
"0,0,1,1"::box >>> MYBOXES.description
B
Bruce Momjian 已提交
100

101 102 103
.fi
to
.nf
B
Bruce Momjian 已提交
104 105

.ce 1
106
MYBOXES.description <<< "0,0,1,1"::box
B
Bruce Momjian 已提交
107

108 109 110 111 112 113
.fi
This allows the execution code to always use the latter representation
and simplifies the query optimizer somewhat.
.PP
The negator operator allows the query optimizer to convert
.nf
B
Bruce Momjian 已提交
114 115 116 117

.ce 1
NOT MYBOXES.description === "0,0,1,1"::box

118 119 120
.fi
to
.nf
B
Bruce Momjian 已提交
121 122

.ce 1
123
MYBOXES.description !== "0,0,1,1"::box
B
Bruce Momjian 已提交
124

125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146
.fi
If a commutator operator name is supplied, Postgres searches for it in
the catalog.  If it is found and it does not yet have a commutator
itself, then the commutator's entry is updated to have the current
(new) operator as its commutator.  This applies to the negator, as
well.
.PP
This is to allow the definition of two operators that are the
commutators or the negators of each other.  The first operator should
be defined without a commutator or negator (as appropriate).  When the
second operator is defined, name the first as the commutator or
negator.  The first will be updated as a side effect.
.PP
The next two specifications are present to support the query optimizer
in performing joins.  Postgres can always evaluate a join (i.e.,
processing a clause with two tuple variables separated by an operator
that returns a boolean) by iterative substitution [WONG76].  In
addition, Postgres is planning on implementing a hash-join algorithm
along the lines of [SHAP86]; however, it must know whether this
strategy is applicable.  For example, a hash-join algorithm is usable
for a clause of the form:
.nf
B
Bruce Momjian 已提交
147 148

.ce 1
149
MYBOXES.description === MYBOXES2.description
B
Bruce Momjian 已提交
150

151 152 153
.fi
but not for a clause of the form:
.nf
B
Bruce Momjian 已提交
154 155

.ce 1
156
MYBOXES.description <<< MYBOXES2.description.
B
Bruce Momjian 已提交
157

158 159 160 161 162 163 164 165 166 167 168 169
.fi
The
.BR hashes
flag gives the needed information to the query optimizer concerning
whether a hash join strategy is usable for the operator in question.
.PP
Similarly, the two sort operators indicate to the query optimizer
whether merge-sort is a usable join strategy and what operators should
be used to sort the two operand classes.  For the === clause above,
the optimizer must sort both relations using the operator, <<<.  On
the other hand, merge-sort is not usable with the clause:
.nf
B
Bruce Momjian 已提交
170 171

.ce 1
172
MYBOXES.description <<< MYBOXES2.description
B
Bruce Momjian 已提交
173

174 175 176 177 178 179 180 181 182 183 184
.fi
If other join strategies are found to be practical, Postgres will change
the optimizer and run-time system to use them and will require
additional specification when an operator is defined.  Fortunately,
the research community invents new join strategies infrequently, and
the added generality of user-defined join strategies was not felt to
be worth the complexity involved.
.PP
The last two pieces of the specification are present so the query
optimizer can estimate result sizes.  If a clause of the form:
.nf
B
Bruce Momjian 已提交
185 186

.ce 1
187
MYBOXES.description <<< "0,0,1,1"::box
B
Bruce Momjian 已提交
188

189 190 191 192 193 194 195 196
.fi
is present in the qualification, then Postgres may have to estimate the
fraction of the instances in MYBOXES that satisfy the clause.  The
function res_proc must be a registered function (meaning it is already
defined using
.IR "define function" (l))
which accepts one argument of the correct data type and returns a
floating point number.  The query optimizer simply calls this
B
Bruce Momjian 已提交
197
function, passing the parameter "0,0,1,1"
198 199 200 201 202 203 204 205 206 207 208
and multiplies the result by the relation size to get the desired
expected number of instances.
.PP
Similarly, when the operands of the operator both contain instance
variables, the query optimizer must estimate the size of the resulting
join.  The function join_proc will return another floating point
number which will be multiplied by the cardinalities of the two
classes involved to compute the desired expected result size.
.PP
The difference between the function
.nf
B
Bruce Momjian 已提交
209 210

.ce 1
211
my_procedure_1 (MYBOXES.description, "0,0,1,1"::box)
B
Bruce Momjian 已提交
212

213 214 215
.fi
and the operator
.nf
B
Bruce Momjian 已提交
216 217

.ce 1
218
MYBOXES.description === "0,0,1,1"::box
B
Bruce Momjian 已提交
219

220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244
.fi
is that Postgres attempts to optimize operators and can decide to use an
index to restrict the search space when operators are involved.
However, there is no attempt to optimize functions, and they are
performed by brute force.  Moreover, functions can have any number of
arguments while operators are restricted to one or two.
.SH EXAMPLE
.nf
--
--The following command defines a new operator,
--area-equality, for the BOX data type.
--
create operator === (
	leftarg = box,
	rightarg = box,
	procedure = area_equal_procedure,
	commutator = ===,
	negator = !==,
	restrict = area_restriction_procedure,
	hashes,
	join = area-join-procedure,
	sort = <<<, <<<)
.\"	arg is (box, box)
.fi
.SH "SEE ALSO"
245 246
create_function(l),
drop_operator(l).
247 248 249 250 251 252 253 254
.SH BUGS
Operator names cannot be composed of alphabetic characters in 
Postgres.
.PP
If an operator is defined before its commuting operator has been defined
(a case specifically warned against above), a dummy operator with invalid
fields will be placed in the system catalogs.  This may interfere with
the definition of later operators.