Red Hat Database: SQL Guide and Reference

Operators

An operator can be considered to be "syntactic sugar" for a call to an underlying function that does the real work; so you must first create the underlying function before you can create the operator. However, an operator is not merely syntactic sugar, because it carries additional information that helps the query planner optimize queries that use the operator. Much of this section will be devoted to explaining that additional information.

Example

Here is an example of creating an operator for adding two complex numbers. We assume we have already created the definition of type complex. First we need a function that does the work; then we can define the operator:
    CREATE FUNCTION complex_add(complex, complex)
        RETURNS complex
        AS 'PGROOT/tutorial/complex.so'
        LANGUAGE 'c';

    CREATE OPERATOR + (
        leftarg = complex,
        rightarg = complex,
        procedure = complex_add,
        commutator = +
    );

Now we can do:
    SELECT (a + b) AS c FROM test_complex;

            c
    -----------------
     (5.2,6.05)
     (133.42,144.95)
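The table test_complex is not defined in this section. The following is a minimal sketch of one possible setup, assuming two columns named a and b of type complex and assuming that the type's input format matches the "(x,y)" form shown in the output above. The row values are illustrative, chosen so that their sums agree with that output. The final query calls the underlying function directly, which is what the operator form stands for:

```sql
-- Assumed table layout; only the names a, b, and test_complex
-- appear in the query above.
CREATE TABLE test_complex (
    a complex,
    b complex
);

-- Illustrative rows (assumed input syntax for type complex).
INSERT INTO test_complex VALUES ('(1.0, 2.5)', '(4.2, 3.55)');
INSERT INTO test_complex VALUES ('(33.0, 51.4)', '(100.42, 93.55)');

-- Operator form and the equivalent direct function call.
SELECT (a + b) AS c FROM test_complex;
SELECT complex_add(a, b) AS c FROM test_complex;
```

Both queries should return the same rows, since the operator only dispatches to complex_add.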

We have shown how to create a binary operator here. To create a unary operator, omit leftarg (for a left unary operator, which is written before its operand) or rightarg (for a right unary operator, which is written after its operand). The procedure clause and the argument clauses are the only required items in CREATE OPERATOR. The COMMUTATOR clause shown in the example is an optional hint to the query optimizer. Further details about COMMUTATOR and other optimizer hints follow.
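As a sketch of the unary case, the following defines a left unary (prefix) negation operator for complex. The function complex_neg is hypothetical: it is not defined in this section, and the example assumes it has been compiled into the same shared object as complex_add.

```sql
-- Hypothetical C function that negates a complex value (assumed name).
CREATE FUNCTION complex_neg(complex)
    RETURNS complex
    AS 'PGROOT/tutorial/complex.so'
    LANGUAGE 'c';

-- Left unary operator: leftarg is omitted, so only rightarg is given.
CREATE OPERATOR - (
    rightarg = complex,
    procedure = complex_neg
);

-- The operator is written before its single operand.
SELECT (- a) AS neg_a FROM test_complex;
```

No commutator is given here, because commutation is meaningful only for binary operators.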

Operator Optimization Information

A PostgreSQL operator definition can include several optional clauses that tell the system useful things about how the operator behaves. These clauses should be provided whenever appropriate, because they can considerably speed up the execution of queries that use the operator. However, if you provide them, you must be sure that they are right! Incorrect use of an optimization clause can result in backend crashes, subtly wrong output, or other problems. You can always leave out an optimization clause if you are not sure about it; the only consequence is that queries might run slower than they need to.

Additional optimization clauses might be added in future versions of PostgreSQL. The ones described here are all the ones that release 7.2 understands.

The standard selectivity estimators, by operator category, are:

    eqsel for =
    neqsel for <>
    scalarltsel for < or <=
    scalargtsel for > or >=

It might seem a little odd that these are the categories, but they make sense if you think about it. "=" will typically accept only a small fraction of the rows in a table; "<>" will typically reject only a small fraction. "<" will accept a fraction that depends on where the given constant falls in the range of values for that table column (which, it happens, is information collected by ANALYZE and made available to the selectivity estimator). "<=" will accept a slightly larger fraction than "<" for the same comparison constant, but they are close enough to not be worth distinguishing, especially since we are not likely to do better than a rough guess anyhow. Similar remarks apply to ">" and ">=".

You can frequently get away with using either eqsel or neqsel for operators that have very high or very low selectivity, even if they are not really equality or inequality. For example, the approximate-equality geometric operators use eqsel on the assumption that they will usually only match a small fraction of the entries in a table.
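In CREATE OPERATOR, the estimator is named in the RESTRICT clause. A minimal sketch for an equality operator on complex follows. The function complex_eq is hypothetical and assumed to live in the same shared object as complex_add.

```sql
-- Hypothetical equality test for complex values (assumed name).
CREATE FUNCTION complex_eq(complex, complex)
    RETURNS bool
    AS 'PGROOT/tutorial/complex.so'
    LANGUAGE 'c';

-- eqsel tells the planner to expect that "column = constant"
-- matches only a small fraction of the rows.
CREATE OPERATOR = (
    leftarg = complex,
    rightarg = complex,
    procedure = complex_eq,
    restrict = eqsel
);
```

A matching <> operator would name neqsel instead. Omitting the RESTRICT clause is always safe and costs only estimation quality.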

You can use scalarltsel and scalargtsel for comparisons on data types that have some sensible means of being converted into numeric scalars for range comparisons. If possible, add the data type to those understood by the routine convert_to_scalar() in src/backend/utils/adt/selfuncs.c. (Eventually, this routine should be replaced by per-datatype functions identified through a column of the pg_type table; but that has not happened yet.) If you do not do this, things will still work, but the optimizer's estimates will not be as good as they could be.
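For illustration, a less-than operator on complex might name scalarltsel as follows. This is a sketch under assumptions: complex_abs_lt is a hypothetical function, assumed here to compare two complex values by magnitude. Unless complex is also added to convert_to_scalar(), the estimates produced for this operator will be rough, as noted above.

```sql
-- Hypothetical ordering function: true if the magnitude of the first
-- argument is less than that of the second (assumed name and semantics).
CREATE FUNCTION complex_abs_lt(complex, complex)
    RETURNS bool
    AS 'PGROOT/tutorial/complex.so'
    LANGUAGE 'c';

-- scalarltsel is the standard estimator for < and <=.
CREATE OPERATOR < (
    leftarg = complex,
    rightarg = complex,
    procedure = complex_abs_lt,
    restrict = scalarltsel
);
```

The corresponding > and >= operators would name scalargtsel.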

There are additional selectivity functions designed for geometric operators in src/backend/utils/adt/geo_selfuncs.c: areasel, positionsel, and contsel. At this writing these are just stubs, but you may want to use them anyway.