FAQ 33.4 KB
Newer Older
B
Bruce Momjian 已提交
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

                Frequently Asked Questions (FAQ) for PostgreSQL
                                       
   Last updated: Sat Feb 28 10:04:28 EST 1998
   
   Current maintainer: Bruce Momjian (maillist@candle.pha.pa.us)
   
   The most recent version of this document can be viewed at the
   postgreSQL Web site, http://postgreSQL.org.
   
   Linux-specific questions are answered in
   http://postgreSQL.org/docs/faq-linux.shtml.
   
   Irix-specific questions are answered in
   http://postgreSQL.org/docs/faq-irix.shtml.
     _________________________________________________________________
   
Questions answered:

  1) General questions
  
   1.1) What is PostgreSQL?
   1.2) What does PostgreSQL run on?
   1.3) Where can I get PostgreSQL?
   1.4) What's the copyright on PostgreSQL?
   1.5) Support for PostgreSQL
   1.6) Latest release of PostgreSQL
   1.7) Is there a commercial version of PostgreSQL?
   1.8) What documentation is available for PostgreSQL?
   1.9) What version of SQL does PostgreSQL use?
   1.10) Does PostgreSQL work with databases from earlier versions of
B
Bruce Momjian 已提交
32
   PostgreSQL?
B
Bruce Momjian 已提交
33
   1.11) Are there ODBC drivers for PostgreSQL?
B
Bruce Momjian 已提交
34
   1.12) What tools are available for hooking PostgreSQL to Web pages?
B
Bruce Momjian 已提交
35 36 37
   1.13) Does PostgreSQL have a graphical user interface? A report
   generator? A embedded query language interface?
   1.14) What is a good book to learn SQL?
B
Bruce Momjian 已提交
38
   1.15) What languages are available to communicate with PostgreSQL?
B
Bruce Momjian 已提交
39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88
   
  2) Installation/Configuration questions
  
   2.1) initdb doesn't run
   2.2) when I start up the postmaster, I get "FindBackend: could not
   find a backend to execute..." "postmaster: could not find backend to
   execute..."
   2.3) The system seems to be confused about commas, decimal points, and
   date formats.
   2.4) How do I install PostgreSQL somewhere other than
   /usr/local/pgsql?
   2.5) When I run postmaster, I get a Bad System Call core dumped
   message.
   2.6) When I try to start the postmaster, I get IpcMemoryCreate errors.
   2.7) I have changed a source file, but a recompile does not see the
   change?
   2.8) How do I prevent other hosts from accessing my PostgreSQL
   2.9) I can't access the database as the 'root' user.
   2.10) All my servers crash under concurrent table access. Why?
   2.11) How do I tune the database engine for better performance?
   2.12) What debugging features are available in PostgreSQL?
   2.13) How do I enable more than 32 concurrent backends?
   
  3) Operational questions
  
   3.1) Does PostgreSQL support nested subqueries?
   3.2) I've having a lot of problems using rules.
   3.3) I can't seem to write into the middle of large objects reliably.
   3.4) How can I write client applications to PostgreSQL?
   3.5) How do I set up a pg_group?
   3.6) What is the exact difference between binary cursors and normal
   cursors?
   3.7) What is a R-tree index and what is it used for?
   3.8) What is the maximum size for a tuple?
   3.9) I defined indices but my queries don't seem to make use of them.
   Why?
   3.10) How do I do regular expression searches? case-insensitive regexp
   searching?
   3.11) I experienced a server crash during a vacuum. How do I remove
   the lock file?
   3.12) What is the difference between the various character types?
   3.13) In a query, how do I detect if a field is NULL?
   3.14) How do I see how the query optimizer is evaluating my query?
   3.15) How do I create a serial field?
   3.16) What are the pg_psort.XXX files in my database directory?
   3.17) Why can't I connect to my database from another machine?
   3.18) How do I find out what indexes or operations are defined in the
   database?
   3.19) What is the time-warp feature and how does it relate to vacuum?
   3.20) What is an oid? What is a tid?
B
Bruce Momjian 已提交
89
   3.21) What is the meaning of some of the terms used in PostgreSQL?
B
Bruce Momjian 已提交
90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212
   3.22) What is Genetic Query Optimization?
   3.23) How do you remove a column from a table?
   3.24) How do SELECT only the first few rows of a query?
   3.25) Why can't I create a column named "time"?
   3.26) How much database disk space is required to store data from a
   typical flat file?
   
  4) Questions about extending PostgreSQL
  
   4.1) I wrote a user-defined function and when I run it in psql, it
   dumps core.
   4.2) I get messages of the type NOTICE:PortalHeapMemoryFree:
   0x402251d0
   4.3) I've written some nifty new types and functions for PostgreSQL.
   4.4) How do I write a C function to return a tuple?
   
  5) Bugs
  
   5.1) How do I make a bug report?
     _________________________________________________________________
   
Section 1: General Questions

  1.1) What is PostgreSQL?
  
   PostgreSQL is an enhancement of the POSTGRES database management
   system, a next-generation DBMS research prototype. While PostgreSQL
   retains the powerful data model and rich data types of POSTGRES, it
   replaces the PostQuel query language with an extended subset of SQL.
   PostgreSQL is free and the complete source is available.
   
   PostgreSQL development is being performed by a team of Internet
   developers who all subscribe to the PostgreSQL development mailing
   list. The current coordinator is Marc G. Fournier
   (scrappy@postgreSQL.org). (See below on how to join). This team is now
   responsible for all current and future development of PostgreSQL.
   
   The authors of PostgreSQL 1.01 were Andrew Yu and Jolly Chen. Many
   others have contributed to the porting, testing, debugging and
   enhancement of the code. The original Postgres code, from which
   PostgreSQL is derived, was the effort of many graduate students,
   undergraduate students, and staff programmers working under the
   direction of Professor Michael Stonebraker at the University of
   California, Berkeley.
   
   The original name of the software at Berkeley was Postgres. When SQL
   functionality was added in 1995, its name was changed to Postgres95.
   The name was changed at the end of 1996 to PostgreSQL.
   
  1.2) What does PostgreSQL run on?
  
   The authors have compiled and tested PostgreSQL on the following
   platforms(some of these compiles require gcc 2.7.0):
     * aix - IBM on AIX 3.2.5 or 4.x
     * alpha - DEC Alpha AXP on Digital Unix 2.0, 3.2, 4.0
     * BSD44_derived - OSs derived from 4.4-lite BSD (NetBSD, FreeBSD)
     * bsdi - BSD/OS 2.0, 2.01, 2.1, 3.0
     * dgux - DG/UX 5.4R4.11
     * hpux - HP PA-RISC on HP-UX 9.0, 10
     * i386_solaris - i386 Solaris
     * irix5 - SGI MIPS on IRIX 5.3
     * linux - Intel x86 on Linux 2.0 and Linux ELF SPARC on Linux ELF
       PPC on Linux Elf (For non-ELF Linux, see LINUX_ELF below).
     * sco - SCO 3.2v5
     * sparc_solaris - SUN SPARC on Solaris 2.4, 2.5, 2.5.1
     * sunos4 - SUN SPARC on SunOS 4.1.3
     * svr4 - Intel x86 on Intel SVR4 and MIPS
     * ultrix4 - DEC MIPS on Ultrix 4.4
       
   The following platforms have known problems/bugs:
     * nextstep - Motorola MC68K or Intel x86 on NeXTSTEP 3.2
       
  1.3) Where can I get PostgreSQL?
  
   The primary anonymous ftp site for PostgreSQL is:
     * ftp://ftp.postgreSQL.org/pub
       
   A mirror site exists at:
     * ftp://postgres95.vnet.net/pub/postgres95
     * ftp://ftp.luga.or.at/pub/postgres95
     * ftp://cal011111.student.utwente.nl/pub/postgres95
     * ftp://ftp.uni-trier.de/pub/database/rdbms/postgres/postgres95
     * ftp://rocker.sch.bme.hu
       
  1.4) What's the copyright on PostgreSQL?
  
   PostgreSQL is subject to the following COPYRIGHT.
   
   PostgreSQL Data Base Management System
   
   Copyright (c) 1994-6 Regents of the University of California
   
   Permission to use, copy, modify, and distribute this software and its
   documentation for any purpose, without fee, and without a written
   agreement is hereby granted, provided that the above copyright notice
   and this paragraph and the following two paragraphs appear in all
   copies.
   
   IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY
   FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES,
   INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND
   ITS DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN
   ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
   
   THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES,
   INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
   MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE
   PROVIDED HEREUNDER IS ON AN "AS IS" BASIS, AND THE UNIVERSITY OF
   CALIFORNIA HAS NO OBLIGATIONS TO PROVIDE MAINTENANCE, SUPPORT,
   UPDATES, ENHANCEMENTS, OR MODIFICATIONS.
   
  1.5) Support for PostgreSQL
  
   There is no official support for PostgreSQL from the original
   maintainers or from University of California, Berkeley. It is
   maintained through volunteer effort only.
   
   The main mailing list is: questions@postgreSQL.org. It is available
   for discussion o f matters pertaining to PostgreSQL, including but not
   limited to bug reports and fixes. For info on how to subscribe, send a
   mail with the lines in the body (not the subject line)


B
Bruce Momjian 已提交
213 214
        subscribe
        end
B
Bruce Momjian 已提交
215 216 217 218 219 220 221

   to questions-request@postgreSQL.org.
   
   There is also a digest list available. To subscribe to this list, send
   email to: questions-digest-request@postgreSQL.org with a BODY of:


B
Bruce Momjian 已提交
222 223
        subscribe
        end
B
Bruce Momjian 已提交
224 225 226 227 228 229 230 231 232 233 234 235

   Digests are sent out to members of this list whenever the main list
   has received around 30k of messages.
   
   There is a bugs mailing list available. To subscribe to this list,
   send email to bugs-request@postgreSQL.org with a BODY of:
   
   There is also a developers discussion mailing list available. To
   subscribe to this list, send email to hackers-request@postgreSQL.org
   with a BODY of:
   

B
Bruce Momjian 已提交
236 237
        subscribe
        end
B
Bruce Momjian 已提交
238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266

   Additional information about PostgreSQL can be found via the
   PostgreSQL WWW home page at:
   
     http://postgreSQL.org
     
  1.6) Latest release of PostgreSQL
  
   The latest release of PostgreSQL is version 6.3, which was released on
   March 1st, 1998.
   
   We plan to have major releases every three months.
   
  1.7) Is there a commercial version of PostgreSQL?
  
   Illustra Information Technology (a wholly owned subsidiary of Informix
   Software, Inc.) sells an object-relational DBMS called Illustra that
   was originally based on postgres. Illustra has cosmetic similarities
   to PostgreSQL but has more features, is more robust, performs better,
   and offers real documentation and support. On the flip side, it costs
   money. For more information, contact sales@illustra.com
   
  1.8) What documentation is available for PostgreSQL?
  
   A user manual, manual pages, and some small test examples are included
   in the distribution. The sql and built-in manual pages are
   particularly important.
   
   The www page contains pointers to an implementation guide and five
B
Bruce Momjian 已提交
267
   papers written about Postgres design concepts and features.
B
Bruce Momjian 已提交
268 269 270 271 272 273 274 275 276 277 278
   
  1.9) What version of SQL does PostgreSQL use?
  
   PostgreSQL supports a subset of SQL-92. It has most of the important
   constructs but lacks some of the functionality. The most visible
   differences are:
     * no HAVING clause under a GROUP BY
       
   On the other hand, you get to create user-defined types, functions,
   inheritance etc.
   
B
Bruce Momjian 已提交
279 280
  1.10) Does PostgreSQL work with databases from earlier versions of
  PostgreSQL?
B
Bruce Momjian 已提交
281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310
  
   PostgreSQL v1.09 is compatible with databases created with v1.01.
   
   Upgrading to 6.3 from earlier releases requires a dump and restore.
   
   Upgrading to 6.2.1 from pre-6.2 requires a dump and restore.
   
   Upgrading to 6.2.1 from 6.2 does not require a dump, but see the
   appropriate /migration file in the distribution.
   
   Those ugrading from versions earlier than 1.09 must upgrade to 1.09
   first without a dump/reload, then dump the data from 1.09, and then
   load it into 6.2.1 or 6.3 beta.
   
  1.11) Are there ODBC drivers for PostgreSQL?
  
   There are two ODBC drivers available, PostODBC and OpenLink ODBC.
   
   PostODBC is included in the distribution. For all people being
   interested in PostODBC, there are now two mailing lists devoted to the
   discussion of PostODBC. The mailing lists are:
     * postodbc-users@listserv.direct. net
     * postodbc-developers@listse rv.direct.net
       
   these lists are ordinary majordomo mailing lists. You can subscribe by
   sending a mail to:
     * majordomo@listserv.direct.net
       
   OpenLink ODBC is very popular. You can get it from
   http://www.openlinksw.com/postgres.html. It works with our standard
B
Bruce Momjian 已提交
311
   ODBC client software so you'll have PostgreSQL ODBC available on every
B
Bruce Momjian 已提交
312 313 314 315 316 317
   client platform we support (Win, Mac, Unix, VMS).
   
   We will probably be selling this product to people who need
   commercial-quality support, but a freeware version will always be
   available. Questions to postgres95@openlink.co.uk.
   
B
Bruce Momjian 已提交
318
  1.12) What tools are available for hooking PostgreSQL to Web pages?
B
Bruce Momjian 已提交
319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345
  
   A nice introduction to Database-backed Web pages can be seen at:
   http://www.webtools.com
   
   For web integration, PHP is an excellent interface. The URL for that
   is http://www.php.net
   
   PHP is great for simple stuff, but for more complex stuff, some still
   use the perl interface and CGI.pm.
   
   An WWW gatway based on WDB using perl can be downloaded from:
     * http://www.eol.ists.ca/~dunlop/wdb -p95
       
  1.13) Does PostgreSQL have a graphical user interface? A report generator? A
  embedded query language interface?
  
   We have a nice graphical user interface called pgaccess, which is
   shipped as part of the distribtion. Pgaccess also has a report
   generator.
   
   We also have ecpg, which is an embedded SQL query language interface
   for C. This is also included.
   
  1.14) What is a good book to learn SQL?
  
   Many of our users like The Practical SQL Handbook, Bowman et al,
   Addison Wesley.
B
Bruce Momjian 已提交
346 347 348 349 350 351 352 353 354 355 356 357 358
   
  1.15) What languages are available to communicate with PostgreSQL?
  
   We have:
     * C(interfaces/libpq)
     * C++(interfaces/libpq++)
     * Embedded C(interfaces/ecpg)
     * Java(interfaces/jdbc)
     * Perl(interfaces/perl5)
     * ODBC(interfaces/odbc)
     * Python(interfaces/python)
     * TCL(interfaces/libpgtcl)
     * A crude C/4GL(contrib/pginterface)
B
Bruce Momjian 已提交
359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455
     _________________________________________________________________
   
Section 2: Installation Questions

  2.1) initdb doesn't run
  
     * check to see that you have the proper paths set
     * check that the 'postgres' user owns all the right files
     * ensure that there are files in $PGDATA/files, and that they are
       non-empty. If they aren't, then "gmake install" failed for some
       reason
       
  2.2) when I start up the postmaster, I get "FindBackend: could not find a
  backend to execute..." "postmaster: could not find backend to execute..."
  
   You probably do not have the right path set up. The 'postgres'
   executable needs to be in your path.
   
  2.3) The system seems to be confused about commas, decimal points, and date
  formats.
  
   Check your locale configuration. PostgreSQL uses the locale settings
   of the user that ran the postmaster process. Set those accordingly for
   your operating environment.
   
  2.4) How do I install PostgreSQL somewhere other than /usr/local/pgsql?
  
   You need to edit Makefile.global and change POSTGRESDIR accordingly,
   or create a Makefile.custom and define POSTGRESDIR there.
   
  2.5) When I run postmaster, I get a Bad System Call core dumped message.
  
   It could be a variety of problems, but first check to see that you
   have system V extensions installed on your kernel. PostgreSQL requires
   kernel support for shared memory.
   
  2.6) When I try to start the postmaster, I get IpcMemoryCreate errors.
  
   You either do not have shared memory configured properly in kernel or
   you need to enlarge the shared memory available in the kernel. The
   exact amount you need depends on your architecture and how many
   buffers you configure postmaster to run with. For most systems, with
   default buffer sizes, you need a minimum of ~760K.
   
  2.7) I have changed a source file, but a recompile does not see the change?
  
   The Makefiles do not have the proper dependencies for include files.
   You have to do a 'make clean' and then another 'make'.
   
  2.8) How do I prevent other hosts from accessing my PostgreSQL backend?
  
   By default, PostgreSQL only allows connections from the local machine
   using unix domain. You must add the -i flag to the postmaster, and
   enable host-based authentication by modifying the file $PGDATA/pg_hba
   accordingly.
   
  2.9) I can't access the database as the 'root' user.
  
   You should not create database users with user id 0(root). They will
   be unable to access the database. This is a security precaution
   because of the ability of any user to dynamically link object modules
   into the database engine.
   
  2.10) All my servers crash under concurrent table access. Why?
  
   This problem can be caused by a kernel that is not configured to
   support semaphores.
   
  2.11) How do I tune the database engine for better performance?
  
   There are two things that can be done. You can disable fsync() by
   starting the postmaster with a '-o -F' option. This will prevent
   fsync()'s from flushing to disk after every transaction.
   
   You can also use the postmaster -B option to increase the number of
   shared memory buffers shared among the backend processes. If you make
   this parameter too high, the process will not start or crash
   unexpectedly. Each buffer is 8K and the defualt is 64 buffers.
   
   You can also use the postgres -S option to increase the maximum amount
   of memory used by each backend process for temporary sorts. Each
   buffer is 1K and the defualt is 512 buffers.
   
  2.12) What debugging features are available in PostgreSQL?
  
   PostgreSQL has several features that report status information that
   can be valuable for debugging purposes.
   
   First, by running configure with the -enable-cassert option, many
   assert()'s monitor the progress of the backend and halt the program
   when something unexpected occurs.
   
   Both postmaster and postgres have several debug options available.
   First, whenever you start the postmaster, make sure you send the
   standard output and error to a log file, like:


B
Bruce Momjian 已提交
456
        cd /usr/local/pgsql
B
Bruce Momjian 已提交
457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526
        ./bin/postmaster >server.log 2>&1 &

   This will put a server.log file in the top-level PostgreSQL directory.
   This file can contain useful information about problems or errors
   encountered by the server. Postmaster has a -d option that allows even
   more detailed information to be reported. The -d option takes a number
   1-3 that specifies the debug level. Be warned that a debug level of 3
   generates large log files.
   
   You can actuall run the postgres backend from the command line, and
   type your SQL statement directly. This is recommended ONLY for
   debugging purposes. Note that a newline terminates the query, not a
   semicolon. If you have compiled with debugging symbols, you can use a
   debugger to see what is happening. Because the backend was not started
   from the postmaster, it is not running in an identical environment and
   locking/backend interaction problems may not be duplicated. Some
   operating system can attach to a running backend directly to diagnose
   problems.
   
   The postgres program has a -s, -A, -t options that can be very usefull
   for debugging and performance measurements.
   
   You can also compile with profiling to see what functions are taking
   execution time. The backend profile files will be deposited in the
   pgsql/data/base/dbname directory. The client profile file will be put
   in the current directory.
   
   The EXPLAIN command (see this FAQ) allows you to see how PostgreSQL is
   interpreting your query.
   
  2.13) How do I enable more than 32 concurrent backends?
  
   Edit include/storage/sinvaladt.h, and change the value of
   MaxBackendId. In the future, we plan to make this a configurable
   prameter.
     _________________________________________________________________
   
Section 3: PostgreSQL Features

  3.1) Does PostgreSQL support nested subqueries?
  
   Yes.
   
  3.2) I've having a lot of problems using rules.
  
   Currently, the rule system in PostgreSQL has some limitations. It
   works enough to support the view mechanism, but does not handle
   Insert/Update/Delete well.
   
  3.3) I can't seem to write into the middle of large objects reliably.
  
   The Inversion large object system now works perfectly. You should no
   longer have problems with large objects.
   
  3.4) How can I write client applications to PostgreSQL?
  
   PostgreSQL supports a C-callable library interface called libpq as
   well as many others. See the /src/interfaces directory.
   
   Others have contributed a perl interface and a WWW gateway to
   PostgreSQL. See the PostgreSQL home pages for more details.
   
  3.5) How do I set up a pg_group?
  
   Currently, there is no easy interface to set up user groups. You have
   to explicitly insert/update the pg_group table. For example:


        jolly=> insert into pg_group (groname, grosysid, grolist)
        jolly=>     values ('posthackers', '1234', '{5443, 8261}');
B
Bruce Momjian 已提交
527
        INSERT 548224
B
Bruce Momjian 已提交
528
        jolly=> grant insert on foo to group posthackers;
B
Bruce Momjian 已提交
529
        CHANGE
B
Bruce Momjian 已提交
530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593
        jolly=>

   The fields in pg_group are:
     * groname: the group name. This a char16 and should be purely
       alphanumeric. Do not include underscores or other punctuation.
     * grosysid: the group id. This is an int4. This should be unique for
       each group.
     * grolist: the list of pg_user id's that belong in the group. This
       is an int4[].
       
  3.6) What is the exact difference between binary cursors and normal cursors?
  
   See the declare manual page for a description.
   
  3.7) What is a R-tree index and what is it used for?
  
   An r-tree index is used for indexing spatial data. A hash index can't
   handle range searches. A B-tree index only handles range searches in a
   single dimension. R-tree's can handle multi-dimensional data. For
   example, if a R-tree index can be built on an attribute of type
   'point', the system can more efficient answer queries like select all
   points within a bounding rectangle.
   
   The canonical paper that describes the original R-Tree design is:
   
   Guttman, A. "R-Trees: A Dynamic Index Structure for Spatial
   Searching." Proc of the 1984 ACM SIGMOD Int'l Conf on Mgmt of Data,
   45-57.
   
   You can also find this paper in Stonebraker's "Readings in Database
   Systems"
   
   Builtin R-Trees can handle polygons and boxes. In theory, R-trees can
   be extended to handle higher number of dimensions. In practice,
   extending R-trees require a bit of work and we don't currently have
   any documentation on how to do it.
   
  3.8) What is the maximum size for a tuple?
  
   Tuples are limited to 8K bytes. Taking into account system attributes
   and other overhead, one should stay well shy of 8,000 bytes to be on
   the safe side. To use attributes larger than 8K, try using the large
   objects interface.
   
   Tuples do not cross 8k boundaries so a 5k tuple will require 8k of
   storage.
   
  3.9) I defined indices but my queries don't seem to make use of them. Why?
  
   PostgreSQL does not automatically maintain statistics. One has to make
   an explicit 'vacuum' call to update the statistics. After statistics
   are updated, the optimizer has a better shot at using indices. Note
   that the optimizer is limited and does not use indices in some
   circumstances (such as OR clauses). For column-specific optimization
   statistics, use 'vacuum analyze'.
   
   If the system still does not see the index, it is probably because you
   have created an index on a field with the improper *_ops type. For
   example, you have created a CHAR(4) field, but have specified a
   char_ops index type_class.
   
   See the create_index manual page for information on what type classes
   are available. It must match the field type.
   
B
Bruce Momjian 已提交
594
   PostgreSQL does not warn the user when the improper index is created.
B
Bruce Momjian 已提交
595 596 597 598 599 600 601 602 603 604 605 606 607 608 609
   
   Indexes not used for ORDER BY operations.
   
  3.10) How do I do regular expression searches? case-insensitive regexp
  searching?
  
   See the pgbuiltin manual page. Search for regular expression.
   
  3.11) I experienced a server crash during a vacuum. How do I remove the lock
  file?
  
   See the vacuum manual page.
   
  3.12) What is the difference between the various character types?
  
B
Bruce Momjian 已提交
610
Type            Internal Name   Notes
M
Marc G. Fournier 已提交
611
--------------------------------------------------
B
Bruce Momjian 已提交
612 613 614 615 616 617 618 619 620
CHAR            char            1 character   }
CHAR2           char2           2 characters  }
CHAR4           char4           4 characters  } optimized for a fixed length
CHAR8           char8           8 characters  }
CHAR16          char16          16 characters }
CHAR(#)         bpchar          blank padded to the specified fixed length
VARCHAR(#)      varchar         size specifies maximum length, no padding
TEXT            text            length limited only by maximum tuple length
BYTEA           bytea           variable-length array of bytes
B
Bruce Momjian 已提交
621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641

   You need to use the internal name when doing internal operations.
   
   The last four types above are "varlena" types (i.e. the first four
   bytes are the length, followed by the data). CHAR(#) allocates the
   maximum number of bytes no matter how much data is stored in the
   field. TEXT, VARCHAR(#), and BYTEA all have variable length on the
   disk, and because of this, there is a small performance penalty for
   using them. Specifically, the penalty is for access to any columns
   after the first column of this type.
   
  3.13) In a query, how do I detect if a field is NULL?
  
   You test the column with IS NULL and IS NOT NULL.
   
  3.14) How do I see how the query optimizer is evaluating my query?
  
   See the explain manual page.
   
  3.15) How do I create a serial field?
  
B
Bruce Momjian 已提交
642
   PostgreSQL does not allow the user to specifiy a user column as type
B
Bruce Momjian 已提交
643 644 645 646 647 648 649 650 651
   SERIAL. Instead, you can use each row's oid field as a unique value.
   However, if you need to dump and reload the database, you need to use
   pgdump's -o option or COPY's WITH OIDS option to preserver the oids.
   
   We also have a SEQUENCE function that is similar to SERIAL. See the
   create_sequence manual page.
   
   Another valid way of doing this is to create a function:

B
Bruce Momjian 已提交
652 653 654 655 656
        create table my_oids (f1 int4);
        insert into my_oids values (1);
        create function new_oid () returns int4 as
                'update my_oids set f1 = f1 + 1;  select f1 from my_oids; '
        language 'sql';
B
Bruce Momjian 已提交
657 658 659

   then:

B
Bruce Momjian 已提交
660 661
        create table my_stuff (my_key int4, value text);
        insert into my_stuff values (new_oid(), 'hello');
B
Bruce Momjian 已提交
662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704

   However, keep in mind there is a race condition here where one server
   could do the update, then another one do an update, and they both
   could select the same new id. This statement should be performed
   within a transaction.
   
   Yet another way is to use general trigger function autoinc() from
   contrib/spi/autoinc.c.
   
  3.16) What are the pg_psort.XXX files in my database directory?
  
   They are temporary sort files generated by the query executor. For
   example, if a sort needs to be done to satisfy an ORDER BY, some temp
   files are generated as a result of the sort.
   
   If you have no transactions or sorts running at the time, it is safe
   to delete the pg_psort.XXX files.
   
  3.17) Why can't I connect to my database from another machine?
  
   The default configuration allows only unix domain socket connections
   from the local machine. To enable TCP/IP connections, use the
   postmaster -i option You need to add a host entry to the file
   pgsql/data/pg_hba. See the hba_conf manual page.
   
  3.18) How do I find out what indexes or operations are defined in the
  database?
  
   psql has a variety of backslash commands to show such information. Use
   \? to see them.
   
   Also try the file pgsql/src/tutorial/syscat.source. It illustrates
   many of the 'select's needed to get information out of the database
   system tables.
   
  3.19) What is the time-warp feature and how does it relate to vacuum?
  
   PostgreSQL no longer supports this feature. All support code has been
   removed. This was done to improve performance and reduce disk storage
   overhead.
   
  3.20) What is an oid? What is a tid?
  
B
Bruce Momjian 已提交
705 706 707 708 709 710 711
   Oids are PostgreSQL's answer to unique row ids or serial columns.
   Every row that is created in PostgreSQL gets a unique oid. All oids
   generated by initdb are less than 16384 (from
   backend/access/transam.h). All post-initdb (user-created) oids are
   equal or greater that this. All these oids are unique not only within
   a table, or database, but unique within the entire PostgreSQL
   installation.
B
Bruce Momjian 已提交
712
   
B
Bruce Momjian 已提交
713
   PostgreSQL uses oids in its internal system tables to link rows in
B
Bruce Momjian 已提交
714 715 716 717 718 719 720 721 722 723
   separate tables. These oids can be used to identify specific user rows
   and used in joins. It is recommended you use column type oid to store
   oid values. See the sql(l) manual page to see the other internal
   columns. You can create an index on the oid field for faster access.
   
   Tids are used to indentify specific physical rows with block and
   offset values. Tids change after rows are modified or reloaded. They
   are used by index entries to point to physical rows. They can not be
   accessed through sql.
   
B
Bruce Momjian 已提交
724
  3.21) What is the meaning of some of the terms used in PostgreSQL?
B
Bruce Momjian 已提交
725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752
  
   Some of the source code and older documentation use terms that have
   more common usage. Here are some:
     * row, record, tuple
     * attribute, field, column
     * table, class
     * retrieve, select
     * replace, update
     * append, insert
     * oid, serial value
     * portal, cursor
     * range variable, table name, table alias
       
   Please let me know if you think of any more.
   
  3.22) What is Genetic Query Optimization?
  
   The GEQO module in PostgreSQL is intended to solve the query
   optimization problem of joining many tables by means of a Genetic
   Algorithm (GA). It allows the handling of large join queries through
   non-exhaustive search.
   
   For further information see README.GEQO <utesch@aut.tu-freiberg.de>.
   
  3.23) How do you remove a column from a table?
  
   We do not support ALTER TABLE DROP COLUMN, but do this:

753
        SELECT ...  -- select all columns but the one you want to remove
B
Bruce Momjian 已提交
754 755 756 757
        INTO TABLE new_table
        FROM old_table;
        DROP TABLE old_table;
        ALTER TABLE new_table RENAME TO old_table;
B
Bruce Momjian 已提交
758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781

  3.24) How do SELECT only the first few rows of a query?
  
   See the fetch manual page.
   
   This only prevents all row results from being transfered to the
   client. The entire query must be evaluated, even if you only want just
   first few rows. Consider a query that has and ORDER BY. There is no
   way to return any rows until the entire query is evaluated and sorted.
   
  3.25) Why can't I create a column named "time"?
  
   6.2.1 has added some new restricted keywords as we make PostgreSQL
   more ANSI-92 compilant. The next release will have this restriction
   removed. There is a patch on ftp.postgresql.org that will allow this
   feature now.
   
  3.26)How much database disk space is required to store data from a typical
  flat file?
  
   Consider a file with 300,000 lines with two integers on each line. The
   flat file is 2.4MB. The size of the PostgreSQL database file
   containing this data can be estimated:

B
Bruce Momjian 已提交
782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798
40 bytes + each row header (approximate)
 8 bytes + two int fields @ 4 bytes each
 4 bytes + pointer on page to tuple
-------- =
52 bytes per row

The data page size in PostgreSQL is 8192(8k) bytes, so:

8192 bytes per page
-------------------  =  157 rows per database page (rounded up)
 52 bytes per row

300000 data rows
-----------------  =   1911 database pages
157 rows per page

1911 database pages * 8192 bytes per page  =  15,654,912 or 15.5MB
B
Bruce Momjian 已提交
799

B
Bruce Momjian 已提交
800 801
Indexes do not contain as much overhead, but do contain the data that
is being indexed, so they can be large also.
B
Bruce Momjian 已提交
802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835
     _________________________________________________________________
   
Section 4: Extending PostgreSQL

  4.1) I wrote a user-defined function and when I run it in psql, it dumps
  core.
  
   The problem could be a number of things. Try testing your user-defined
   function in a stand alone test program first. Also, make sure you are
   not sending elog NOTICES when the front-end is expecting data, such as
   during a type_in() or type_out() functions
   
  4.2) I get messages of the type NOTICE:PortalHeapMemoryFree: 0x402251d0 not
  in alloc set!
  
   You are pfree'ing something that was not palloc'ed. When writing
   user-defined functions, do not include the file "libpq-fe.h". Doing so
   will cause your palloc to be a malloc instead of a free. Then, when
   the backend pfrees the storage, you get the notice message.
   
  4.3) I've written some nifty new types and functions for PostgreSQL.
  
   Please share them with other PostgreSQL users. Send your extensions to
   mailing list, and they will eventually end up in the contrib/
   subdirectory.
   
  4.4) How do I write a C function to return a tuple?
  
   This requires extreme wizardry, so extreme that the authors have not
   ever tried it, though in principle it can be done. The short answer is
   ... you can't. This capability is forthcoming in the future.
     _________________________________________________________________
   
Section 5: Bugs
B
Bruce Momjian 已提交
836

B
Bruce Momjian 已提交
837 838 839 840 841 842 843 844 845 846 847
  5.1) How do I make a bug report?
  
   Check the current FAQ at http://postgreSQL.org
   
   Also check out our ftp site ftp://ftp.postgreSQL.org/pub to see if
   there is a more recent PostgreSQL version.
   
   You can also fill out the "bug-template" file and send it to:
     * bugs@postgreSQL.org
       
   This is the address of the developers mailing list.