regress.sgml 13.8 KB
Newer Older
1
<Chapter Id="regress">
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84
<Title>Regression Test</Title>

<Abstract>
<Para>
Regression test instructions and analysis.
</Para>
</Abstract>

<Para>
  The PostgreSQL regression tests are a comprehensive set of tests for the
  SQL implementation embedded in PostgreSQL developed by Jolly Chen and
  Andrew Yu. It tests standard SQL operations as well as the extended
  capabilities of PostgreSQL.
</Para>

<Para>
  These tests have recently been revised by Marc Fournier and Thomas Lockhart
and are now packaged as
  functional units which should make them easier to run and easier to interpret.
From <ProductName>PostgreSQL</ProductName> v6.1 onward
 the regression tests are current for every official release. 
</Para>

<Para>
  Some properly installed and fully functional PostgreSQL installations
  can fail some of these regression tests due to artifacts of floating point
  representation and time zone support. The current tests are evaluated
  using a simple "diff" algorithm, and are sensitive to small system
  differences. For apparently failed tests, examining the differences
  may reveal that the differences are not significant.
</Para>

<Para>
The regression testing notes below assume the following (except where noted):
<ItemizedList Mark="bullet" Spacing="compact">
<ListItem>
<Para>
Commands are Unix-compatible. See note below.
</Para>
</ListItem>
<ListItem>
<Para>
Defaults are used except where noted.
</Para>
</ListItem>
<ListItem>
<Para>
User postgres is the <ProductName>Postgres</ProductName> superuser.
</Para>
</ListItem>
<ListItem>
<Para>
The source path is /usr/src/pgsql (other paths are possible).
</Para>
</ListItem>
<ListItem>
<Para>
The runtime path is /usr/local/pgsql (other paths are possible).
</Para>
</ListItem>
</ItemizedList>
</Para>

<Sect1>
<Title>Regression Environment</Title>

<Para>
  The regression test is invoked by the <Command>make</Command> command which compiles
  a <Acronym>C</Acronym> program into a shared library
  in the current directory.  Localized shell scripts are also created in
  the current directory. The output file templates are massaged into the
  <FileName>./expected/*.out</FileName> files.  The localization replaces macros in the source
  files with absolute pathnames and user names.
</Para>

<Para>
  Normally, the regression test should be run as the pg_superuser since
  the 'src/test/regress' directory and sub-directories are owned by the
  pg_superuser. If you run the regression test as another user the
  'src/test/regress' directory tree should be writeable to that user.
</Para>

<Para>
85 86 87 88 89 90 91 92 93
  It was formerly necessary to run the postmaster with system time zone
  set to PST, but this is no longer required.  You can run the regression
  tests under your normal postmaster configuration.  The test script will
  set the PGTZ environment variable to ensure that timezone-dependent tests
  produce the expected results.  However, your system must provide
  library support for the PST8PDT time zone, or the timezone-dependent
  tests will fail.
  To verify that your machine does have this support, type
  the following:
94 95 96 97 98 99 100 101 102 103 104 105 106
<ProgramListing>
    setenv TZ PST8PDT
    date
</ProgramListing>
</Para>

<Para>
  The "date" command above should have returned the current system time
  in the PST8PDT time zone. If the PST8PDT database is not available, then
  your system may have returned the time in GMT. If the PST8PDT time zone
  is not available, you can set the time zone rules explicitly:
<ProgramListing>
    setenv PGTZ PST8PDT7,M04.01.0,M10.05.03
107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123
      </ProgramListing>
    </Para>
  </sect1>
  
  <Sect1>
    <Title>Directory Layout</Title>
    
    <Para>
      <Note>
	<Para>
	  This should become a table in the previous section.
	</Para>
      </Note>
    </Para>
    
    <Para>
      <ProgramListing>
124 125 126 127 128 129 130 131 132 133 134 135 136
  input/ .... .source files that are converted using 'make all' into
              some of the .sql files in the 'sql' subdirectory

  output/ ... .source files that are converted using 'make all' into
              .out files in the 'expected' subdirectory

  sql/ ...... .sql files used to perform the regression tests

  expected/ . .out files that represent what we *expect* the results to
              look like

  results/ .. .out files that represent what the results *actually* look
              like. Also used as temporary storage for table copy testing.
137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154
      </ProgramListing>
    </Para>
  </Sect1>
  
  <Sect1>
    <Title>Regression Test Procedure</Title>
    
    <Para>
      Commands were tested on RedHat Linux version 4.2 using the bash shell.
      Except where noted, they will probably work on most systems. Commands
      like <FileName>ps</FileName> and <FileName>tar</FileName> vary wildly on what options you should use on each
      platform. <Emphasis>Use common sense</Emphasis> before typing in these commands.
    </Para>
      
      <Para>
	For a fresh install or upgrading from previous releases of
	<ProductName>Postgres</ProductName>:
      </Para>
155 156 157 158
    
    <Procedure>
      <Title><ProductName>Postgres</ProductName> Regression Configuration</Title>

159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191
      <Step Performance="required">
	<Para>
	  The file /usr/src/pgsql/src/test/regress/README has detailed
	  instructions for running and interpreting the regression tests.
	  A short version follows here:
	</Para>
	
	<Para>
	  If the postmaster is not already running, start the postmaster on an
	  available window by typing
	  <ProgramListing>
	    postmaster
	  </ProgramListing>
	  
	  or start the postmaster daemon running in the background by typing
	  <ProgramListing>
	    cd
	    nohup postmaster > regress.log 2>&1 &
	  </ProgramListing>
	</Para>
	
	<Para>
	  Run postmaster from your <ProductName>Postgres</ProductName> super user account (typically
	  account postgres).
	  
	  <Note>
	    <Para>
	      Do not run <FileName>postmaster</FileName> from the root account.
	    </Para>
	  </Note>
	</Para>
      </Step>
      
192
      <Step Performance="optional">
193
	<Para>
194 195
	  If you have previously invoked the regression test, clean up the
	  working directory with:
196 197 198
	  
	  <ProgramListing>
	    cd /usr/src/pgsql/src/test/regress
199
	    gmake clean
200
	  </ProgramListing>
201
	</para>
202 203 204 205
	<Para>
	  You do not need to type "gmake clean" if this is the first time you
	  are running the tests.
	</Para>
206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226
      </step>

      
      <Step Performance="required">
	<Para>
	  Build the regression test. Type
	  <ProgramListing>
	    cd /usr/src/pgsql/src/test/regress
	    gmake all
	  </ProgramListing>
	</Para>
      </Step>
      
      <Step Performance="required">
	<Para>
	  Run the regression tests.  Type
	  <ProgramListing>
	    cd /usr/src/pgsql/src/test/regress
	    gmake runtest
	  </ProgramListing>
	</Para>
227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246
      </Step>
      
      <Step Performance="required">
	<Para>
	  
	  You should get on the screen (and also written to file ./regress.out)
	  a series of statements stating which tests passed and which tests
	  failed.  Please note that it can be normal for some of the tests to
	  "fail".  For the failed tests, use diff to compare the files in
	  directories ./results and ./expected.  If float8 failed, type
	  something like:
	  <ProgramListing>
	    cd /usr/src/pgsql/src/test/regress
	    diff -w expected/float8.out results
	  </ProgramListing>
	</Para>
      </Step>
      
      <Step Performance="required">
	<Para>
247
	  After running the tests and examining the results, type
248
	  <ProgramListing>
249
	    dropdb regression
250 251 252
	    cd /usr/src/pgsql/src/test/regress
	    gmake clean
	  </ProgramListing>
253
	  to recover the temporary disk space used by the tests.
254 255 256 257 258 259 260
	</Para>
      </Step>
    </procedure>
  </Sect1>
  
  <Sect1>
    <Title>Regression Analysis</Title>
261 262 263 264 265 266 267 268 269 270 271 272 273

     <Para>
       The results are in files in the ./results directory. These results
       can be compared with results in the ./expected directory using 'diff'.
       (The test script does this for you, and leaves the differences
       in ./regression.diffs.)
     </Para>

     <Para>
       The files might not compare exactly.  The test script will report
       any difference as a "failure", but the difference might be due
       to small cross-system differences in error message wording,
       math library behavior, etc.
274 275 276 277 278
      "Failures" of this type do not indicate a problem with
      <ProductName>Postgres</ProductName>.
    </Para>
    
    <Para>
279 280 281 282
      Thus, it is necessary to examine the actual differences for each
      "failed" test to determine whether there is really a problem.
      The following paragraphs attempt to provide some guidance in
      determining whether a difference is significant or not.
283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321
    </Para>
    
    <Sect2>
      <Title>Error message differences</Title>
      
      <Para>
	Some of the regression tests involve intentional invalid input values.
	Error messages can come from either the Postgres code or from the host
	platform system routines. In the latter case, the messages may vary
	between platforms, but should reflect similar information. These
	differences in messages will result in a "failed" regression test which
	can be validated by inspection.
      </Para>
      
    </Sect2>
    
    <Sect2>
      <Title>OID differences</Title>
      
      <Para>
	There are several places where PostgreSQL OID (object identifiers) appear
	in 'regress.out'. OID's are unique 32-bit integers which are generated
	by the PostgreSQL backend whenever a table row is inserted or updated.
	If you run the regression test on a non-virgin database or run it multiple
	times, the OID's reported will have different values. 
	
	The following SQL statements in 'misc.out' have shown this behavior:
	
	QUERY: SELECT user_relns() AS user_relns ORDER BY user_relns;
	
	The 'a,523676' row is composed from an OID.
      </Para>
      
    </Sect2>
    
    <Sect2>
      <Title>Date and time differences</Title>
      
      <Para>
322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339
  Most of the date and time results are dependent on timezone environment.
  The reference files are generated for timezone PST8PDT (Berkeley,
  California) and there will be apparent failures if the tests are not
  run with that timezone setting.  The regression test driver sets
  environment variable PGTZ to PST8PDT to ensure proper results.
      </Para>

      <Para>
  There appear to be some systems which do not accept the recommended syntax
  for explicitly setting the local time zone rules; you may need to use
  a different PGTZ setting on such machines.
      </Para>

      <Para>
  Some systems using older timezone libraries fail to apply daylight-savings
  corrections to pre-1970 dates, causing pre-1970 PDT times to be displayed
  in PST instead.  This will result in localized differences in the test
  results.
340 341 342 343 344 345 346 347
      </Para>
      
    </Sect2>
    
    <Sect2>
      <Title>Floating point differences</Title>
      
      <Para>
348
	Some of the tests involve computing 64-bit (<Type>float8</Type>) numbers from table
349
	columns. Differences in results involving mathematical functions of
350 351 352
	<Type>float8</Type> columns have been observed.  The float8
	and geometry tests are particularly prone to small differences
	across platforms.
353 354 355
	Human eyeball comparison is needed to determine the real significance
	of these differences which are usually 10 places to the right of
	the decimal point.
356 357 358
      </Para>

      <Para>
359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393
	Some systems signal errors from pow() and exp() differently from
	the mechanism expected by the current Postgres code.
      </Para>
      
    </Sect2>
    
    <Sect2>
      <Title>Polygon differences</Title>
      
      <Para>
	Several of the tests involve operations on geographic date about the
	Oakland/Berkley CA street map. The map data is expressed as polygons
	whose vertices are represented as pairs of <Type>float8</Type> numbers (decimal
	latitude and longitude). Initially, some tables are created and
	loaded with geographic data, then some views are created which join
	two tables using the polygon intersection operator (##), then a select
	is done on the view. 
	
	When comparing the results from different platforms, differences occur
	in the 2nd or 3rd place to the right of the decimal point. The SQL
	statements where these problems occur are the folowing:
	
	<ProgramListing>
	  QUERY: SELECT * from street;
	  QUERY: SELECT * from iexit;
	</ProgramListing>
      </Para>
      
    </Sect2>
    
    <Sect2>
      <Title>Random differences</Title>
      
      <Para>
	There is at least one test case in random.out which is intended to produce
394 395
	random results. This causes random to fail the regression testing
	once in a while.
396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434
	Typing
	<ProgramListing>
	  diff results/random.out expected/random.out
	</ProgramListing>
	
	should produce only
	one or a few lines of differences for this reason, but other floating
	point differences on dissimilar architectures might cause many more
	differences. See the release notes below.
      </Para>
      
    </Sect2>
    
    <Sect2>
      <Title>The <Quote>expected</Quote> files</Title>
      
      <Para>
	The <FileName>./expected/*.out</FileName> files were adapted from the original monolithic
	<FileName>expected.input</FileName> file provided by Jolly Chen et al. Newer versions of these
	files generated on various development machines have been substituted after
	careful (?) inspection. Many of the development machines are running a
	Unix OS variant (FreeBSD, Linux, etc) on Ix86 hardware.
	
	The original <FileName>expected.input</FileName> file was created on a SPARC Solaris 2.4
	system using the <FileName>postgres5-1.02a5.tar.gz</FileName> source tree. It was compared
	with a file created on an I386 Solaris 2.4 system and the differences
	were only in the floating point polygons in the 3rd digit to the right
	of the decimal point. (see below)
	
	The original <FileName>sample.regress.out</FileName> file was from the postgres-1.01 release
	constructed by Jolly Chen and is included here for reference. It may
	have been created on a DEC ALPHA machine as the <FileName>Makefile.global</FileName>
	in the postgres-1.01 release has PORTNAME=alpha.
      </Para>
      
    </Sect2>
    
  </Sect1>
  
435
</Chapter>