top-skimming import from sf.net

git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@2 d0cd1f9f-072b-0410-8dd7-cf729c803f20

top-skimming import from sf.net
git-svn-id: https://tesseract-ocr.googlecode.com/svn/trunk@2 d0cd1f9f-072b-0410-8dd7-cf729c803f20
cd651045 · tmbdev · 6e077331 · cd651045 · cd651045 · cd651045
539 changed file
--- a/trunk/.cvsignore
+++ b/trunk/.cvsignore
+BUILD
+OWNERS
+Makefile
+README.google
+runautoconf
+config_auto.h
--- a/trunk/AUTHORS
+++ b/trunk/AUTHORS
+Ray Smith (lead developer) <theraysmith@users.sourceforge.net>
+Phil Cheatle
+Simon Crouch
+Dan Johnson
+Mark Seaman
+Sheelagh Huddleston
+Chris Newton
+... and several others.
--- a/trunk/COPYING
+++ b/trunk/COPYING
+This package contains the Tesseract Open Source OCR Engine.
+Orignally developed at Hewlett Packard Laboratories Bristol and
+at Hewlett Packard Co, Greeley Colorado, all the code
+in this distribution is now licensed under the Apache License:
+
+** Licensed under the Apache License, Version 2.0 (the "License");
+** you may not use this file except in compliance with the License.
+** You may obtain a copy of the License at
+** http://www.apache.org/licenses/LICENSE-2.0
+** Unless required by applicable law or agreed to in writing, software
+** distributed under the License is distributed on an "AS IS" BASIS,
+** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+** See the License for the specific language governing permissions and
+** limitations under the License.
+
+
+Other Dependencies and Licenses:
+================================
+The Aspirin/MIGRAINES system is no longer used.
+
+Tesseract can also make use of the libtiff library. (www.libtiff.org)
+Without libtiff, Tesseract can only read uncompressed and G3 compressed
+TIFF files.
--- a/trunk/ChangeLog
+++ b/trunk/ChangeLog
+June  2006 - V1.0 of open source Tesseract checked-in.
+Sep 7 2006 - V1.01.
+          Added mfcpch.cpp and getopt.cpp for VC++.
+          Fixed problem with greyscale images and no libtiff.
+          Stopped debug window from being used for the usage output.
+          Fixed load of inttemp for big-endian architectures.
+          Fixed some Mac compilation issues.
+Oct 4 2006 - V1.02
+          Removed dependency on Aspirin.
+          Fixed a few missing Apache license headers.
+          Removed $log.
+Feb 2 2007 - V1.03
+          Added mftraining and cntraining.
+          Added baseapi with adaptive thresholding for grey and color.
+          Fixed many memory leaks.
+          Fixed several bugs including lack of use of adaptive classifier.
+          Added ifdefs to eliminate graphics code and add embedded platform support.
+          Incorporated several patches, including 64-bit builds, Mac builds.
+          Minor accuracy improvements.
+
--- a/trunk/INSTALL
+++ b/trunk/INSTALL
+Copyright 1994, 1995, 1996, 1999, 2000, 2001, 2002 Free Software
+Foundation, Inc.
+
+   This file is free documentation; the Free Software Foundation gives
+unlimited permission to copy, distribute and modify it.
+
+Basic Installation
+==================
+
+   These are generic installation instructions.
+
+   The `configure' shell script attempts to guess correct values for
+various system-dependent variables used during compilation.  It uses
+those values to create a `Makefile' in each directory of the package.
+It may also create one or more `.h' files containing system-dependent
+definitions.  Finally, it creates a shell script `config.status' that
+you can run in the future to recreate the current configuration, and a
+file `config.log' containing compiler output (useful mainly for
+debugging `configure').
+
+   It can also use an optional file (typically called `config.cache'
+and enabled with `--cache-file=config.cache' or simply `-C') that saves
+the results of its tests to speed up reconfiguring.  (Caching is
+disabled by default to prevent problems with accidental use of stale
+cache files.)
+
+   If you need to do unusual things to compile the package, please try
+to figure out how `configure' could check whether to do them, and mail
+diffs or instructions to the address given in the `README' so they can
+be considered for the next release.  If you are using the cache, and at
+some point `config.cache' contains results you don't want to keep, you
+may remove or edit it.
+
+   The file `configure.ac' (or `configure.in') is used to create
+`configure' by a program called `autoconf'.  You only need
+`configure.ac' if you want to change it or regenerate `configure' using
+a newer version of `autoconf'.
+
+The simplest way to compile this package is:
+
+  1. `cd' to the directory containing the package's source code and type
+     `./configure' to configure the package for your system.  If you're
+     using `csh' on an old version of System V, you might need to type
+     `sh ./configure' instead to prevent `csh' from trying to execute
+     `configure' itself.
+
+     Running `configure' takes awhile.  While running, it prints some
+     messages telling which features it is checking for.
+
+  2. Type `make' to compile the package.
+
+  3. Optionally, type `make check' to run any self-tests that come with
+     the package.
+
+  4. Type `make install' to install the programs and any data files and
+     documentation.
+
+  5. You can remove the program binaries and object files from the
+     source code directory by typing `make clean'.  To also remove the
+     files that `configure' created (so you can compile the package for
+     a different kind of computer), type `make distclean'.  There is
+     also a `make maintainer-clean' target, but that is intended mainly
+     for the package's developers.  If you use it, you may have to get
+     all sorts of other programs in order to regenerate files that came
+     with the distribution.
+
+Compilers and Options
+=====================
+
+   Some systems require unusual options for compilation or linking that
+the `configure' script does not know about.  Run `./configure --help'
+for details on some of the pertinent environment variables.
+
+   You can give `configure' initial values for configuration parameters
+by setting variables in the command line or in the environment.  Here
+is an example:
+
+     ./configure CC=c89 CFLAGS=-O2 LIBS=-lposix
+
+   *Note Defining Variables::, for more details.
+
+Compiling For Multiple Architectures
+====================================
+
+   You can compile the package for more than one kind of computer at the
+same time, by placing the object files for each architecture in their
+own directory.  To do this, you must use a version of `make' that
+supports the `VPATH' variable, such as GNU `make'.  `cd' to the
+directory where you want the object files and executables to go and run
+the `configure' script.  `configure' automatically checks for the
+source code in the directory that `configure' is in and in `..'.
+
+   If you have to use a `make' that does not support the `VPATH'
+variable, you have to compile the package for one architecture at a
+time in the source code directory.  After you have installed the
+package for one architecture, use `make distclean' before reconfiguring
+for another architecture.
+
+Installation Names
+==================
+
+   By default, `make install' will install the package's files in
+`/usr/local/bin', `/usr/local/man', etc.  You can specify an
+installation prefix other than `/usr/local' by giving `configure' the
+option `--prefix=PATH'.
+
+   You can specify separate installation prefixes for
+architecture-specific files and architecture-independent files.  If you
+give `configure' the option `--exec-prefix=PATH', the package will use
+PATH as the prefix for installing programs and libraries.
+Documentation and other data files will still use the regular prefix.
+
+   In addition, if you use an unusual directory layout you can give
+options like `--bindir=PATH' to specify different values for particular
+kinds of files.  Run `configure --help' for a list of the directories
+you can set and what kinds of files go in them.
+
+   If the package supports it, you can cause programs to be installed
+with an extra prefix or suffix on their names by giving `configure' the
+option `--program-prefix=PREFIX' or `--program-suffix=SUFFIX'.
+
+Optional Features
+=================
+
+   Some packages pay attention to `--enable-FEATURE' options to
+`configure', where FEATURE indicates an optional part of the package.
+They may also pay attention to `--with-PACKAGE' options, where PACKAGE
+is something like `gnu-as' or `x' (for the X Window System).  The
+`README' should mention any `--enable-' and `--with-' options that the
+package recognizes.
+
+   For packages that use the X Window System, `configure' can usually
+find the X include and library files automatically, but if it doesn't,
+you can use the `configure' options `--x-includes=DIR' and
+`--x-libraries=DIR' to specify their locations.
+
+Specifying the System Type
+==========================
+
+   There may be some features `configure' cannot figure out
+automatically, but needs to determine by the type of machine the package
+will run on.  Usually, assuming the package is built to be run on the
+_same_ architectures, `configure' can figure that out, but if it prints
+a message saying it cannot guess the machine type, give it the
+`--build=TYPE' option.  TYPE can either be a short name for the system
+type, such as `sun4', or a canonical name which has the form:
+
+     CPU-COMPANY-SYSTEM
+
+where SYSTEM can have one of these forms:
+
+     OS KERNEL-OS
+
+   See the file `config.sub' for the possible values of each field.  If
+`config.sub' isn't included in this package, then this package doesn't
+need to know the machine type.
+
+   If you are _building_ compiler tools for cross-compiling, you should
+use the `--target=TYPE' option to select the type of system they will
+produce code for.
+
+   If you want to _use_ a cross compiler, that generates code for a
+platform different from the build platform, you should specify the
+"host" platform (i.e., that on which the generated programs will
+eventually be run) with `--host=TYPE'.
+
+Sharing Defaults
+================
+
+   If you want to set default values for `configure' scripts to share,
+you can create a site shell script called `config.site' that gives
+default values for variables like `CC', `cache_file', and `prefix'.
+`configure' looks for `PREFIX/share/config.site' if it exists, then
+`PREFIX/etc/config.site' if it exists.  Or, you can set the
+`CONFIG_SITE' environment variable to the location of the site script.
+A warning: not all `configure' scripts look for a site script.
+
+Defining Variables
+==================
+
+   Variables not defined in a site shell script can be set in the
+environment passed to `configure'.  However, some packages may run
+configure again during the build, and the customized values of these
+variables may be lost.  In order to avoid this problem, you should set
+them in the `configure' command line, using `VAR=value'.  For example:
+
+     ./configure CC=/usr/local2/bin/gcc
+
+will cause the specified gcc to be used as the C compiler (unless it is
+overridden in the site shell script).
+
+`configure' Invocation
+======================
+
+   `configure' recognizes the following options to control how it
+operates.
+
+`--help'
+`-h'
+     Print a summary of the options to `configure', and exit.
+
+`--version'
+`-V'
+     Print the version of Autoconf used to generate the `configure'
+     script, and exit.
+
+`--cache-file=FILE'
+     Enable the cache: use and save the results of the tests in FILE,
+     traditionally `config.cache'.  FILE defaults to `/dev/null' to
+     disable caching.
+
+`--config-cache'
+`-C'
+     Alias for `--cache-file=config.cache'.
+
+`--quiet'
+`--silent'
+`-q'
+     Do not print messages saying which checks are being made.  To
+     suppress all normal output, redirect it to `/dev/null' (any error
+     messages will still be shown).
+
+`--srcdir=DIR'
+     Look for the package's source code in directory DIR.  Usually
+     `configure' can determine that directory automatically.
+
+`configure' also accepts some other, not widely useful, options.  Run
+`configure --help' for more details.
+
--- a/trunk/Makefile.am
+++ b/trunk/Makefile.am
+# TODO(luc) Add 'doc' to this list when ready
+SUBDIRS = ccstruct ccutil classify cutil dict display image textord viewer wordrec ccmain training
+
+EXTRA_DIST = tessdata phototest.tif tesseract.dsp tesseract.dsw
+#EXTRA_DIST = doc/html doc/@PACKAGE_NAME@_@PACKAGE_VERSION@.pdf doc/@PACKAGE_NAME@_@PACKAGE_VERSION@.ps.gz
+
+dist-hook:
+# Need to remove CVS directories from directories
+# added using EXTRA_DIST. $(distdir)/tessdata would in
+# theory suffice.
+	rm -rf `find $(distdir) -name CVS`
+# Also remove extra files not needed in a distribution
+	rm -rf `find $(distdir) -name configure.ac`
+	rm -rf `find $(distdir) -name acinclude.m4`
+	rm -rf `find $(distdir) -name aclocal.m4`
--- a/trunk/Makefile.in
+++ b/trunk/Makefile.in
--- a/trunk/NEWS
+++ b/trunk/NEWS
+Stub file. To be populated at a later stage.
--- a/trunk/README
+++ b/trunk/README
+Introduction
+============
+This package contains the Tesseract Open Source OCR Engine.
+Orignally developed at Hewlett Packard Laboratories Bristol and
+at Hewlett Packard Co, Greeley Colorado, all the code
+in this distribution is now licensed under the Apache License:
+
+** Licensed under the Apache License, Version 2.0 (the "License");
+** you may not use this file except in compliance with the License.
+** You may obtain a copy of the License at
+** http://www.apache.org/licenses/LICENSE-2.0
+** Unless required by applicable law or agreed to in writing, software
+** distributed under the License is distributed on an "AS IS" BASIS,
+** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+** See the License for the specific language governing permissions and
+** limitations under the License.
+
+
+Other Dependencies and Licenses:
+================================
+The Aspirin/MIGRAINES system is no longer required.
+
+Tesseract can also make use of the libtiff library. (www.libtiff.org)
+Without libtiff, Tesseract can only read uncompressed and G3 compressed
+TIFF files.
+
+
+History:
+========
+The engine was developed at Hewlett Packard Laboratories Bristol and
+at Hewlett Packard Co, Greeley Colorado between 1985 and 1994, with some
+more changes made in 1996 to port to Windows, and some C++izing in 1998.
+A lot of the code was written in C, and then some more was written in C++.
+Since then all the code has been converted to at least compile with a C++
+compiler. Currently it builds under Linux with gcc2.95 and under Windows
+with VC++6. The C++ code makes heavy use of a list system using macros.
+This predates stl, was portable before stl, and is more efficent than stl
+lists, but has the big negative that if you do get a segmentation violation,
+it is hard to debug. Another "feature" of the C/C++ split is that the C++
+data structures get converted to C data structures to call the low-level C
+code. This is ugly, and the C++izing of the C code is a step towards
+eliminating the conversion, but it has not happened yet.
+
+
+Directory Structure (ordered by dependency):
+============================================
+ccmain     Top-level code. The main program resides in tesseractmain.cpp.
+display    An "editor" to view and operate on the internal structures.
+           (Requires a working viewer - batteries not included.)
+wordrec    The word-level recognizer.
+textord    The module that organizes(orders) text into lines and words.
+classify   The low-level character classifiers.
+ccstruct   Classes to hold information about a page as it is being processed.
+viewer     The client side of a client server viewing system.
+           Unfortunately, at this time, the server side is not available.
+image      Image class and processing functions.
+dict       Language model code.
+cutil      Code for file I/O, lists, heaps etc, from the old C code.
+ccutil     Somewhat newer code for lists, memory allocation etc from the
+           old C++ code.
+
+
+About the Engine
+================
+This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO OUTPUT
+FORMATTING, and NO UI. It can only process an image of a single column
+and create text from it. It can detect fixed pitch vs proportional text.
+Having said that, in 1995, this engine was in the top 3 in terms of character
+accuracy, and it compiles and runs on both Linux and Windows. Another current
+limitation is that it only recognizes English and its character set is only
+US-ASCII. Training code IS included in the open source release however, and
+will be included in a future release.
+
+
+Using the Engine
+================
+The usage of both Windows and Linux versions is the same.
+The executable must reside in the same directory as the tessdata directory
+The command line is:
+tesseract <image.tif> <output> batch
+The image file requires an .tif extension for its type to be recognized
+correctly. If a file exists with the .tif extension replaced by .uzn, then it
+will be interpreted as a UNLV-style zone file. (See www.isri.unlv.edu for
+details of the zone files.)
+
--- a/trunk/ReleaseNotes
+++ b/trunk/ReleaseNotes
+Tesseract release notes Feb 2, 2007 - V1.03.
+Added mftraining and cntraining. Using an image with a box file, tesseract
+generates .tr output files. cntraining runs on the .tr files to make
+normproto that lives in tessdata. mftraining runs on the .tr files to
+make inttemp and pffmtable in tessdata. These are the main data files
+that tesseract uses to recognize characters. At present, the code to make
+dictionary files is not yet available, nor are any sample box files or
+rebuilt inttemp or documentation to create any of these. Recognition is
+still limited to the ASCII set, but when this problem is fixed, documentation
+will follow.
+
+Added a new API with adaptive thresholding for grey and color images.
+See ccmain/baseapi.h/cpp for details. The main program has been converted
+to use the API as an example. See main() in ccmain/tesseractmain.cpp for
+details. The API is designed to make it easy to add subclasses with ability
+to output the bounding boxes etc from the internal structures. The adaptive
+thresholding improves accuracy (most of the time) on non-binary images.
+
+Many memory leaks have been fixed. There are no known leaks left from using
+the API correctly.
+
+The adaptive classifier was not operating correctly. This bug, and several
+others have been fixed, including poor chopping, an indefinite (if not quite
+infinite) loop in the number parser, and a couple of crash bugs. Thanks to
+all that have contributed bugs and bug fixes.
+
+It is now possible to build without any of the graphics support to save code
+size using #define GRAPHICS_DISABLED. There is also a new EMBEDDED define
+for use on operating systems with limited library support.
+
+64-bit and Mac OSX buildability is now included in the mainline source tree.
+Thanks to all that have contributed patches and comments to help with that.
+1.03 is also endian-independent, apart from the tiff i/o, so if you use
+libtiff, the code should run on all platforms, even if you get/create new
+data files of a different endinanness.
+
+Some of the bug fixes improve accuracy, and so do some of the changes to
+DangAmbigs and user-words.
+
+Tesseract release notes, Oct 4 2006 - V1.02.
+Removed dependency on aspirin. *All* code is now licensed under Apache2.0.
+
+Tesseract release notes, Sep 7 2006 - V1.01.
+
+Fixes for this release:
+Added mfcpch.cpp and getopt.cpp for VC++.
+Fixed problem with greyscale images and no libtiff.
+Stopped debug window from being used for the usage output.
+Fixed load of inttemp for big-endian architectures.
+Fixed some Mac compilation issues.
+
+This version should read uncompressed 8 bit grey and 24 bit color tiffs
+without having to have libtiff. It does a dumb threshold though, so don't
+expect good results from poor contrast or images of natural scenes etc.
+
+If you just run tesseract with no command line args you should now get a
+sensible usage message on linux, with or without X-windows.
+
+If you can get it to compile on a PPC Mac, it may now run correctly,
+although not all the build issues are fixed yet.
+
+Building Tesseract:
+Windows:
+Unpack the tar.gz archive
+Open tesseract.dsw in DevStudio (preferably version 6, higher versions will be more difficult)
+Set Win32 - Release as the active configuration.
+Build.
+Copy tesseract.exe from bin.rel up one directory level.
+Run tesseract phototest.tif phototest
+This will create phototest.txt.
+
+Linux:
+Unpack the tar.gz archive
+./configure
+make
+Copy tesseract from ccmain up one directory level (or create a symbolic link)
+Run tesseract phototest.tif phototest
+This will create phototest.txt.
--- a/trunk/acinclude.m4
+++ b/trunk/acinclude.m4
+# Master include for AC macros. This directory structure allows
+# for more flexibility with respect to CVS modules.
+#
+# Author: Luc Vincent
+
+### m4_include(config/ac_compile_check_sizeof.m4)dnl
+#m4_include(config/ac_create_stdint_h.m4)dnl
+#m4_include(config/ax_create_stdint_h.m4)dnl
+m4_include(config/ac_define_versionlevel.m4)dnl
+m4_include(config/acinclude_custom.m4)dnl
--- a/trunk/aclocal.m4
+++ b/trunk/aclocal.m4
--- a/trunk/ccmain/Makefile.am
+++ b/trunk/ccmain/Makefile.am
+SUBDIRS =
+AM_CPPFLAGS = \
+    -I$(top_srcdir)/ccutil -I$(top_srcdir)/ccstruct \
+    -I$(top_srcdir)/image -I$(top_srcdir)/viewer \
+    -I$(top_srcdir)/ccops -I$(top_srcdir)/dict \
+    -I$(top_srcdir)/classify -I$(top_srcdir)/display \
+    -I$(top_srcdir)/wordrec -I$(top_srcdir)/cutil \
+    -I$(top_srcdir)/textord
+
+EXTRA_DIST = \
+    adaptions.h applybox.h baseapi.h blobcmp.h \
+    callnet.h charcut.h \
+    control.h docqual.h expandblob.h fixspace.h fixxht.h \
+    imgscale.h matmatch.h output.h paircmp.h reject.h scaleimg.h \
+    tessbox.h tessedit.h tesseractmain.h tessvars.h tfacep.h \
+    tessembedded.h tfacepp.h tstruct.h werdit.h
+
+noinst_LIBRARIES = libtesseract_main.a
+libtesseract_main_a_SOURCES = \
+    tessedit.cpp adaptions.cpp applybox.cpp \
+    baseapi.cpp blobcmp.cpp \
+    callnet.cpp charcut.cpp charsample.cpp control.cpp \
+    docqual.cpp expandblob.cpp fixspace.cpp fixxht.cpp \
+    imgscale.cpp matmatch.cpp output.cpp paircmp.cpp \
+    reject.cpp scaleimg.cpp tessbox.cpp tessvars.cpp \
+    tfacepp.cpp tstruct.cpp werdit.cpp
+
+bin_PROGRAMS = tesseract
+tesseract_SOURCES = tesseractmain.cpp
+tesseract_LDADD = \
+    libtesseract_main.a \
+    ../display/libtesseract_display.a \
+    ../textord/libtesseract_textord.a \
+    ../wordrec/libtesseract_wordrec.a \
+    ../classify/libtesseract_classify.a \
+    ../dict/libtesseract_dict.a \
+    ../viewer/libtesseract_viewer.a \
+    ../image/libtesseract_image.a \
+    ../cutil/libtesseract_cutil.a \
+    ../ccstruct/libtesseract_ccstruct.a \
+    ../ccutil/libtesseract_ccutil.a
--- a/trunk/ccmain/Makefile.in
+++ b/trunk/ccmain/Makefile.in
--- a/trunk/ccmain/adaptions.cpp
+++ b/trunk/ccmain/adaptions.cpp
--- a/trunk/ccmain/adaptions.h
+++ b/trunk/ccmain/adaptions.h
+/**********************************************************************
+ * File:        adaptions.h  (Formerly adaptions.h)
+ * Description: Functions used to adapt to blobs already confidently
+ *					identified
+ * Author:		Chris Newton
+ * Created:		Thu Oct  7 10:17:28 BST 1993
+ *
+ * (C) Copyright 1992, Hewlett-Packard Ltd.
+ ** Licensed under the Apache License, Version 2.0 (the "License");
+ ** you may not use this file except in compliance with the License.
+ ** You may obtain a copy of the License at
+ ** http://www.apache.org/licenses/LICENSE-2.0
+ ** Unless required by applicable law or agreed to in writing, software
+ ** distributed under the License is distributed on an "AS IS" BASIS,
+ ** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ ** See the License for the specific language governing permissions and
+ ** limitations under the License.
+ *
+ **********************************************************************/
+
+#ifndef           ADAPTIONS_H
+#define           ADAPTIONS_H
+
+#include          "charsample.h"
+#include          "charcut.h"
+#include          "notdll.h"
+
+extern BOOL_VAR_H (tessedit_reject_ems, FALSE, "Reject all m's");
+extern BOOL_VAR_H (tessedit_reject_suspect_ems, FALSE, "Reject suspect m's");
+extern double_VAR_H (tessedit_cluster_t1, 0.20,
+"t1 threshold for clustering samples");
+extern double_VAR_H (tessedit_cluster_t2, 0.40,
+"t2 threshold for clustering samples");
+extern double_VAR_H (tessedit_cluster_t3, 0.12,
+"Extra threshold for clustering samples, only keep a new sample if best score greater than this value");
+extern double_VAR_H (tessedit_cluster_accept_fraction, 0.80,
+"Largest fraction of characters in cluster for it to be used for adaption");
+extern INT_VAR_H (tessedit_cluster_min_size, 3,
+"Smallest number of samples in a cluster for it to be used for adaption");
+extern BOOL_VAR_H (tessedit_cluster_debug, FALSE,
+"Generate and print debug information for adaption by clustering");
+extern BOOL_VAR_H (tessedit_use_best_sample, FALSE,
+"Use best sample from cluster when adapting");
+extern BOOL_VAR_H (tessedit_test_cluster_input, FALSE,
+"Set reject map to enable cluster input to be measured");
+extern BOOL_VAR_H (tessedit_matrix_match, TRUE, "Use matrix matcher");
+extern BOOL_VAR_H (tessedit_old_matrix_match, FALSE, "Use matrix matcher");
+extern BOOL_VAR_H (tessedit_mm_use_non_adaption_set, FALSE,
+"Don't try to adapt to characters on this list");
+extern STRING_VAR_H (tessedit_non_adaption_set, ",.;:'~@*",
+"Characters to be avoided when adapting");
+extern BOOL_VAR_H (tessedit_mm_adapt_using_prototypes, TRUE,
+"Use prototypes when adapting");
+extern BOOL_VAR_H (tessedit_mm_use_prototypes, TRUE,
+"Use prototypes as clusters are built");
+extern BOOL_VAR_H (tessedit_mm_use_rejmap, FALSE,
+"Adapt to characters using reject map");
+extern BOOL_VAR_H (tessedit_mm_all_rejects, FALSE,
+"Adapt to all characters using, matrix matcher");
+extern BOOL_VAR_H (tessedit_mm_only_match_same_char, FALSE,
+"Only match samples against clusters for the same character");
+extern BOOL_VAR_H (tessedit_process_rns, FALSE, "Handle m - rn ambigs");
+extern BOOL_VAR_H (tessedit_demo_adaption, FALSE,
+"Display cut images and matrix match for demo purposes");
+extern INT_VAR_H (tessedit_demo_word1, 62,
+"Word number of first word to display");
+extern INT_VAR_H (tessedit_demo_word2, 64,
+"Word number of second word to display");
+extern STRING_VAR_H (tessedit_demo_file, "academe",
+"Name of document containing demo words");
+BOOL8 word_adaptable(  //should we adapt?
+                     WERD_RES *word,
+                     UINT16 mode);
+void collect_ems_for_adaption(WERD_RES *word,
+                              CHAR_SAMPLES_LIST *char_clusters,
+                              CHAR_SAMPLE_LIST *chars_waiting);
+void collect_characters_for_adaption(WERD_RES *word,
+                                     CHAR_SAMPLES_LIST *char_clusters,
+                                     CHAR_SAMPLE_LIST *chars_waiting);
+void cluster_sample(CHAR_SAMPLE *sample,
+                    CHAR_SAMPLES_LIST *char_clusters,
+                    CHAR_SAMPLE_LIST *chars_waiting);
+void check_wait_list(CHAR_SAMPLE_LIST *chars_waiting,
+                     CHAR_SAMPLE *sample,
+                     CHAR_SAMPLES *best_cluster);
+void complete_clustering(CHAR_SAMPLES_LIST *char_clusters,
+                         CHAR_SAMPLE_LIST *chars_waiting);
+void adapt_to_good_ems(WERD_RES *word,
+                       CHAR_SAMPLES_LIST *char_clusters,
+                       CHAR_SAMPLE_LIST *chars_waiting);
+void adapt_to_good_samples(WERD_RES *word,
+                           CHAR_SAMPLES_LIST *char_clusters,
+                           CHAR_SAMPLE_LIST *chars_waiting);
+void print_em_stats(CHAR_SAMPLES_LIST *char_clusters,
+                    CHAR_SAMPLE_LIST *chars_waiting);
+                                 //lines of the image
+CHAR_SAMPLE *clip_sample(PIXROW *pixrow,
+                         IMAGELINE *imlines,
+                         BOX pix_box,  //box of imlines extent
+                         BOOL8 white_on_black,
+                         char c);
+void display_cluster_prototypes(CHAR_SAMPLES_LIST *char_clusters); 
+void reject_all_ems(WERD_RES *word); 
+void reject_all_fullstops(WERD_RES *word); 
+void reject_suspect_ems(WERD_RES *word); 
+void reject_suspect_fullstops(WERD_RES *word); 
+BOOL8 suspect_em(WERD_RES *word, INT16 index); 
+BOOL8 suspect_fullstop(WERD_RES *word, INT16 i); 
+#endif
--- a/trunk/ccmain/applybox.cpp
+++ b/trunk/ccmain/applybox.cpp
--- a/trunk/ccmain/applybox.h
+++ b/trunk/ccmain/applybox.h
+/**********************************************************************
+ * File:        applybox.h  (Formerly applybox.h)
+ * Description: Re segment rows according to box file data
+ * Author:		Phil Cheatle
+ * Created:		Wed Nov 24 09:11:23 GMT 1993
+ *
+ * (C) Copyright 1993, Hewlett-Packard Ltd.
+ ** Licensed under the Apache License, Version 2.0 (the "License");
+ ** you may not use this file except in compliance with the License.
+ ** You may obtain a copy of the License at
+ ** http://www.apache.org/licenses/LICENSE-2.0
+ ** Unless required by applicable law or agreed to in writing, software
+ ** distributed under the License is distributed on an "AS IS" BASIS,
+ ** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ ** See the License for the specific language governing permissions and
+ ** limitations under the License.
+ *
+ **********************************************************************/
+
+#ifndef           APPLYBOX_H
+#define           APPLYBOX_H
+
+#include          "varable.h"
+#include          "ocrblock.h"
+#include          "ocrrow.h"
+#include          "notdll.h"
+
+extern BOOL_VAR_H (applybox_rebalance, TRUE, "Drop dead");
+extern INT_VAR_H (applybox_debug, 0, "Debug level");
+extern STRING_VAR_H (applybox_test_exclusions, "|",
+"Chars ignored for testing");
+extern double_VAR_H (applybox_error_band, 0.15, "Err band as fract of xht");
+void apply_boxes(BLOCK_LIST *block_list    //real blocks
+                );
+void clear_any_old_text(                        //remove correct text
+                        BLOCK_LIST *block_list  //real blocks
+                       );
+BOOL8 read_next_box(FILE* box_file,  //
+                    BOX *box,
+                    char *ch);
+ROW *find_row_of_box(                         //
+                     BLOCK_LIST *block_list,  //real blocks
+                     BOX box,                 //from boxfile
+                     INT16 &block_id,
+                     INT16 &row_id_to_process);
+INT16 resegment_box(  //
+                    ROW *row,
+                    BOX box,
+                    char *ch,
+                    INT16 block_id,
+                    INT16 row_id,
+                    INT16 boxfile_lineno,
+                    INT16 boxfile_charno);
+void tidy_up(                         //
+             BLOCK_LIST *block_list,  //real blocks
+             INT16 &ok_char_count,
+             INT16 &ok_row_count,
+             INT16 &unlabelled_words,
+             INT16 *tgt_char_counts,
+             INT16 &rebalance_count,
+             char &min_char,
+             INT16 &min_samples,
+             INT16 &final_labelled_blob_count);
+void report_failed_box(INT16 boxfile_lineno,
+                       INT16 boxfile_charno,
+                       BOX box,
+                       char *box_ch,
+                       const char *err_msg);
+void apply_box_training(BLOCK_LIST *block_list);
+void apply_box_testing(BLOCK_LIST *block_list);
+#endif
--- a/trunk/ccmain/baseapi.cpp
+++ b/trunk/ccmain/baseapi.cpp
--- a/trunk/ccmain/baseapi.h
+++ b/trunk/ccmain/baseapi.h
+///////////////////////////////////////////////////////////////////////
+// File:        baseapi.h
+// Description: Simple API for calling tesseract.
+// Author:      Ray Smith
+// Created:     Fri Oct 06 15:35:01 PDT 2006
+//
+// (C) Copyright 2006, Google Inc.
+// Licensed under the Apache License, Version 2.0 (the "License");
+// you may not use this file except in compliance with the License.
+// You may obtain a copy of the License at
+// http://www.apache.org/licenses/LICENSE-2.0
+// Unless required by applicable law or agreed to in writing, software
+// distributed under the License is distributed on an "AS IS" BASIS,
+// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+// See the License for the specific language governing permissions and
+// limitations under the License.
+//
+///////////////////////////////////////////////////////////////////////
+
+#ifndef THIRD_PARTY_TESSERACT_CCMAIN_BASEAPI_H__
+#define THIRD_PARTY_TESSERACT_CCMAIN_BASEAPI_H__
+
+#include <string>
+
+#include "host.h"
+#include "ocrclass.h"
+
+class PAGE_RES;
+class BLOCK_LIST;
+
+// Base class for all tesseract APIs.
+// Specific classes can add ability to work on different inputs or produce
+// different outputs.
+
+class TessBaseAPI {
+ public:
+  // Start tesseract.
+  // The datapath must be the name of the data directory or some other file
+  // in which the data directory resides (for instance argv[0].)
+  // The configfile is the name of a file in the tessconfigs directory
+  // (eg batch) or NULL to run on defaults.
+  // Outputbase may also be NULL, and is the basename of various output files.
+  // If the output of any of these files is enabled, then a name must be given.
+  // If numeric_mode is true, only possible digits and roman numbers are
+  // returned. Returns 0 if successful. Crashes if not.
+  // The argc and argv may be 0 and NULL respectively. They are used for
+  // providing config files for debug/display purposes.
+  // TODO(rays) get the facts straight. Is it OK to call
+  // it more than once? Make it properly check for errors and return them.
+  static int Init(const char* datapath, const char* outputbase,
+                  const char* configfile, bool numeric_mode,
+                  int argc, char* argv[]);
+
+  // Recognize a rectangle from an image and return the result as a string.
+  // May be called many times for a single Init.
+  // Currently has no error checking.
+  // Greyscale of 8 and color of 24 or 32 bits per pixel may be given.
+  // Palette color images will not work properly and must be converted to
+  // 24 bit.
+  // Binary images of 1 bit per pixel may also be given but they must be
+  // byte packed with the MSB of the first byte being the first pixel, and a
+  // 1 represents WHITE. For binary images set bytes_per_pixel=0.
+  // The recognized text is returned as a char* which (in future will be coded
+  // as UTF8 and) must be freed with the delete [] operator.
+  static char* TesseractRect(const UINT8* imagedata,
+                             int bytes_per_pixel,
+                             int bytes_per_line,
+                             int left, int top, int width, int height);
+
+  // Call between pages or documents etc to free up memory and forget
+  // adaptive data.
+  static void ClearAdaptiveClassifier();
+
+  // Close down tesseract and free up memory.
+  static void End();
+
+  // Dump the internal binary image to a PGM file.
+  static void DumpPGM(const char* filename);
+
+ protected:
+  // Copy the given image rectangle to Tesseract, with adaptive thresholding
+  // if the image is not already binary.
+  static void CopyImageToTesseract(const UINT8* imagedata,
+                                   int bytes_per_pixel,
+                                   int bytes_per_line,
+                                   int left, int top, int width, int height);
+
+  // Compute the Otsu threshold(s) for the given image rectangle, making one
+  // for each channel. Each channel is always one byte per pixel.
+  // Returns an array of threshold values and an array of hi_values, such
+  // that a pixel value >threshold[channel] is considered foreground if
+  // hi_values[channel] is 0 or background if 1. A hi_value of -1 indicates
+  // that there is no apparent foreground. At least one hi_value will not be -1.
+  // thresholds and hi_values are assumed to be of bytes_per_pixel size.
+  static void OtsuThreshold(const UINT8* imagedata,
+                           int bytes_per_pixel,
+                           int bytes_per_line,
+                           int left, int top, int right, int bottom,
+                           int* thresholds,
+                           int* hi_values);
+
+  // Compute the histogram for the given image rectangle, and the given
+  // channel. (Channel pointed to by imagedata.) Each channel is always
+  // one byte per pixel.
+  // Bytes per pixel is used to skip channels not being
+  // counted with this call in a multi-channel (pixel-major) image.
+  // Histogram is always a 256 element array to count occurrences of
+  // each pixel value.
+  static void HistogramRect(const UINT8* imagedata,
+                            int bytes_per_pixel,
+                            int bytes_per_line,
+                            int left, int top, int right, int bottom,
+                            int* histogram);
+
+  // Compute the Otsu threshold(s) for the given histogram.
+  // Also returns H = total count in histogram, and
+  // omega0 = count of histogram below threshold.
+  static int OtsuStats(const int* histogram,
+                       int* H_out,
+                       int* omega0_out);
+
+  // Threshold the given grey or color image into the tesseract global
+  // image ready for recognition. Requires thresholds and hi_value
+  // produced by OtsuThreshold above.
+  static void ThresholdRect(const UINT8* imagedata,
+                            int bytes_per_pixel,
+                            int bytes_per_line,
+                            int left, int top,
+                            int width, int height,
+                            const int* thresholds,
+                            const int* hi_values);
+
+  // Cut out the requested rectangle of the binary image to the
+  // tesseract global image ready for recognition.
+  static void CopyBinaryRect(const UINT8* imagedata,
+                             int bytes_per_line,
+                             int left, int top,
+                             int width, int height);
+
+  // Low-level function to recognize the current global image to a string.
+  static char* RecognizeToString();
+
+  // Find lines from the image making the BLOCK_LIST.
+  static void FindLines(BLOCK_LIST* block_list);
+
+  // Recognize the tesseract global image and return the result as Tesseract
+  // internal structures.
+  static PAGE_RES* Recognize(BLOCK_LIST* block_list, ETEXT_DESC* monitor);
+
+  // Convert (and free) the internal data structures into a text string.
+  static char* TesseractToText(PAGE_RES* page_res);
+};
+
+#endif  // THIRD_PARTY_TESSERACT_CCMAIN_BASEAPI_H__
--- a/trunk/ccmain/blobcmp.cpp
+++ b/trunk/ccmain/blobcmp.cpp
--- a/trunk/ccmain/blobcmp.h
+++ b/trunk/ccmain/blobcmp.h
+/**********************************************************************
+ * File:			blobcmp.c
+ * Description: Code to compare blobs using the adaptive matcher.
+ * Author:		Ray Smith
+ * Created:		Wed Apr 21 09:28:51 BST 1993
+ *
+ * (C) Copyright 1993, Hewlett-Packard Ltd.
+ ** Licensed under the Apache License, Version 2.0 (the "License");
+ ** you may not use this file except in compliance with the License.
+ ** You may obtain a copy of the License at
+ ** http://www.apache.org/licenses/LICENSE-2.0
+ ** Unless required by applicable law or agreed to in writing, software
+ ** distributed under the License is distributed on an "AS IS" BASIS,
+ ** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ ** See the License for the specific language governing permissions and
+ ** limitations under the License.
+ *
+ **********************************************************************/
+
+#ifndef           BLOBCMP_H
+#define           BLOBCMP_H
+
+#include          "tstruct.h"
+
+float compare_tess_blobs(TBLOB *blob1,
+                         TEXTROW *row1,
+                         TBLOB *blob2,
+                         TEXTROW *row2);
+#endif
--- a/trunk/ccmain/callnet.cpp
+++ b/trunk/ccmain/callnet.cpp
--- a/trunk/ccmain/callnet.h
+++ b/trunk/ccmain/callnet.h
--- a/trunk/ccmain/charcut.cpp
+++ b/trunk/ccmain/charcut.cpp
--- a/trunk/ccmain/charcut.h
+++ b/trunk/ccmain/charcut.h
--- a/trunk/ccmain/charsample.cpp
+++ b/trunk/ccmain/charsample.cpp
--- a/trunk/ccmain/control.cpp
+++ b/trunk/ccmain/control.cpp
--- a/trunk/ccmain/control.h
+++ b/trunk/ccmain/control.h
--- a/trunk/ccmain/docqual.cpp
+++ b/trunk/ccmain/docqual.cpp
--- a/trunk/ccmain/docqual.h
+++ b/trunk/ccmain/docqual.h
--- a/trunk/ccmain/expandblob.cpp
+++ b/trunk/ccmain/expandblob.cpp
--- a/trunk/ccmain/expandblob.h
+++ b/trunk/ccmain/expandblob.h
--- a/trunk/ccmain/fixspace.cpp
+++ b/trunk/ccmain/fixspace.cpp
--- a/trunk/ccmain/fixspace.h
+++ b/trunk/ccmain/fixspace.h
--- a/trunk/ccmain/fixxht.cpp
+++ b/trunk/ccmain/fixxht.cpp
--- a/trunk/ccmain/fixxht.h
+++ b/trunk/ccmain/fixxht.h
--- a/trunk/ccmain/imgscale.cpp
+++ b/trunk/ccmain/imgscale.cpp
--- a/trunk/ccmain/imgscale.h
+++ b/trunk/ccmain/imgscale.h
--- a/trunk/ccmain/matmatch.cpp
+++ b/trunk/ccmain/matmatch.cpp
--- a/trunk/ccmain/matmatch.h
+++ b/trunk/ccmain/matmatch.h
--- a/trunk/ccmain/output.cpp
+++ b/trunk/ccmain/output.cpp
--- a/trunk/ccmain/output.h
+++ b/trunk/ccmain/output.h
--- a/trunk/ccmain/paircmp.cpp
+++ b/trunk/ccmain/paircmp.cpp
--- a/trunk/ccmain/paircmp.h
+++ b/trunk/ccmain/paircmp.h
--- a/trunk/ccmain/reject.cpp
+++ b/trunk/ccmain/reject.cpp
--- a/trunk/ccmain/reject.h
+++ b/trunk/ccmain/reject.h
--- a/trunk/ccmain/scaleimg.cpp
+++ b/trunk/ccmain/scaleimg.cpp
--- a/trunk/ccmain/scaleimg.h
+++ b/trunk/ccmain/scaleimg.h
--- a/trunk/ccmain/tessbox.cpp
+++ b/trunk/ccmain/tessbox.cpp
--- a/trunk/ccmain/tessbox.h
+++ b/trunk/ccmain/tessbox.h
--- a/trunk/ccmain/tessedit.cpp
+++ b/trunk/ccmain/tessedit.cpp
--- a/trunk/ccmain/tessedit.h
+++ b/trunk/ccmain/tessedit.h
--- a/trunk/ccmain/tessembedded.h
+++ b/trunk/ccmain/tessembedded.h
--- a/trunk/ccmain/tesseractmain.cpp
+++ b/trunk/ccmain/tesseractmain.cpp
--- a/trunk/ccmain/tesseractmain.h
+++ b/trunk/ccmain/tesseractmain.h
--- a/trunk/ccmain/tessvars.cpp
+++ b/trunk/ccmain/tessvars.cpp
--- a/trunk/ccmain/tessvars.h
+++ b/trunk/ccmain/tessvars.h
--- a/trunk/ccmain/tfacep.h
+++ b/trunk/ccmain/tfacep.h
--- a/trunk/ccmain/tfacepp.cpp
+++ b/trunk/ccmain/tfacepp.cpp
--- a/trunk/ccmain/tfacepp.h
+++ b/trunk/ccmain/tfacepp.h
--- a/trunk/ccmain/tstruct.cpp
+++ b/trunk/ccmain/tstruct.cpp
--- a/trunk/ccmain/tstruct.h
+++ b/trunk/ccmain/tstruct.h
--- a/trunk/ccmain/werdit.cpp
+++ b/trunk/ccmain/werdit.cpp
--- a/trunk/ccmain/werdit.h
+++ b/trunk/ccmain/werdit.h
--- a/trunk/ccstruct/Makefile.am
+++ b/trunk/ccstruct/Makefile.am
--- a/trunk/ccstruct/Makefile.in
+++ b/trunk/ccstruct/Makefile.in
--- a/trunk/ccstruct/blckerr.h
+++ b/trunk/ccstruct/blckerr.h
--- a/trunk/ccstruct/blobbox.cpp
+++ b/trunk/ccstruct/blobbox.cpp
--- a/trunk/ccstruct/blobbox.h
+++ b/trunk/ccstruct/blobbox.h
--- a/trunk/ccstruct/blobs.cpp
+++ b/trunk/ccstruct/blobs.cpp
--- a/trunk/ccstruct/blobs.h
+++ b/trunk/ccstruct/blobs.h
--- a/trunk/ccstruct/blread.cpp
+++ b/trunk/ccstruct/blread.cpp
--- a/trunk/ccstruct/blread.h
+++ b/trunk/ccstruct/blread.h
--- a/trunk/ccstruct/callcpp.cpp
+++ b/trunk/ccstruct/callcpp.cpp
--- a/trunk/ccstruct/coutln.cpp
+++ b/trunk/ccstruct/coutln.cpp
--- a/trunk/ccstruct/coutln.h
+++ b/trunk/ccstruct/coutln.h
--- a/trunk/ccstruct/crakedge.h
+++ b/trunk/ccstruct/crakedge.h
--- a/trunk/ccstruct/genblob.cpp
+++ b/trunk/ccstruct/genblob.cpp
--- a/trunk/ccstruct/genblob.h
+++ b/trunk/ccstruct/genblob.h
--- a/trunk/ccstruct/hpddef.h
+++ b/trunk/ccstruct/hpddef.h
--- a/trunk/ccstruct/hpdsizes.h
+++ b/trunk/ccstruct/hpdsizes.h
--- a/trunk/ccstruct/ipoints.h
+++ b/trunk/ccstruct/ipoints.h
--- a/trunk/ccstruct/labls.cpp
+++ b/trunk/ccstruct/labls.cpp
--- a/trunk/ccstruct/labls.h
+++ b/trunk/ccstruct/labls.h
--- a/trunk/ccstruct/linlsq.cpp
+++ b/trunk/ccstruct/linlsq.cpp
--- a/trunk/ccstruct/linlsq.h
+++ b/trunk/ccstruct/linlsq.h
--- a/trunk/ccstruct/lmedsq.cpp
+++ b/trunk/ccstruct/lmedsq.cpp
--- a/trunk/ccstruct/lmedsq.h
+++ b/trunk/ccstruct/lmedsq.h
--- a/trunk/ccstruct/mod128.cpp
+++ b/trunk/ccstruct/mod128.cpp
--- a/trunk/ccstruct/mod128.h
+++ b/trunk/ccstruct/mod128.h
--- a/trunk/ccstruct/normalis.cpp
+++ b/trunk/ccstruct/normalis.cpp
--- a/trunk/ccstruct/normalis.h
+++ b/trunk/ccstruct/normalis.h
--- a/trunk/ccstruct/ocrblock.cpp
+++ b/trunk/ccstruct/ocrblock.cpp
--- a/trunk/ccstruct/ocrblock.h
+++ b/trunk/ccstruct/ocrblock.h
--- a/trunk/ccstruct/ocrrow.cpp
+++ b/trunk/ccstruct/ocrrow.cpp
--- a/trunk/ccstruct/ocrrow.h
+++ b/trunk/ccstruct/ocrrow.h
--- a/trunk/ccstruct/pageblk.cpp
+++ b/trunk/ccstruct/pageblk.cpp
--- a/trunk/ccstruct/pageblk.h
+++ b/trunk/ccstruct/pageblk.h
--- a/trunk/ccstruct/pageres.cpp
+++ b/trunk/ccstruct/pageres.cpp
--- a/trunk/ccstruct/pageres.h
+++ b/trunk/ccstruct/pageres.h
--- a/trunk/ccstruct/pdblock.cpp
+++ b/trunk/ccstruct/pdblock.cpp
--- a/trunk/ccstruct/pdblock.h
+++ b/trunk/ccstruct/pdblock.h
--- a/trunk/ccstruct/pdclass.h
+++ b/trunk/ccstruct/pdclass.h
--- a/trunk/ccstruct/points.cpp
+++ b/trunk/ccstruct/points.cpp
--- a/trunk/ccstruct/points.h
+++ b/trunk/ccstruct/points.h
--- a/trunk/ccstruct/polyaprx.cpp
+++ b/trunk/ccstruct/polyaprx.cpp
--- a/trunk/ccstruct/polyaprx.h
+++ b/trunk/ccstruct/polyaprx.h
--- a/trunk/ccstruct/polyblk.cpp
+++ b/trunk/ccstruct/polyblk.cpp
--- a/trunk/ccstruct/polyblk.h
+++ b/trunk/ccstruct/polyblk.h
--- a/trunk/ccstruct/polyblob.cpp
+++ b/trunk/ccstruct/polyblob.cpp
--- a/trunk/ccstruct/polyblob.h
+++ b/trunk/ccstruct/polyblob.h
--- a/trunk/ccstruct/polyvert.cpp
+++ b/trunk/ccstruct/polyvert.cpp
--- a/trunk/ccstruct/polyvert.h
+++ b/trunk/ccstruct/polyvert.h
--- a/trunk/ccstruct/poutline.cpp
+++ b/trunk/ccstruct/poutline.cpp
--- a/trunk/ccstruct/poutline.h
+++ b/trunk/ccstruct/poutline.h
--- a/trunk/ccstruct/quadlsq.cpp
+++ b/trunk/ccstruct/quadlsq.cpp
--- a/trunk/ccstruct/quadlsq.h
+++ b/trunk/ccstruct/quadlsq.h
--- a/trunk/ccstruct/quadratc.cpp
+++ b/trunk/ccstruct/quadratc.cpp
--- a/trunk/ccstruct/quadratc.h
+++ b/trunk/ccstruct/quadratc.h
--- a/trunk/ccstruct/quspline.cpp
+++ b/trunk/ccstruct/quspline.cpp
--- a/trunk/ccstruct/quspline.h
+++ b/trunk/ccstruct/quspline.h
--- a/trunk/ccstruct/ratngs.cpp
+++ b/trunk/ccstruct/ratngs.cpp
--- a/trunk/ccstruct/ratngs.h
+++ b/trunk/ccstruct/ratngs.h
--- a/trunk/ccstruct/rect.cpp
+++ b/trunk/ccstruct/rect.cpp
--- a/trunk/ccstruct/rect.h
+++ b/trunk/ccstruct/rect.h
--- a/trunk/ccstruct/rejctmap.cpp
+++ b/trunk/ccstruct/rejctmap.cpp
--- a/trunk/ccstruct/rejctmap.h
+++ b/trunk/ccstruct/rejctmap.h
--- a/trunk/ccstruct/rwpoly.cpp
+++ b/trunk/ccstruct/rwpoly.cpp
--- a/trunk/ccstruct/rwpoly.h
+++ b/trunk/ccstruct/rwpoly.h
--- a/trunk/ccstruct/statistc.cpp
+++ b/trunk/ccstruct/statistc.cpp
--- a/trunk/ccstruct/statistc.h
+++ b/trunk/ccstruct/statistc.h
--- a/trunk/ccstruct/stepblob.cpp
+++ b/trunk/ccstruct/stepblob.cpp
--- a/trunk/ccstruct/stepblob.h
+++ b/trunk/ccstruct/stepblob.h
--- a/trunk/ccstruct/txtregn.cpp
+++ b/trunk/ccstruct/txtregn.cpp
--- a/trunk/ccstruct/txtregn.h
+++ b/trunk/ccstruct/txtregn.h
--- a/trunk/ccstruct/vecfuncs.cpp
+++ b/trunk/ccstruct/vecfuncs.cpp
--- a/trunk/ccstruct/vecfuncs.h
+++ b/trunk/ccstruct/vecfuncs.h
--- a/trunk/ccstruct/werd.cpp
+++ b/trunk/ccstruct/werd.cpp
--- a/trunk/ccstruct/werd.h
+++ b/trunk/ccstruct/werd.h
--- a/trunk/ccutil/Makefile.am
+++ b/trunk/ccutil/Makefile.am
--- a/trunk/ccutil/Makefile.in
+++ b/trunk/ccutil/Makefile.in
--- a/trunk/ccutil/basedir.cpp
+++ b/trunk/ccutil/basedir.cpp
--- a/trunk/ccutil/basedir.h
+++ b/trunk/ccutil/basedir.h
--- a/trunk/ccutil/bits16.cpp
+++ b/trunk/ccutil/bits16.cpp
--- a/trunk/ccutil/bits16.h
+++ b/trunk/ccutil/bits16.h
--- a/trunk/ccutil/clst.cpp
+++ b/trunk/ccutil/clst.cpp
--- a/trunk/ccutil/clst.h
+++ b/trunk/ccutil/clst.h
--- a/trunk/ccutil/debugwin.cpp
+++ b/trunk/ccutil/debugwin.cpp
--- a/trunk/ccutil/debugwin.h
+++ b/trunk/ccutil/debugwin.h
--- a/trunk/ccutil/elst.cpp
+++ b/trunk/ccutil/elst.cpp
--- a/trunk/ccutil/elst.h
+++ b/trunk/ccutil/elst.h
--- a/trunk/ccutil/elst2.cpp
+++ b/trunk/ccutil/elst2.cpp
--- a/trunk/ccutil/elst2.h
+++ b/trunk/ccutil/elst2.h
--- a/trunk/ccutil/errcode.cpp
+++ b/trunk/ccutil/errcode.cpp
--- a/trunk/ccutil/errcode.h
+++ b/trunk/ccutil/errcode.h
--- a/trunk/ccutil/fileerr.h
+++ b/trunk/ccutil/fileerr.h
--- a/trunk/ccutil/getopt.cpp
+++ b/trunk/ccutil/getopt.cpp
--- a/trunk/ccutil/getopt.h
+++ b/trunk/ccutil/getopt.h
--- a/trunk/ccutil/globaloc.cpp
+++ b/trunk/ccutil/globaloc.cpp
--- a/trunk/ccutil/globaloc.h
+++ b/trunk/ccutil/globaloc.h
--- a/trunk/ccutil/hashfn.cpp
+++ b/trunk/ccutil/hashfn.cpp
--- a/trunk/ccutil/hashfn.h
+++ b/trunk/ccutil/hashfn.h
--- a/trunk/ccutil/host.h
+++ b/trunk/ccutil/host.h
--- a/trunk/ccutil/hosthplb.h
+++ b/trunk/ccutil/hosthplb.h
--- a/trunk/ccutil/lsterr.h
+++ b/trunk/ccutil/lsterr.h
--- a/trunk/ccutil/mainblk.cpp
+++ b/trunk/ccutil/mainblk.cpp
--- a/trunk/ccutil/mainblk.h
+++ b/trunk/ccutil/mainblk.h
--- a/trunk/ccutil/memblk.cpp
+++ b/trunk/ccutil/memblk.cpp
--- a/trunk/ccutil/memblk.h
+++ b/trunk/ccutil/memblk.h
--- a/trunk/ccutil/memry.cpp
+++ b/trunk/ccutil/memry.cpp
--- a/trunk/ccutil/memry.h
+++ b/trunk/ccutil/memry.h
--- a/trunk/ccutil/memryerr.h
+++ b/trunk/ccutil/memryerr.h
--- a/trunk/ccutil/mfcpch.cpp
+++ b/trunk/ccutil/mfcpch.cpp
--- a/trunk/ccutil/mfcpch.h
+++ b/trunk/ccutil/mfcpch.h
--- a/trunk/ccutil/ndminx.h
+++ b/trunk/ccutil/ndminx.h
--- a/trunk/ccutil/notdll.h
+++ b/trunk/ccutil/notdll.h
--- a/trunk/ccutil/nwmain.h
+++ b/trunk/ccutil/nwmain.h
--- a/trunk/ccutil/ocrclass.h
+++ b/trunk/ccutil/ocrclass.h
--- a/trunk/ccutil/ocrshell.cpp
+++ b/trunk/ccutil/ocrshell.cpp
--- a/trunk/ccutil/ocrshell.h
+++ b/trunk/ccutil/ocrshell.h
--- a/trunk/ccutil/platform.h
+++ b/trunk/ccutil/platform.h
--- a/trunk/ccutil/scanutils.cpp
+++ b/trunk/ccutil/scanutils.cpp
--- a/trunk/ccutil/scanutils.h
+++ b/trunk/ccutil/scanutils.h
--- a/trunk/ccutil/secname.h
+++ b/trunk/ccutil/secname.h
--- a/trunk/ccutil/serialis.cpp
+++ b/trunk/ccutil/serialis.cpp
--- a/trunk/ccutil/serialis.h
+++ b/trunk/ccutil/serialis.h
--- a/trunk/ccutil/stderr.h
+++ b/trunk/ccutil/stderr.h
--- a/trunk/ccutil/strngs.cpp
+++ b/trunk/ccutil/strngs.cpp
--- a/trunk/ccutil/strngs.h
+++ b/trunk/ccutil/strngs.h
--- a/trunk/ccutil/tessclas.h
+++ b/trunk/ccutil/tessclas.h
--- a/trunk/ccutil/tprintf.cpp
+++ b/trunk/ccutil/tprintf.cpp
--- a/trunk/ccutil/tprintf.h
+++ b/trunk/ccutil/tprintf.h
--- a/trunk/ccutil/unichar.cpp
+++ b/trunk/ccutil/unichar.cpp
--- a/trunk/ccutil/unichar.h
+++ b/trunk/ccutil/unichar.h
--- a/trunk/ccutil/varable.cpp
+++ b/trunk/ccutil/varable.cpp
--- a/trunk/ccutil/varable.h
+++ b/trunk/ccutil/varable.h
--- a/trunk/classify/Makefile.am
+++ b/trunk/classify/Makefile.am
--- a/trunk/classify/Makefile.in
+++ b/trunk/classify/Makefile.in
--- a/trunk/classify/adaptive.cpp
+++ b/trunk/classify/adaptive.cpp
--- a/trunk/classify/adaptive.h
+++ b/trunk/classify/adaptive.h
--- a/trunk/classify/adaptmatch.cpp
+++ b/trunk/classify/adaptmatch.cpp
--- a/trunk/classify/adaptmatch.h
+++ b/trunk/classify/adaptmatch.h
--- a/trunk/classify/baseline.cpp
+++ b/trunk/classify/baseline.cpp
--- a/trunk/classify/baseline.h
+++ b/trunk/classify/baseline.h
--- a/trunk/classify/blobclass.cpp
+++ b/trunk/classify/blobclass.cpp
--- a/trunk/classify/blobclass.h
+++ b/trunk/classify/blobclass.h
--- a/trunk/classify/chartoname.cpp
+++ b/trunk/classify/chartoname.cpp
--- a/trunk/classify/chartoname.h
+++ b/trunk/classify/chartoname.h
--- a/trunk/classify/cluster.cpp
+++ b/trunk/classify/cluster.cpp
--- a/trunk/classify/cluster.h
+++ b/trunk/classify/cluster.h
--- a/trunk/classify/clusttool.cpp
+++ b/trunk/classify/clusttool.cpp
--- a/trunk/classify/clusttool.h
+++ b/trunk/classify/clusttool.h
--- a/trunk/classify/cutoffs.cpp
+++ b/trunk/classify/cutoffs.cpp
--- a/trunk/classify/cutoffs.h
+++ b/trunk/classify/cutoffs.h
--- a/trunk/classify/extern.h
+++ b/trunk/classify/extern.h
--- a/trunk/classify/extract.cpp
+++ b/trunk/classify/extract.cpp
--- a/trunk/classify/extract.h
+++ b/trunk/classify/extract.h
--- a/trunk/classify/featdefs.cpp
+++ b/trunk/classify/featdefs.cpp
--- a/trunk/classify/featdefs.h
+++ b/trunk/classify/featdefs.h
--- a/trunk/classify/flexfx.cpp
+++ b/trunk/classify/flexfx.cpp
--- a/trunk/classify/flexfx.h
+++ b/trunk/classify/flexfx.h
--- a/trunk/classify/float2int.cpp
+++ b/trunk/classify/float2int.cpp
--- a/trunk/classify/float2int.h
+++ b/trunk/classify/float2int.h
--- a/trunk/classify/fpoint.cpp
+++ b/trunk/classify/fpoint.cpp
--- a/trunk/classify/fpoint.h
+++ b/trunk/classify/fpoint.h
--- a/trunk/classify/fxdefs.cpp
+++ b/trunk/classify/fxdefs.cpp
--- a/trunk/classify/fxdefs.h
+++ b/trunk/classify/fxdefs.h
--- a/trunk/classify/fxid.h
+++ b/trunk/classify/fxid.h
--- a/trunk/classify/hideedge.cpp
+++ b/trunk/classify/hideedge.cpp
--- a/trunk/classify/hideedge.h
+++ b/trunk/classify/hideedge.h
--- a/trunk/classify/intfx.cpp
+++ b/trunk/classify/intfx.cpp
--- a/trunk/classify/intfx.h
+++ b/trunk/classify/intfx.h
--- a/trunk/classify/intmatcher.cpp
+++ b/trunk/classify/intmatcher.cpp
--- a/trunk/classify/intmatcher.h
+++ b/trunk/classify/intmatcher.h
--- a/trunk/classify/intproto.cpp
+++ b/trunk/classify/intproto.cpp
--- a/trunk/classify/intproto.h
+++ b/trunk/classify/intproto.h
--- a/trunk/classify/kdtree.cpp
+++ b/trunk/classify/kdtree.cpp
--- a/trunk/classify/kdtree.h
+++ b/trunk/classify/kdtree.h
--- a/trunk/classify/mf.cpp
+++ b/trunk/classify/mf.cpp
--- a/trunk/classify/mf.h
+++ b/trunk/classify/mf.h
--- a/trunk/classify/mfdefs.cpp
+++ b/trunk/classify/mfdefs.cpp
--- a/trunk/classify/mfdefs.h
+++ b/trunk/classify/mfdefs.h
--- a/trunk/classify/mfoutline.cpp
+++ b/trunk/classify/mfoutline.cpp
--- a/trunk/classify/mfoutline.h
+++ b/trunk/classify/mfoutline.h
--- a/trunk/classify/mfx.cpp
+++ b/trunk/classify/mfx.cpp
--- a/trunk/classify/mfx.h
+++ b/trunk/classify/mfx.h
--- a/trunk/classify/normfeat.cpp
+++ b/trunk/classify/normfeat.cpp
--- a/trunk/classify/normfeat.h
+++ b/trunk/classify/normfeat.h
--- a/trunk/classify/normmatch.cpp
+++ b/trunk/classify/normmatch.cpp
--- a/trunk/classify/normmatch.h
+++ b/trunk/classify/normmatch.h
--- a/trunk/classify/ocrfeatures.cpp
+++ b/trunk/classify/ocrfeatures.cpp
--- a/trunk/classify/ocrfeatures.h
+++ b/trunk/classify/ocrfeatures.h
--- a/trunk/classify/outfeat.cpp
+++ b/trunk/classify/outfeat.cpp
--- a/trunk/classify/outfeat.h
+++ b/trunk/classify/outfeat.h
--- a/trunk/classify/picofeat.cpp
+++ b/trunk/classify/picofeat.cpp
--- a/trunk/classify/picofeat.h
+++ b/trunk/classify/picofeat.h
--- a/trunk/classify/protos.cpp
+++ b/trunk/classify/protos.cpp
--- a/trunk/classify/protos.h
+++ b/trunk/classify/protos.h
--- a/trunk/classify/sigmenu.cpp
+++ b/trunk/classify/sigmenu.cpp
--- a/trunk/classify/sigmenu.h
+++ b/trunk/classify/sigmenu.h
--- a/trunk/classify/speckle.cpp
+++ b/trunk/classify/speckle.cpp
--- a/trunk/classify/speckle.h
+++ b/trunk/classify/speckle.h
--- a/trunk/classify/xform2d.cpp
+++ b/trunk/classify/xform2d.cpp
--- a/trunk/classify/xform2d.h
+++ b/trunk/classify/xform2d.h
--- a/trunk/config/ac_compile_check_sizeof.m4
+++ b/trunk/config/ac_compile_check_sizeof.m4
--- a/trunk/config/ac_create_stdint_h.m4
+++ b/trunk/config/ac_create_stdint_h.m4
--- a/trunk/config/ac_define_versionlevel.m4
+++ b/trunk/config/ac_define_versionlevel.m4
--- a/trunk/config/acinclude_custom.m4
+++ b/trunk/config/acinclude_custom.m4
--- a/trunk/config/ax_create_stdint_h.m4
+++ b/trunk/config/ax_create_stdint_h.m4
--- a/trunk/config/config.guess
+++ b/trunk/config/config.guess
--- a/trunk/config/config.h.in
+++ b/trunk/config/config.h.in
--- a/trunk/config/config.sub
+++ b/trunk/config/config.sub
--- a/trunk/config/depcomp
+++ b/trunk/config/depcomp
--- a/trunk/config/install-sh
+++ b/trunk/config/install-sh
--- a/trunk/config/missing
+++ b/trunk/config/missing
--- a/trunk/config/mkinstalldirs
+++ b/trunk/config/mkinstalldirs
--- a/trunk/configure
+++ b/trunk/configure
--- a/trunk/configure.ac
+++ b/trunk/configure.ac
--- a/trunk/cutil/Makefile.am
+++ b/trunk/cutil/Makefile.am
--- a/trunk/cutil/Makefile.in
+++ b/trunk/cutil/Makefile.in
--- a/trunk/cutil/bitvec.cpp
+++ b/trunk/cutil/bitvec.cpp
--- a/trunk/cutil/bitvec.h
+++ b/trunk/cutil/bitvec.h
--- a/trunk/cutil/callcpp.h
+++ b/trunk/cutil/callcpp.h
--- a/trunk/cutil/const.h
+++ b/trunk/cutil/const.h
--- a/trunk/cutil/cutil.cpp
+++ b/trunk/cutil/cutil.cpp
--- a/trunk/cutil/cutil.h
+++ b/trunk/cutil/cutil.h
--- a/trunk/cutil/danerror.cpp
+++ b/trunk/cutil/danerror.cpp
--- a/trunk/cutil/danerror.h
+++ b/trunk/cutil/danerror.h
--- a/trunk/cutil/debug.cpp
+++ b/trunk/cutil/debug.cpp
--- a/trunk/cutil/debug.h
+++ b/trunk/cutil/debug.h
--- a/trunk/cutil/efio.cpp
+++ b/trunk/cutil/efio.cpp
--- a/trunk/cutil/efio.h
+++ b/trunk/cutil/efio.h
--- a/trunk/cutil/emalloc.cpp
+++ b/trunk/cutil/emalloc.cpp
--- a/trunk/cutil/emalloc.h
+++ b/trunk/cutil/emalloc.h
--- a/trunk/cutil/freelist.cpp
+++ b/trunk/cutil/freelist.cpp
--- a/trunk/cutil/freelist.h
+++ b/trunk/cutil/freelist.h
--- a/trunk/cutil/funcdefs.h
+++ b/trunk/cutil/funcdefs.h
--- a/trunk/cutil/general.h
+++ b/trunk/cutil/general.h
--- a/trunk/cutil/globals.cpp
+++ b/trunk/cutil/globals.cpp
--- a/trunk/cutil/globals.h
+++ b/trunk/cutil/globals.h
--- a/trunk/cutil/listio.cpp
+++ b/trunk/cutil/listio.cpp
--- a/trunk/cutil/listio.h
+++ b/trunk/cutil/listio.h
--- a/trunk/cutil/minmax.h
+++ b/trunk/cutil/minmax.h
--- a/trunk/cutil/oldheap.cpp
+++ b/trunk/cutil/oldheap.cpp
--- a/trunk/cutil/oldheap.h
+++ b/trunk/cutil/oldheap.h
--- a/trunk/cutil/oldlist.cpp
+++ b/trunk/cutil/oldlist.cpp
--- a/trunk/cutil/oldlist.h
+++ b/trunk/cutil/oldlist.h
--- a/trunk/cutil/structures.cpp
+++ b/trunk/cutil/structures.cpp
--- a/trunk/cutil/structures.h
+++ b/trunk/cutil/structures.h
--- a/trunk/cutil/tessarray.cpp
+++ b/trunk/cutil/tessarray.cpp
--- a/trunk/cutil/tessarray.h
+++ b/trunk/cutil/tessarray.h
--- a/trunk/cutil/tordvars.cpp
+++ b/trunk/cutil/tordvars.cpp
--- a/trunk/cutil/tordvars.h
+++ b/trunk/cutil/tordvars.h
--- a/trunk/cutil/variables.cpp
+++ b/trunk/cutil/variables.cpp
--- a/trunk/cutil/variables.h
+++ b/trunk/cutil/variables.h
--- a/trunk/dict/Makefile.am
+++ b/trunk/dict/Makefile.am
--- a/trunk/dict/Makefile.in
+++ b/trunk/dict/Makefile.in
--- a/trunk/dict/choicearr.h
+++ b/trunk/dict/choicearr.h
--- a/trunk/dict/choices.cpp
+++ b/trunk/dict/choices.cpp
--- a/trunk/dict/choices.h
+++ b/trunk/dict/choices.h
--- a/trunk/dict/context.cpp
+++ b/trunk/dict/context.cpp
--- a/trunk/dict/context.h
+++ b/trunk/dict/context.h
--- a/trunk/dict/dawg.cpp
+++ b/trunk/dict/dawg.cpp
--- a/trunk/dict/dawg.h
+++ b/trunk/dict/dawg.h
--- a/trunk/dict/hyphen.cpp
+++ b/trunk/dict/hyphen.cpp
--- a/trunk/dict/hyphen.h
+++ b/trunk/dict/hyphen.h
--- a/trunk/dict/matchdefs.h
+++ b/trunk/dict/matchdefs.h
--- a/trunk/dict/permdawg.cpp
+++ b/trunk/dict/permdawg.cpp
--- a/trunk/dict/permdawg.h
+++ b/trunk/dict/permdawg.h
--- a/trunk/dict/permnum.cpp
+++ b/trunk/dict/permnum.cpp
--- a/trunk/dict/permnum.h
+++ b/trunk/dict/permnum.h
--- a/trunk/dict/permute.cpp
+++ b/trunk/dict/permute.cpp
--- a/trunk/dict/permute.h
+++ b/trunk/dict/permute.h
--- a/trunk/dict/states.cpp
+++ b/trunk/dict/states.cpp
--- a/trunk/dict/states.h
+++ b/trunk/dict/states.h
--- a/trunk/dict/stopper.cpp
+++ b/trunk/dict/stopper.cpp
--- a/trunk/dict/stopper.h
+++ b/trunk/dict/stopper.h
--- a/trunk/dict/trie.cpp
+++ b/trunk/dict/trie.cpp
--- a/trunk/dict/trie.h
+++ b/trunk/dict/trie.h
--- a/trunk/display/Makefile.am
+++ b/trunk/display/Makefile.am
--- a/trunk/display/Makefile.in
+++ b/trunk/display/Makefile.in
--- a/trunk/display/cmndwin.cpp
+++ b/trunk/display/cmndwin.cpp
--- a/trunk/display/cmndwin.h
+++ b/trunk/display/cmndwin.h
--- a/trunk/display/pagewalk.cpp
+++ b/trunk/display/pagewalk.cpp
--- a/trunk/display/pagewalk.h
+++ b/trunk/display/pagewalk.h
--- a/trunk/display/pgedit.cpp
+++ b/trunk/display/pgedit.cpp
--- a/trunk/display/pgedit.h
+++ b/trunk/display/pgedit.h
--- a/trunk/display/pgeditx.h
+++ b/trunk/display/pgeditx.h
--- a/trunk/display/sbdmenu.cpp
+++ b/trunk/display/sbdmenu.cpp
--- a/trunk/display/sbdmenu.h
+++ b/trunk/display/sbdmenu.h
--- a/trunk/display/submen.h
+++ b/trunk/display/submen.h
--- a/trunk/display/tessio.h
+++ b/trunk/display/tessio.h
--- a/trunk/display/varabled.cpp
+++ b/trunk/display/varabled.cpp
--- a/trunk/display/varabled.h
+++ b/trunk/display/varabled.h
--- a/trunk/display/varblmen.cpp
+++ b/trunk/display/varblmen.cpp
--- a/trunk/display/varblmen.h
+++ b/trunk/display/varblmen.h
--- a/trunk/display/varblwin.cpp
+++ b/trunk/display/varblwin.cpp
--- a/trunk/display/varblwin.h
+++ b/trunk/display/varblwin.h
--- a/trunk/doc/main.txt
+++ b/trunk/doc/main.txt
--- a/trunk/image/Makefile.am
+++ b/trunk/image/Makefile.am
--- a/trunk/image/Makefile.in
+++ b/trunk/image/Makefile.in
--- a/trunk/image/bitstrm.cpp
+++ b/trunk/image/bitstrm.cpp
--- a/trunk/image/bitstrm.h
+++ b/trunk/image/bitstrm.h
--- a/trunk/image/img.h
+++ b/trunk/image/img.h
--- a/trunk/image/imgbmp.cpp
+++ b/trunk/image/imgbmp.cpp
--- a/trunk/image/imgbmp.h
+++ b/trunk/image/imgbmp.h
--- a/trunk/image/imgerrs.h
+++ b/trunk/image/imgerrs.h
--- a/trunk/image/imgio.cpp
+++ b/trunk/image/imgio.cpp
--- a/trunk/image/imgio.h
+++ b/trunk/image/imgio.h
--- a/trunk/image/imgs.cpp
+++ b/trunk/image/imgs.cpp
--- a/trunk/image/imgs.h
+++ b/trunk/image/imgs.h
--- a/trunk/image/imgtiff.cpp
+++ b/trunk/image/imgtiff.cpp
--- a/trunk/image/imgtiff.h
+++ b/trunk/image/imgtiff.h
--- a/trunk/image/imgunpk.h
+++ b/trunk/image/imgunpk.h
--- a/trunk/phototest.tif
+++ b/trunk/phototest.tif
--- a/trunk/tessdata/DangAmbigs
+++ b/trunk/tessdata/DangAmbigs
--- a/trunk/tessdata/blackText.params
+++ b/trunk/tessdata/blackText.params
--- a/trunk/tessdata/configs/api_config
+++ b/trunk/tessdata/configs/api_config
--- a/trunk/tessdata/configs/api_resaljet
+++ b/trunk/tessdata/configs/api_resaljet
--- a/trunk/tessdata/configs/box.train
+++ b/trunk/tessdata/configs/box.train
--- a/trunk/tessdata/configs/inter
+++ b/trunk/tessdata/configs/inter
--- a/trunk/tessdata/configs/oldapi_config
+++ b/trunk/tessdata/configs/oldapi_config
--- a/trunk/tessdata/configs/oldbox.train
+++ b/trunk/tessdata/configs/oldbox.train
--- a/trunk/tessdata/configs/var_api_config
+++ b/trunk/tessdata/configs/var_api_config
--- a/trunk/tessdata/configs/var_box.train
+++ b/trunk/tessdata/configs/var_box.train
--- a/trunk/tessdata/configs/variable_config
+++ b/trunk/tessdata/configs/variable_config
--- a/trunk/tessdata/confsets
+++ b/trunk/tessdata/confsets
--- a/trunk/tessdata/fmtable.cls
+++ b/trunk/tessdata/fmtable.cls
--- a/trunk/tessdata/fnetwts
+++ b/trunk/tessdata/fnetwts
--- a/trunk/tessdata/freq-dawg
+++ b/trunk/tessdata/freq-dawg
--- a/trunk/tessdata/inttemp
+++ b/trunk/tessdata/inttemp
--- a/trunk/tessdata/netwts
+++ b/trunk/tessdata/netwts
--- a/trunk/tessdata/newdiff.asccodes
+++ b/trunk/tessdata/newdiff.asccodes
--- a/trunk/tessdata/normproto
+++ b/trunk/tessdata/normproto
--- a/trunk/tessdata/pffmtable
+++ b/trunk/tessdata/pffmtable
--- a/trunk/tessdata/soptable.cls
+++ b/trunk/tessdata/soptable.cls
--- a/trunk/tessdata/tessconfigs/batch
+++ b/trunk/tessdata/tessconfigs/batch
--- a/trunk/tessdata/tessconfigs/matdemo
+++ b/trunk/tessdata/tessconfigs/matdemo
--- a/trunk/tessdata/tessconfigs/old_batch
+++ b/trunk/tessdata/tessconfigs/old_batch
--- a/trunk/tessdata/tessconfigs/segdemo
+++ b/trunk/tessdata/tessconfigs/segdemo
--- a/trunk/tessdata/tessconfigs/var_batch
+++ b/trunk/tessdata/tessconfigs/var_batch
--- a/trunk/tessdata/test_matrix
+++ b/trunk/tessdata/test_matrix
--- a/trunk/tessdata/user-words
+++ b/trunk/tessdata/user-words
--- a/trunk/tessdata/word-dawg
+++ b/trunk/tessdata/word-dawg
--- a/trunk/tesseract.dsp
+++ b/trunk/tesseract.dsp
--- a/trunk/tesseract.dsw
+++ b/trunk/tesseract.dsw
--- a/trunk/textord/Makefile.am
+++ b/trunk/textord/Makefile.am
--- a/trunk/textord/Makefile.in
+++ b/trunk/textord/Makefile.in
--- a/trunk/textord/blkocc.cpp
+++ b/trunk/textord/blkocc.cpp
--- a/trunk/textord/blkocc.h
+++ b/trunk/textord/blkocc.h
--- a/trunk/textord/blobcmpl.h
+++ b/trunk/textord/blobcmpl.h
--- a/trunk/textord/drawedg.cpp
+++ b/trunk/textord/drawedg.cpp
--- a/trunk/textord/drawedg.h
+++ b/trunk/textord/drawedg.h
--- a/trunk/textord/drawtord.cpp
+++ b/trunk/textord/drawtord.cpp
--- a/trunk/textord/drawtord.h
+++ b/trunk/textord/drawtord.h
--- a/trunk/textord/edgblob.cpp
+++ b/trunk/textord/edgblob.cpp
--- a/trunk/textord/edgblob.h
+++ b/trunk/textord/edgblob.h
--- a/trunk/textord/edgloop.cpp
+++ b/trunk/textord/edgloop.cpp
--- a/trunk/textord/edgloop.h
+++ b/trunk/textord/edgloop.h
--- a/trunk/textord/fpchop.cpp
+++ b/trunk/textord/fpchop.cpp
--- a/trunk/textord/fpchop.h
+++ b/trunk/textord/fpchop.h
--- a/trunk/textord/gap_map.cpp
+++ b/trunk/textord/gap_map.cpp
--- a/trunk/textord/gap_map.h
+++ b/trunk/textord/gap_map.h
--- a/trunk/textord/makerow.cpp
+++ b/trunk/textord/makerow.cpp
--- a/trunk/textord/makerow.h
+++ b/trunk/textord/makerow.h
--- a/trunk/textord/oldbasel.cpp
+++ b/trunk/textord/oldbasel.cpp
--- a/trunk/textord/oldbasel.h
+++ b/trunk/textord/oldbasel.h
--- a/trunk/textord/pithsync.cpp
+++ b/trunk/textord/pithsync.cpp
--- a/trunk/textord/pithsync.h
+++ b/trunk/textord/pithsync.h
--- a/trunk/textord/pitsync1.cpp
+++ b/trunk/textord/pitsync1.cpp
--- a/trunk/textord/pitsync1.h
+++ b/trunk/textord/pitsync1.h
--- a/trunk/textord/scanedg.cpp
+++ b/trunk/textord/scanedg.cpp
--- a/trunk/textord/scanedg.h
+++ b/trunk/textord/scanedg.h
--- a/trunk/textord/sortflts.cpp
+++ b/trunk/textord/sortflts.cpp
--- a/trunk/textord/sortflts.h
+++ b/trunk/textord/sortflts.h
--- a/trunk/textord/tessout.h
+++ b/trunk/textord/tessout.h
--- a/trunk/textord/topitch.cpp
+++ b/trunk/textord/topitch.cpp
--- a/trunk/textord/topitch.h
+++ b/trunk/textord/topitch.h
--- a/trunk/textord/tordmain.cpp
+++ b/trunk/textord/tordmain.cpp
--- a/trunk/textord/tordmain.h
+++ b/trunk/textord/tordmain.h
--- a/trunk/textord/tospace.cpp
+++ b/trunk/textord/tospace.cpp
--- a/trunk/textord/tospace.h
+++ b/trunk/textord/tospace.h
--- a/trunk/textord/tovars.cpp
+++ b/trunk/textord/tovars.cpp
--- a/trunk/textord/tovars.h
+++ b/trunk/textord/tovars.h
--- a/trunk/textord/underlin.cpp
+++ b/trunk/textord/underlin.cpp
--- a/trunk/textord/underlin.h
+++ b/trunk/textord/underlin.h
--- a/trunk/textord/wordseg.cpp
+++ b/trunk/textord/wordseg.cpp
--- a/trunk/textord/wordseg.h
+++ b/trunk/textord/wordseg.h
--- a/trunk/training/Makefile.am
+++ b/trunk/training/Makefile.am
--- a/trunk/training/Makefile.in
+++ b/trunk/training/Makefile.in
--- a/trunk/training/cnTraining.cpp
+++ b/trunk/training/cnTraining.cpp
--- a/trunk/training/cnTraining.dsp
+++ b/trunk/training/cnTraining.dsp
--- a/trunk/training/cnTraining1.dsp
+++ b/trunk/training/cnTraining1.dsp
--- a/trunk/training/mergenf.cpp
+++ b/trunk/training/mergenf.cpp
--- a/trunk/training/mergenf.h
+++ b/trunk/training/mergenf.h
--- a/trunk/training/mfTraining.cpp
+++ b/trunk/training/mfTraining.cpp
--- a/trunk/training/mfTraining.dsp
+++ b/trunk/training/mfTraining.dsp
--- a/trunk/training/name2char.cpp
+++ b/trunk/training/name2char.cpp
--- a/trunk/training/name2char.h
+++ b/trunk/training/name2char.h
--- a/trunk/training/training.cpp
+++ b/trunk/training/training.cpp
--- a/trunk/training/training.h
+++ b/trunk/training/training.h
--- a/trunk/viewer/Makefile.am
+++ b/trunk/viewer/Makefile.am
--- a/trunk/viewer/Makefile.in
+++ b/trunk/viewer/Makefile.in
--- a/trunk/viewer/evntlst.cpp
+++ b/trunk/viewer/evntlst.cpp
--- a/trunk/viewer/evntlst.h
+++ b/trunk/viewer/evntlst.h
--- a/trunk/viewer/evnts.cpp
+++ b/trunk/viewer/evnts.cpp
--- a/trunk/viewer/evnts.h
+++ b/trunk/viewer/evnts.h
--- a/trunk/viewer/grphics.cpp
+++ b/trunk/viewer/grphics.cpp
--- a/trunk/viewer/grphics.h
+++ b/trunk/viewer/grphics.h
--- a/trunk/viewer/grphshm.cpp
+++ b/trunk/viewer/grphshm.cpp
--- a/trunk/viewer/grphshm.h
+++ b/trunk/viewer/grphshm.h
--- a/trunk/viewer/sbgconst.h
+++ b/trunk/viewer/sbgconst.h
--- a/trunk/viewer/sbgdefs.h
+++ b/trunk/viewer/sbgdefs.h
--- a/trunk/viewer/sbgtypes.h
+++ b/trunk/viewer/sbgtypes.h
--- a/trunk/viewer/showim.cpp
+++ b/trunk/viewer/showim.cpp
--- a/trunk/viewer/showim.h
+++ b/trunk/viewer/showim.h
--- a/trunk/wordrec/Makefile.am
+++ b/trunk/wordrec/Makefile.am
--- a/trunk/wordrec/Makefile.in
+++ b/trunk/wordrec/Makefile.in
--- a/trunk/wordrec/associate.cpp
+++ b/trunk/wordrec/associate.cpp
--- a/trunk/wordrec/associate.h
+++ b/trunk/wordrec/associate.h
--- a/trunk/wordrec/badwords.cpp
+++ b/trunk/wordrec/badwords.cpp
--- a/trunk/wordrec/badwords.h
+++ b/trunk/wordrec/badwords.h
--- a/trunk/wordrec/bestfirst.cpp
+++ b/trunk/wordrec/bestfirst.cpp
--- a/trunk/wordrec/bestfirst.h
+++ b/trunk/wordrec/bestfirst.h
--- a/trunk/wordrec/charsample.h
+++ b/trunk/wordrec/charsample.h
--- a/trunk/wordrec/chop.cpp
+++ b/trunk/wordrec/chop.cpp
--- a/trunk/wordrec/chop.h
+++ b/trunk/wordrec/chop.h
--- a/trunk/wordrec/chopper.cpp
+++ b/trunk/wordrec/chopper.cpp
--- a/trunk/wordrec/chopper.h
+++ b/trunk/wordrec/chopper.h
--- a/trunk/wordrec/closed.cpp
+++ b/trunk/wordrec/closed.cpp
--- a/trunk/wordrec/closed.h
+++ b/trunk/wordrec/closed.h
--- a/trunk/wordrec/djmenus.cpp
+++ b/trunk/wordrec/djmenus.cpp
--- a/trunk/wordrec/djmenus.h
+++ b/trunk/wordrec/djmenus.h
--- a/trunk/wordrec/drawfx.cpp
+++ b/trunk/wordrec/drawfx.cpp
--- a/trunk/wordrec/drawfx.h
+++ b/trunk/wordrec/drawfx.h
--- a/trunk/wordrec/findseam.cpp
+++ b/trunk/wordrec/findseam.cpp
--- a/trunk/wordrec/findseam.h
+++ b/trunk/wordrec/findseam.h
--- a/trunk/wordrec/gradechop.cpp
+++ b/trunk/wordrec/gradechop.cpp
--- a/trunk/wordrec/gradechop.h
+++ b/trunk/wordrec/gradechop.h
--- a/trunk/wordrec/heuristic.cpp
+++ b/trunk/wordrec/heuristic.cpp
--- a/trunk/wordrec/heuristic.h
+++ b/trunk/wordrec/heuristic.h
--- a/trunk/wordrec/makechop.cpp
+++ b/trunk/wordrec/makechop.cpp
--- a/trunk/wordrec/makechop.h
+++ b/trunk/wordrec/makechop.h
--- a/trunk/wordrec/matchtab.cpp
+++ b/trunk/wordrec/matchtab.cpp
--- a/trunk/wordrec/matchtab.h
+++ b/trunk/wordrec/matchtab.h
--- a/trunk/wordrec/matrix.cpp
+++ b/trunk/wordrec/matrix.cpp
--- a/trunk/wordrec/matrix.h
+++ b/trunk/wordrec/matrix.h
--- a/trunk/wordrec/measure.h
+++ b/trunk/wordrec/measure.h
--- a/trunk/wordrec/metrics.cpp
+++ b/trunk/wordrec/metrics.cpp
--- a/trunk/wordrec/metrics.h
+++ b/trunk/wordrec/metrics.h
--- a/trunk/wordrec/mfvars.cpp
+++ b/trunk/wordrec/mfvars.cpp
--- a/trunk/wordrec/mfvars.h
+++ b/trunk/wordrec/mfvars.h
--- a/trunk/wordrec/msmenus.cpp
+++ b/trunk/wordrec/msmenus.cpp
--- a/trunk/wordrec/msmenus.h
+++ b/trunk/wordrec/msmenus.h
--- a/trunk/wordrec/olutil.cpp
+++ b/trunk/wordrec/olutil.cpp
--- a/trunk/wordrec/olutil.h
+++ b/trunk/wordrec/olutil.h
--- a/trunk/wordrec/outlines.cpp
+++ b/trunk/wordrec/outlines.cpp
--- a/trunk/wordrec/outlines.h
+++ b/trunk/wordrec/outlines.h
--- a/trunk/wordrec/pieces.cpp
+++ b/trunk/wordrec/pieces.cpp
--- a/trunk/wordrec/pieces.h
+++ b/trunk/wordrec/pieces.h
--- a/trunk/wordrec/plotedges.cpp
+++ b/trunk/wordrec/plotedges.cpp
--- a/trunk/wordrec/plotedges.h
+++ b/trunk/wordrec/plotedges.h
--- a/trunk/wordrec/plotseg.cpp
+++ b/trunk/wordrec/plotseg.cpp
--- a/trunk/wordrec/plotseg.h
+++ b/trunk/wordrec/plotseg.h
--- a/trunk/wordrec/render.cpp
+++ b/trunk/wordrec/render.cpp
--- a/trunk/wordrec/render.h
+++ b/trunk/wordrec/render.h
--- a/trunk/wordrec/seam.cpp
+++ b/trunk/wordrec/seam.cpp
--- a/trunk/wordrec/seam.h
+++ b/trunk/wordrec/seam.h
--- a/trunk/wordrec/split.cpp
+++ b/trunk/wordrec/split.cpp
--- a/trunk/wordrec/split.h
+++ b/trunk/wordrec/split.h
--- a/trunk/wordrec/tally.cpp
+++ b/trunk/wordrec/tally.cpp
--- a/trunk/wordrec/tally.h
+++ b/trunk/wordrec/tally.h
--- a/trunk/wordrec/tessinit.cpp
+++ b/trunk/wordrec/tessinit.cpp
--- a/trunk/wordrec/tessinit.h
+++ b/trunk/wordrec/tessinit.h
--- a/trunk/wordrec/tface.cpp
+++ b/trunk/wordrec/tface.cpp
--- a/trunk/wordrec/tface.h
+++ b/trunk/wordrec/tface.h
--- a/trunk/wordrec/wordclass.cpp
+++ b/trunk/wordrec/wordclass.cpp
--- a/trunk/wordrec/wordclass.h
+++ b/trunk/wordrec/wordclass.h