Some hints to building and running on Cygwin. Make sure you have installed the gcc compiler packages. They are not installed by default. Install the libtiff* and libtiff-devel packages. Not sure this is necessary but it can't hurt. Install the netpbm and tiff packages if you'll be converting to tiff from other formats. ./configure --with-libtiff=/usr/lib make # OK, let's assume 'make' worked OK. It did for me. If it didn't for you, # chances are you're still missing some necessary parts of Cygwin. Do a full # install this time... come back in a few days and try again. I'll wait. # # This next step is necessary because strace shows it looking for './tesseract' # in order to find output path. the error message isn't informative: # # $ tesseract phototest.tif phototest # main:Error:Can't open file:phototest to get path # 6 [sig] tesseract 2228 C:\cygwin\usr\src\tesseract-1.03\tesseract.exe: *** # fatal error - called with threadlist_ix -1 # Hangup # # $ mv -f tesseract.exe tesseract # # $ tesseract phototest.tif phototest # Tesseract Open Source OCR Engine # Image has 1 bit per pixel and size (640,480) # Resolution=200 mv -f ccmain/tesseract.exe tesseract # $ cat phototest.txt # This is a lot of 12 point text to test the # ocr code and see if it works on all types # of file format. # The quick brown dog jumped over the # lazy fox. The quick brown dog jumped # over the lazy fox. The quick brown dog # Jumped over the lazy fox. The quick # brown dog Jumped over the lazy fox. OK, now for a real test: # $ jpegtopnm.exe 1975_articles_of_incorporation_p1.jpg > ../test/aoi1.ppm # $ ppm2tiff -c none aoi1.ppm aoi1.tif # no compression on output! # $ tesseract aoi1.tif aoi1 # Tesseract Open Source OCR Engine # Image has 24 bits per pixel and size (912,1481) # # $ head aoi1.txt # I Filed in Office of State Corporation # Commission of New Mexico # August 25, l975 # ARTICLES of AMENDMENT TO THE ARTICLES OF HJCORPORATION OF # Tl-[E CITY OF THE SUN FOUNDATION # We, the undersigned, do hereby certify that these Amended Articles of Incorporat # ion are # to supencede any previous issues. # ARTICLE I - NAME # The name of this corporation is: CITY OF THE SUN FOUNDATION # ARTICLE II - PURFOSE Awesome. Imperfect, but awesome. Thank you guys! -- jc at unternet dot net