Hello,
I installed FuzzyOcr 3.5.1 on a RHEL5 server. It works on plain email files but not on sendmail's queue files. The problem is that I call it through SpamAssassin, which is in turn called by MailScanner, and it looks like it receives the email in that same queue-file format, because most of the time it doesn't detect any images.
Here is the output of my debug run against a queue file. Should I be concerned about the first three warnings (the "Subroutine ... redefined" lines)?
spamassassin --debug FuzzyOcr < dfl3QDoDBn018376
Subroutine FuzzyOcr::O_CREAT redefined at /usr/lib/perl5/5.8.8/Exporter.pm line 65.
at /usr/lib/perl5/5.8.8/i386-linux-thread-multi/POSIX.pm line 19
Subroutine FuzzyOcr::O_EXCL redefined at /usr/lib/perl5/5.8.8/Exporter.pm line 65.
at /usr/lib/perl5/5.8.8/i386-linux-thread-multi/POSIX.pm line 19
Subroutine FuzzyOcr::O_RDWR redefined at /usr/lib/perl5/5.8.8/Exporter.pm line 65.
at /usr/lib/perl5/5.8.8/i386-linux-thread-multi/POSIX.pm line 19
[18573] dbg: FuzzyOcr: focr_bin_helper: 'pnmnorm,pnminvert,pamthreshold,ppmtopgm,pamtopnm'
[18573] info: FuzzyOcr: Adding <5> new helper apps
[18573] info: FuzzyOcr: Starting preprocessor parser for file "/etc/mail/spamassassin/FuzzyOcr.preps"...
[18573] dbg: FuzzyOcr: line: preprocessor normalize {
[18573] dbg: FuzzyOcr: line: command = pnmnorm
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor invert {
[18573] dbg: FuzzyOcr: line: command = pnminvert
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor ppmtopgm {
[18573] dbg: FuzzyOcr: line: command = ppmtopgm
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor pamtopnm {
[18573] dbg: FuzzyOcr: line: command = pamtopnm
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor pamthreshold {
[18573] dbg: FuzzyOcr: line: command = pamthreshold
[18573] dbg: FuzzyOcr: line: args = -simple -threshold 0.5
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor maketiff {
[18573] dbg: FuzzyOcr: line: command = pnmtotiff
[18573] dbg: FuzzyOcr: line: args = -color -truecolor
[18573] dbg: FuzzyOcr: line: }
[18573] info: FuzzyOcr: Starting scanset parser for file "/etc/mail/spamassassin/FuzzyOcr.scansets"...
[18573] dbg: FuzzyOcr: line scanset ocrad {
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset ocrad-invert {
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 -i $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset ocrad-decolorize-invert {
[18573] dbg: FuzzyOcr: line preprocessors = ppmtopgm, pamthreshold, pamtopnm
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 -i $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset ocrad-decolorize {
[18573] dbg: FuzzyOcr: line preprocessors = ppmtopgm, pamthreshold, pamtopnm
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset gocr {
[18573] dbg: FuzzyOcr: line command = $gocr
[18573] dbg: FuzzyOcr: line args = -i $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset gocr-180 {
[18573] dbg: FuzzyOcr: line command = $gocr
[18573] dbg: FuzzyOcr: line args = -l 180 -d 2 -i $input
[18573] dbg: FuzzyOcr: line }
[18573] info: FuzzyOcr: Searching in: /usr/local/bin
[18573] info: FuzzyOcr: Searching in: /usr/bin
[18573] info: FuzzyOcr: Using gifsicle => /usr/bin/gifsicle
[18573] info: FuzzyOcr: Using giffix => /usr/bin/giffix
[18573] info: FuzzyOcr: Using giftext => /usr/bin/giftext
[18573] info: FuzzyOcr: Using gifinter => /usr/bin/gifinter
[18573] info: FuzzyOcr: Using giftopnm => /usr/bin/giftopnm
[18573] info: FuzzyOcr: Using jpegtopnm => /usr/bin/jpegtopnm
[18573] info: FuzzyOcr: Using pngtopnm => /usr/bin/pngtopnm
[18573] info: FuzzyOcr: Using bmptopnm => /usr/bin/bmptopnm
[18573] info: FuzzyOcr: Using tifftopnm => /usr/bin/tifftopnm
[18573] info: FuzzyOcr: Using ppmhist => /usr/bin/ppmhist
[18573] info: FuzzyOcr: Using pamfile => /usr/bin/pamfile
[18573] info: FuzzyOcr: Using ocrad => /usr/local/bin/ocrad
[18573] info: FuzzyOcr: Using gocr => /usr/bin/gocr
[18573] info: FuzzyOcr: Using pnmnorm => /usr/bin/pnmnorm
[18573] info: FuzzyOcr: Using pnminvert => /usr/bin/pnminvert
[18573] info: FuzzyOcr: Using pamthreshold => /usr/bin/pamthreshold
[18573] info: FuzzyOcr: Using ppmtopgm => /usr/bin/ppmtopgm
[18573] info: FuzzyOcr: Using pamtopnm => /usr/bin/pamtopnm
[18573] dbg: FuzzyOcr: Threshold[max_hash] => 5
[18573] dbg: FuzzyOcr: Threshold[c] => 5
[18573] dbg: FuzzyOcr: Threshold[s] => 0.01
[18573] dbg: FuzzyOcr: Threshold[w] => 0.01
[18573] dbg: FuzzyOcr: Threshold[h] => 0.01
[18573] dbg: FuzzyOcr: Threshold[cn] => 0.01
[18573] dbg: FuzzyOcr: focr_add_score => 1
[18573] dbg: FuzzyOcr: focr_autodisable_negative_score => -10
[18573] dbg: FuzzyOcr: focr_autodisable_score => 1000
[18573] dbg: FuzzyOcr: focr_autosort_buffer => 10
[18573] dbg: FuzzyOcr: focr_autosort_scanset => 1
[18573] dbg: FuzzyOcr: focr_base_score => 5
[18573] dbg: FuzzyOcr: focr_corrupt_score => 2.5
[18573] dbg: FuzzyOcr: focr_corrupt_unfixable_score => 0.5
[18573] dbg: FuzzyOcr: focr_counts_required => 2
[18573] dbg: FuzzyOcr: focr_db_hash => /etc/mail/spamassassin/FuzzyOcr.db
[18573] dbg: FuzzyOcr: focr_db_max_days => 21
[18573] dbg: FuzzyOcr: focr_db_safe => /etc/mail/spamassassin/FuzzyOcr.safe.db
[18573] dbg: FuzzyOcr: focr_digest_db => /etc/mail/spamassassin/FuzzyOcr.hashdb
[18573] dbg: FuzzyOcr: focr_enable_image_hashing => 0
[18573] dbg: FuzzyOcr: focr_global_timeout => 0
[18573] dbg: FuzzyOcr: focr_global_wordlist => /etc/mail/spamassassin/FuzzyOcr.words
[18573] dbg: FuzzyOcr: focr_hashing_learn_scanned => 1
[18573] dbg: FuzzyOcr: focr_keep_bad_images => 1
[18573] dbg: FuzzyOcr: focr_log_pmsinfo => 1
[18573] dbg: FuzzyOcr: focr_log_stderr => 1
[18573] dbg: FuzzyOcr: focr_logfile => /var/log/FuzzyOcr.log
[18573] dbg: FuzzyOcr: focr_max_height => 800
[18573] dbg: FuzzyOcr: focr_max_width => 800
[18573] dbg: FuzzyOcr: focr_min_height => 4
[18573] dbg: FuzzyOcr: focr_min_width => 4
[18573] dbg: FuzzyOcr: focr_minimal_scanset => 0
[18573] dbg: FuzzyOcr: focr_mysql_db => FuzzyOcr
[18573] dbg: FuzzyOcr: focr_mysql_hash => Hash
[18573] dbg: FuzzyOcr: focr_mysql_host => localhost
[18573] dbg: FuzzyOcr: focr_mysql_port => 3306
[18573] dbg: FuzzyOcr: focr_mysql_safe => Safe
[18573] dbg: FuzzyOcr: focr_mysql_update_hash => 0
[18573] dbg: FuzzyOcr: focr_mysql_user => fuzzyocr
[18573] dbg: FuzzyOcr: focr_no_homedirs => 1
[18573] dbg: FuzzyOcr: focr_path_bin => /usr/local/bin:/usr/bin
[18573] dbg: FuzzyOcr: focr_personal_wordlist => __userstate__/FuzzyOcr.words
[18573] dbg: FuzzyOcr: focr_preprocessor_file => /etc/mail/spamassassin/FuzzyOcr.preps
[18573] dbg: FuzzyOcr: focr_scanset_file => /etc/mail/spamassassin/FuzzyOcr.scansets
[18573] dbg: FuzzyOcr: focr_score_ham => 0
[18573] dbg: FuzzyOcr: focr_skip_bmp => 0
[18573] dbg: FuzzyOcr: focr_skip_gif => 0
[18573] dbg: FuzzyOcr: focr_skip_jpeg => 0
[18573] dbg: FuzzyOcr: focr_skip_png => 0
[18573] dbg: FuzzyOcr: focr_skip_tiff => 0
[18573] dbg: FuzzyOcr: focr_skip_updates => 0
[18573] dbg: FuzzyOcr: focr_strip_numbers => 1
[18573] dbg: FuzzyOcr: focr_threshold => 0.25
[18573] dbg: FuzzyOcr: focr_timeout => 15
[18573] dbg: FuzzyOcr: focr_twopass_scoring_factor => 1.5
[18573] dbg: FuzzyOcr: focr_unique_matches => 0
[18573] dbg: FuzzyOcr: focr_verbose => 2
[18573] dbg: FuzzyOcr: focr_wrongctype_score => 1.5
[18573] dbg: FuzzyOcr: focr_wrongext_score => 1.5
[18573] info: FuzzyOcr: Loaded preprocessor normalize: /usr/bin/pnmnorm
[18573] info: FuzzyOcr: Loaded preprocessor invert: /usr/bin/pnminvert
[18573] info: FuzzyOcr: Loaded preprocessor ppmtopgm: /usr/bin/ppmtopgm
[18573] info: FuzzyOcr: Loaded preprocessor pamtopnm: /usr/bin/pamtopnm
[18573] info: FuzzyOcr: Loaded preprocessor pamthreshold: /usr/bin/pamthreshold -simple -threshold 0.5
[18573] info: FuzzyOcr: Loaded preprocessor maketiff: pnmtotiff -color -truecolor
[18573] info: FuzzyOcr: Using scan ocrad: /usr/local/bin/ocrad -s5 $input
[18573] info: FuzzyOcr: Using scan ocrad-invert: /usr/local/bin/ocrad -s5 -i $input
[18573] info: FuzzyOcr: Using scan ocrad-decolorize-invert: /usr/local/bin/ocrad -s5 -i $input
[18573] info: FuzzyOcr: Using scan ocrad-decolorize: /usr/local/bin/ocrad -s5 $input
[18573] info: FuzzyOcr: Using scan gocr: /usr/bin/gocr -i $input
[18573] info: FuzzyOcr: Using scan gocr-180: /usr/bin/gocr -l 180 -d 2 -i $input
[18573] info: FuzzyOcr: Added <43> words from "/etc/mail/spamassassin/FuzzyOcr.words"
[18573] info: rules: meta test SARE_SPEC_PROLEO_M2a has dependency 'MIME_QP_LONG_LINE' with a zero score
[18573] info: rules: meta test SARE_HEAD_SUBJ_RAND has undefined dependency 'SARE_XMAIL_SUSP2'
[18573] info: rules: meta test SARE_HEAD_SUBJ_RAND has undefined dependency 'SARE_HEAD_XAUTH_WARN'
[18573] info: rules: meta test SARE_RD_SAFE has undefined dependency 'SARE_RD_SAFE_MKSHRT'
[18573] info: rules: meta test SARE_RD_SAFE has undefined dependency 'SARE_RD_SAFE_GT'
[18573] info: rules: meta test SARE_RD_SAFE has undefined dependency 'SARE_RD_SAFE_TINY'
[18573] info: rules: meta test ACKME_OBFURL1 has undefined dependency 'ACKME_OBFURL1c'
[18573] info: rules: meta test SARE_OBFU_CIALIS has undefined dependency 'SARE_OBFU_CIALIS2'
[18573] dbg: FuzzyOcr: Starting FuzzyOcr...
[18573] info: FuzzyOcr: Processing Message with ID "<no messageid>" (<no sender> -> <no receipients>)
[18573] dbg: FuzzyOcr: Skipping OCR, no image files found...
[18573] dbg: FuzzyOcr: Processed in 0.000339 sec.
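The "<no messageid>" and "<no sender>" entries above suggest the file handed to spamassassin may carry no headers at all. That would fit a sendmail df* file, which, as far as I understand, holds only the message body while the headers live in the matching qf* file. A minimal sketch for checking this on any given file, using Python's standard email package; the helper name describe_message is made up for illustration and is not part of FuzzyOcr, SpamAssassin or MailScanner:

```python
import email


def describe_message(raw_bytes):
    """Summarize a raw message: (has_headers, content_type, image_part_count).

    Hypothetical helper for illustration only. It parses the bytes the same
    way for a complete message and for a body-only file, so the two results
    can be compared side by side.
    """
    msg = email.message_from_bytes(raw_bytes)

    # A body-only file has no header block, so the parser finds no header names.
    has_headers = len(msg.keys()) > 0

    # Walk every MIME part and count those whose main type is "image".
    image_parts = [
        part for part in msg.walk()
        if part.get_content_maintype() == "image"
    ]

    # Without a Content-Type header the parser falls back to text/plain.
    return has_headers, msg.get_content_type(), len(image_parts)
```

Calling describe_message(open(path, "rb").read()) on a plain email file and on the queue file should show whether the top-level Content-Type header, and with it the MIME structure that image detection relies on, is missing from the latter.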
Thanks!