Ticket #387 (new defect)

Opened 1 year ago

Last modified 5 months ago

FuzzyOcr doesn't see the embedded image

Reported by: UdeS Assigned to: decoder
Priority: major Milestone:
Component: Don't know Version: 3.5.1
Keywords: Cc:

Description

Hello,

I installed FuzzyOcr? 3.5.1 on a RHEL5 server and it works on plain email files but not on sendmail's queue files. Problem is I call it through SA that is called by MailScanner? and it looks like it is receiving the email in the same format because it doesn't detect any image most of the time.

Here is the output of my debug run. Should I be concerned by the first 3 warnings/errors?

spamassassin --debug FuzzyOcr < dfl3QDoDBn018376 
Subroutine FuzzyOcr::O_CREAT redefined at /usr/lib/perl5/5.8.8/Exporter.pm line 65.
 at /usr/lib/perl5/5.8.8/i386-linux-thread-multi/POSIX.pm line 19
Subroutine FuzzyOcr::O_EXCL redefined at /usr/lib/perl5/5.8.8/Exporter.pm line 65.
 at /usr/lib/perl5/5.8.8/i386-linux-thread-multi/POSIX.pm line 19
Subroutine FuzzyOcr::O_RDWR redefined at /usr/lib/perl5/5.8.8/Exporter.pm line 65.
 at /usr/lib/perl5/5.8.8/i386-linux-thread-multi/POSIX.pm line 19
[18573] dbg: FuzzyOcr: focr_bin_helper: 'pnmnorm,pnminvert,pamthreshold,ppmtopgm,pamtopnm'
[18573] info: FuzzyOcr: Adding <5> new helper apps
[18573] info: FuzzyOcr: Starting preprocessor parser for file "/etc/mail/spamassassin/FuzzyOcr.preps"...
[18573] dbg: FuzzyOcr: line: preprocessor normalize {
[18573] dbg: FuzzyOcr: line: command = pnmnorm
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor invert {
[18573] dbg: FuzzyOcr: line: command = pnminvert
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor ppmtopgm {
[18573] dbg: FuzzyOcr: line: command = ppmtopgm
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor pamtopnm {
[18573] dbg: FuzzyOcr: line: command = pamtopnm
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor pamthreshold {
[18573] dbg: FuzzyOcr: line: command = pamthreshold
[18573] dbg: FuzzyOcr: line: args = -simple -threshold 0.5
[18573] dbg: FuzzyOcr: line: }
[18573] dbg: FuzzyOcr: line: preprocessor maketiff {
[18573] dbg: FuzzyOcr: line: command = pnmtotiff
[18573] dbg: FuzzyOcr: line: args = -color -truecolor
[18573] dbg: FuzzyOcr: line: }
[18573] info: FuzzyOcr: Starting scanset parser for file "/etc/mail/spamassassin/FuzzyOcr.scansets"...
[18573] dbg: FuzzyOcr: line scanset ocrad {
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset ocrad-invert {
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 -i $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset ocrad-decolorize-invert {
[18573] dbg: FuzzyOcr: line preprocessors = ppmtopgm, pamthreshold, pamtopnm
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 -i $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset ocrad-decolorize {
[18573] dbg: FuzzyOcr: line preprocessors = ppmtopgm, pamthreshold, pamtopnm
[18573] dbg: FuzzyOcr: line command = $ocrad
[18573] dbg: FuzzyOcr: line args = -s5 $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset gocr {
[18573] dbg: FuzzyOcr: line command = $gocr
[18573] dbg: FuzzyOcr: line args = -i $input
[18573] dbg: FuzzyOcr: line }
[18573] dbg: FuzzyOcr: line scanset gocr-180 {
[18573] dbg: FuzzyOcr: line command = $gocr
[18573] dbg: FuzzyOcr: line args = -l 180 -d 2 -i $input
[18573] dbg: FuzzyOcr: line }
[18573] info: FuzzyOcr: Searching in: /usr/local/bin
[18573] info: FuzzyOcr: Searching in: /usr/bin
[18573] info: FuzzyOcr: Using gifsicle => /usr/bin/gifsicle
[18573] info: FuzzyOcr: Using giffix => /usr/bin/giffix
[18573] info: FuzzyOcr: Using giftext => /usr/bin/giftext
[18573] info: FuzzyOcr: Using gifinter => /usr/bin/gifinter
[18573] info: FuzzyOcr: Using giftopnm => /usr/bin/giftopnm
[18573] info: FuzzyOcr: Using jpegtopnm => /usr/bin/jpegtopnm
[18573] info: FuzzyOcr: Using pngtopnm => /usr/bin/pngtopnm
[18573] info: FuzzyOcr: Using bmptopnm => /usr/bin/bmptopnm
[18573] info: FuzzyOcr: Using tifftopnm => /usr/bin/tifftopnm
[18573] info: FuzzyOcr: Using ppmhist => /usr/bin/ppmhist
[18573] info: FuzzyOcr: Using pamfile => /usr/bin/pamfile
[18573] info: FuzzyOcr: Using ocrad => /usr/local/bin/ocrad
[18573] info: FuzzyOcr: Using gocr => /usr/bin/gocr
[18573] info: FuzzyOcr: Using pnmnorm => /usr/bin/pnmnorm
[18573] info: FuzzyOcr: Using pnminvert => /usr/bin/pnminvert
[18573] info: FuzzyOcr: Using pamthreshold => /usr/bin/pamthreshold
[18573] info: FuzzyOcr: Using ppmtopgm => /usr/bin/ppmtopgm
[18573] info: FuzzyOcr: Using pamtopnm => /usr/bin/pamtopnm
[18573] dbg: FuzzyOcr: Threshold[max_hash] => 5
[18573] dbg: FuzzyOcr: Threshold[c] => 5
[18573] dbg: FuzzyOcr: Threshold[s] => 0.01
[18573] dbg: FuzzyOcr: Threshold[w] => 0.01
[18573] dbg: FuzzyOcr: Threshold[h] => 0.01
[18573] dbg: FuzzyOcr: Threshold[cn] => 0.01
[18573] dbg: FuzzyOcr: focr_add_score => 1
[18573] dbg: FuzzyOcr: focr_autodisable_negative_score => -10
[18573] dbg: FuzzyOcr: focr_autodisable_score => 1000
[18573] dbg: FuzzyOcr: focr_autosort_buffer => 10
[18573] dbg: FuzzyOcr: focr_autosort_scanset => 1
[18573] dbg: FuzzyOcr: focr_base_score => 5
[18573] dbg: FuzzyOcr: focr_corrupt_score => 2.5
[18573] dbg: FuzzyOcr: focr_corrupt_unfixable_score => 0.5
[18573] dbg: FuzzyOcr: focr_counts_required => 2
[18573] dbg: FuzzyOcr: focr_db_hash => /etc/mail/spamassassin/FuzzyOcr.db
[18573] dbg: FuzzyOcr: focr_db_max_days => 21
[18573] dbg: FuzzyOcr: focr_db_safe => /etc/mail/spamassassin/FuzzyOcr.safe.db
[18573] dbg: FuzzyOcr: focr_digest_db => /etc/mail/spamassassin/FuzzyOcr.hashdb
[18573] dbg: FuzzyOcr: focr_enable_image_hashing => 0
[18573] dbg: FuzzyOcr: focr_global_timeout => 0
[18573] dbg: FuzzyOcr: focr_global_wordlist => /etc/mail/spamassassin/FuzzyOcr.words
[18573] dbg: FuzzyOcr: focr_hashing_learn_scanned => 1
[18573] dbg: FuzzyOcr: focr_keep_bad_images => 1
[18573] dbg: FuzzyOcr: focr_log_pmsinfo => 1
[18573] dbg: FuzzyOcr: focr_log_stderr => 1
[18573] dbg: FuzzyOcr: focr_logfile => /var/log/FuzzyOcr.log
[18573] dbg: FuzzyOcr: focr_max_height => 800
[18573] dbg: FuzzyOcr: focr_max_width => 800
[18573] dbg: FuzzyOcr: focr_min_height => 4
[18573] dbg: FuzzyOcr: focr_min_width => 4
[18573] dbg: FuzzyOcr: focr_minimal_scanset => 0
[18573] dbg: FuzzyOcr: focr_mysql_db => FuzzyOcr
[18573] dbg: FuzzyOcr: focr_mysql_hash => Hash
[18573] dbg: FuzzyOcr: focr_mysql_host => localhost
[18573] dbg: FuzzyOcr: focr_mysql_port => 3306
[18573] dbg: FuzzyOcr: focr_mysql_safe => Safe
[18573] dbg: FuzzyOcr: focr_mysql_update_hash => 0
[18573] dbg: FuzzyOcr: focr_mysql_user => fuzzyocr
[18573] dbg: FuzzyOcr: focr_no_homedirs => 1
[18573] dbg: FuzzyOcr: focr_path_bin => /usr/local/bin:/usr/bin
[18573] dbg: FuzzyOcr: focr_personal_wordlist => __userstate__/FuzzyOcr.words
[18573] dbg: FuzzyOcr: focr_preprocessor_file => /etc/mail/spamassassin/FuzzyOcr.preps
[18573] dbg: FuzzyOcr: focr_scanset_file => /etc/mail/spamassassin/FuzzyOcr.scansets
[18573] dbg: FuzzyOcr: focr_score_ham => 0
[18573] dbg: FuzzyOcr: focr_skip_bmp => 0
[18573] dbg: FuzzyOcr: focr_skip_gif => 0
[18573] dbg: FuzzyOcr: focr_skip_jpeg => 0
[18573] dbg: FuzzyOcr: focr_skip_png => 0
[18573] dbg: FuzzyOcr: focr_skip_tiff => 0
[18573] dbg: FuzzyOcr: focr_skip_updates => 0
[18573] dbg: FuzzyOcr: focr_strip_numbers => 1
[18573] dbg: FuzzyOcr: focr_threshold => 0.25
[18573] dbg: FuzzyOcr: focr_timeout => 15
[18573] dbg: FuzzyOcr: focr_twopass_scoring_factor => 1.5
[18573] dbg: FuzzyOcr: focr_unique_matches => 0
[18573] dbg: FuzzyOcr: focr_verbose => 2
[18573] dbg: FuzzyOcr: focr_wrongctype_score => 1.5
[18573] dbg: FuzzyOcr: focr_wrongext_score => 1.5
[18573] info: FuzzyOcr: Loaded preprocessor normalize: /usr/bin/pnmnorm
[18573] info: FuzzyOcr: Loaded preprocessor invert: /usr/bin/pnminvert
[18573] info: FuzzyOcr: Loaded preprocessor ppmtopgm: /usr/bin/ppmtopgm
[18573] info: FuzzyOcr: Loaded preprocessor pamtopnm: /usr/bin/pamtopnm
[18573] info: FuzzyOcr: Loaded preprocessor pamthreshold: /usr/bin/pamthreshold -simple -threshold 0.5
[18573] info: FuzzyOcr: Loaded preprocessor maketiff: pnmtotiff -color -truecolor
[18573] info: FuzzyOcr: Using scan ocrad: /usr/local/bin/ocrad -s5 $input
[18573] info: FuzzyOcr: Using scan ocrad-invert: /usr/local/bin/ocrad -s5 -i $input
[18573] info: FuzzyOcr: Using scan ocrad-decolorize-invert: /usr/local/bin/ocrad -s5 -i $input
[18573] info: FuzzyOcr: Using scan ocrad-decolorize: /usr/local/bin/ocrad -s5 $input
[18573] info: FuzzyOcr: Using scan gocr: /usr/bin/gocr -i $input
[18573] info: FuzzyOcr: Using scan gocr-180: /usr/bin/gocr -l 180 -d 2 -i $input
[18573] info: FuzzyOcr: Added <43> words from "/etc/mail/spamassassin/FuzzyOcr.words"
[18573] info: rules: meta test SARE_SPEC_PROLEO_M2a has dependency 'MIME_QP_LONG_LINE' with a zero score
[18573] info: rules: meta test SARE_HEAD_SUBJ_RAND has undefined dependency 'SARE_XMAIL_SUSP2'
[18573] info: rules: meta test SARE_HEAD_SUBJ_RAND has undefined dependency 'SARE_HEAD_XAUTH_WARN'
[18573] info: rules: meta test SARE_RD_SAFE has undefined dependency 'SARE_RD_SAFE_MKSHRT'
[18573] info: rules: meta test SARE_RD_SAFE has undefined dependency 'SARE_RD_SAFE_GT'
[18573] info: rules: meta test SARE_RD_SAFE has undefined dependency 'SARE_RD_SAFE_TINY'
[18573] info: rules: meta test ACKME_OBFURL1 has undefined dependency 'ACKME_OBFURL1c'
[18573] info: rules: meta test SARE_OBFU_CIALIS has undefined dependency 'SARE_OBFU_CIALIS2'
[18573] dbg: FuzzyOcr: Starting FuzzyOcr...
[18573] info: FuzzyOcr: Processing Message with ID "<no messageid>" (<no sender> -> <no receipients>)
[18573] dbg: FuzzyOcr: Skipping OCR, no image files found...
[18573] dbg: FuzzyOcr: Processed in 0.000339 sec.

Thanks!

Attachments

Change History

01.06.2007 20:13:47 changed by UdeS

I just upgraded to SA 3.2.0 and it is now working!!!

07.07.2007 03:41:05 changed by anonymous

15.08.2007 02:26:03 changed by anonymous

01.04.2008 09:26:47 changed by anonymous

徐州辉煌钢结构工程有限公司是一家集网架钢结构设计、制作、安装及技术服务为一体的大型专业化企业。公司坐落于有网架之乡美誉的江苏省徐州市,这里是全国优质网架原材料供应基地,也是全国网架技术熟练工人培训基地,有着人才,技术和原材料的地域优势。企业创办多年来,本着"求实创新、开拓进取"的精神,不断引进吸收国内外先进技术经验,汇集来自全国各地从事专业管理,专业设计、制造、检测试验等高级优秀人才,配置了各种先进的成套生产和检测设备,能满足制造生产能力要求的流水生产线。公司始终本着“诚信为本、信守合同、用户至上”的理念,坚持贯彻实践三个“第一”——质量第一、信誉第一,服务第一,企业不断深化改革,深挖潜力,降低成本,以最低的价格吸引客户,以最好的质量服务客户,让辉煌网架钢构建设遍布全国各地,多年来深受广大客户及建设单位的一致好评。   竭诚欢迎各界新老朋友真诚合作、共创辉煌、共享绩效,公司将一如既往地为各界朋友提供优秀的服务!

09.04.2008 08:37:20 changed by anonymous

电子地磅解码器,吨位遥控器/本吨位遥控器引进日本先进技术研制而成,完全采用数字式集成电路技术,采用万能解码数据处理线路,适用于10----150吨以下吨位,无须对地磅作任何改动 具有防拦截,防扫描等优点,解码器安装于车上或离地磅8米以内,在电子称旁40米或60米以内,能控制电子称的数码数据,最小值20公斤,规格10 /15/20,此产品主要产生电子磁场干扰和控制,从而使吨位变大或变小,性能稳定可靠,体积小,遥控主机 解码处理器 如烟盒大小,遥控器配两种型号,隐蔽性强,附件含使用光盘一套,


Add/Change #387 (FuzzyOcr doesn't see the embedded image)