Ticket #201 (new defect)

Opened 1 year ago

Last modified 1 year ago

Problem with digits in the words

Reported by: tempread@ukr.net
Assigned to: decoder
Priority: major
Milestone: Development Release Version 3.4
Component: Image Analysis
Version: 3.4
Keywords:
Cc:

Description

Using the FuzzyOcr plugin, version 3.4.

I am trying to detect phone numbers with FuzzyOcr. I added some numbers to the wordlist, but the log contains lines like these:

2007-02-28 17:35:32 [67664] Found word "" in line

"" with fuzz of 0.0000 scanned with scanset $gocr -i $pfile

2007-02-28 17:35:32 [67664] Found word "" in line

"" with fuzz of 0.0000 scanned with scanset $gocr -i $pfile

2007-02-28 17:35:32 [67664] Found word "" in line

"" with fuzz of 0.0000 scanned with scanset $gocr -i $pfile

2007-02-28 17:35:32 [67664] Found word "" in line

"" with fuzz of 0.0000 scanned with scanset $gocr -i $pfile

2007-02-28 17:35:32 [67664] Found word "" in line

"" with fuzz of 0.0000 scanned with scanset $gocr -i $pfile

The problem is in the following lines:

line 1137: $w =~ s/[^a-z]//g;

and line 1144: s/[^a-zA-Z]//g;

Why is the wordlist restricted to lowercase alphabetic characters (line 1137)?

To get phone number detection working, I commented out line 1137: $w =~ s/[^a-z]//g;

and changed line 1144 from s/[^a-zA-Z]//g; to s/[^a-zA-Z0-9]//g; (though I think this line could be commented out as well).
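
The effect described above can be reproduced outside the plugin with a small sketch. It assumes the two substitutions use negated character classes (`[^a-z]` and `[^a-z0-9]`), which is what the reported behavior implies. The helper names `normalize_letters_only` and `normalize_with_digits` are hypothetical and do not come from FuzzyOcr.pm.

```perl
use strict;
use warnings;

# Hypothetical helpers for illustration only; they are NOT taken from
# FuzzyOcr.pm and merely mimic the two substitutions under discussion.

# Assumed current behavior: delete every character that is not a
# lowercase ASCII letter.
sub normalize_letters_only {
    my ($w) = @_;
    $w =~ s/[^a-z]//g;
    return $w;
}

# Proposed behavior: keep digits in addition to lowercase letters.
sub normalize_with_digits {
    my ($w) = @_;
    $w =~ s/[^a-z0-9]//g;
    return $w;
}

# Compare both normalizations on a plain word and on digit-bearing entries.
for my $entry ('viagra', '555-0100', 'call 555 0100') {
    printf "%-15s letters-only=\"%s\" with-digits=\"%s\"\n",
        $entry,
        normalize_letters_only($entry),
        normalize_with_digits($entry);
}
```

Under these assumptions a purely numeric wordlist entry is reduced to an empty string by the letters-only substitution, which would be consistent with the empty `Found word ""` entries in the log, while the digit-preserving variant leaves the number intact.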

Attachments

Change History

28.02.2007 18:33:16 changed by tempread@ukr.net

(I mean FuzzyOcr.pm)

15.08.2007 02:25:59 changed by anonymous

