[KLUG Members] human vs bot

Bert Bbbink kalamazoo at dse.nl
Tue Apr 12 06:08:19 EDT 2005


> Does anybody know why these things are so hard for an OCR program to
> handle?  I know the human eye has a much easier time with recognizing
> characters than a computer does, but with all the hardware we can throw
> at a problem, do a few lines scratched through the figure really confuse
> a computer that much?  I'm just imagining the screaming that will occur
> when somebody figures out a simple algorithm to strip out the lines.
>
>
you can raed this lien of text vrey well I tink. Humans read sentences by
recognizing the words mainly by the fisrt and lsat character.

Bots need to "read" every character seperatly. Most ocr (as far as I know)
find letter by trying to lay them on numerious images of characters. The
best fit is the letter typed. If you have lines or characters in different
shapes and sizes, the search for a character is taking up much more time
then when the bot "learns" which font is being used. The chance to get it
wrongs also increases. The bot is not capable of "finding" a character
shape because of the line. Bots using mathematical character definitions
(lines, curves,angles) also a mislead by the lines. Everytime the bost
needs to search where the lines stop and the character begin. That's hard
work for a "system" that does not reocginze words easy, as humans do.

Bert.



More information about the Members mailing list