Re: [Bug-ocrad] Technical documentation summary readme.txt, page skew, H

bug-ocrad

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-ocrad] Technical documentation summary readme.txt, page skew, H

From:	Antonio Diaz Diaz
Subject:	Re: [Bug-ocrad] Technical documentation summary readme.txt, page skew, Hough transform.
Date:	Sat, 04 Mar 2006 15:21:04 +0100
User-agent:	Mozilla/5.0 (X11; U; Linux i586; en-US; rv:1.7.12) Gecko/20050923

Chris K. Skinner wrote:

As you are probably aware, there are software patents in some countries.


Yes, in almost every country with a corrupt and/or fascist government.

If you had some kind of outline of the algorithms that were applied perversion of the software that would greatly help someone new coming infresh off the street to gain a quicker understanding of stuff ingeneral, and probably demonstrate to the world at large that you haveinvented something new that could not be patented / stolen / claimed bysome greedy corporate dudes.

I sympathize with your idea, but I lack the time and the ability toexplain the algorithms I use or invent. On the other hand, a patent isvalid even if I invented it independently, so it won't be an effectivedefense.

Do you have any design notes, bibliographic citations, web links toinformation that you've made use of , release notes for what algorithmsare being used / abandoned.

The short answer is no. I have looked into the source of gocr andclaraocr, but I haven't got anything from them. I use the Otsu algorithmfor binarization (as gocr does). Apart from this, I work mostly in a vacuum.

In the J. R. Parker book w/CD ROM "Algorithms For Image Processing AndComputer Vision" that I have read, the author provides a couple ofalgorithm suggestions for combating the page skew angle issue. AHough-transform when applied to the dots of the bottoms of the boundingboxes of glyphs results in a page skew angle in degrees (with his sourcecode, that is).


This has a number of problems:

- Hough-transform, and in general any transformation on the whole image,is slow as hell.

- What if the line is not skewed but curved? (frequent in scanned books).
- "The bottoms of the bounding boxes of glyphs" are usually not aligned.
- etc...

This is why I expect working code, not suggestions, from possiblecollaborators. (Show me the code, you know?) ;-)


By the way, ocrad's algorithms are designed to be resistant to page skew.

Another approach is to use angle-independent Complex-Number-CoefficientNeural Networks to use as feature recognizers. The Japanese promoter ofthese neural networks says that they are Affine-Transform insensitive,and thereby can recognize a pattern that has been so transformed.

I would like to see this recognizing a page in less than five minuteswith good accuracy.

This too is just a theory. I don't have a copy of any books onComplex-Number-Coefficient Neural Networks, or any source code from acompetent mathematician who has converted the advanced mathematics intoworking C++ code examples.

Don't worry. "Advanced mathematics" are the sofware of the future... andthey will always be. ;-)



Regards,
Antonio.

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-ocrad] Technical documentation summary readme.txt, page skew, Hough transform., Chris K. Skinner, 2006/03/02
- Re: [Bug-ocrad] Technical documentation summary readme.txt, page skew, Hough transform., Antonio Diaz Diaz <=

Prev by Date: [Bug-ocrad] Question on new rotate option
Next by Date: Re: [Bug-ocrad] Question on new rotate option
Previous by thread: [Bug-ocrad] Technical documentation summary readme.txt, page skew, Hough transform.
Next by thread: [Bug-ocrad] Question on new rotate option
Index(es):
- Date
- Thread