Page 1 of 4

OCR for Neobook using Tesseract Engine

PostPosted: Sat Apr 28, 2012 5:45 am
by TechMediaPlugins2010
Tesseract Engine is a powerful OCR engine (Open Source). I am building an app using it. Normally, I will NOT transform it in a plugin, but I can do it, but i need 6 sponsors to donate US$ 20 each to make it available for donators, otherwise i will not make it. It wil lbe NOT in my regular line of products. If you want to have this, please donate US$ 22 to ameyer@redemed.net until 29 April. If I cant get the 6 people, donators will be fully refunded.

Text OCR returned in Text (string) and HTML

Cheers

PostPosted: Sat Apr 28, 2012 5:52 am
by datadon
What type of documents does it use for the source files? PDF, JPG etc.

PostPosted: Sat Apr 28, 2012 7:19 am
by TechMediaPlugins2010
Bitmap (bmp, jpeg)

PostPosted: Sat Apr 28, 2012 11:15 am
by domino
I'm interested... payment sent

Cheers
Dave

PostPosted: Sat Apr 28, 2012 11:23 am
by datadon
What do the investors get for the $20?

PostPosted: Sat Apr 28, 2012 11:29 am
by TechMediaPlugins2010
received one.

Donaters will received a plugin that wrappers tesseract DLL, wich performs OPCR in Neobook. You just need to deply tesseract.dll along with pub

PostPosted: Sat Apr 28, 2012 1:40 pm
by TechMediaPlugins2010
received two

PostPosted: Sat Apr 28, 2012 1:56 pm
by andybrock
Assuming you get all donators... when will the plugin be finished? I don't mind joining in but if its an indefinate length of time like superwebbrowser then I'd probably pass.

Also would lilke to know same as Don about the input types.

Andy

PostPosted: Sat Apr 28, 2012 2:19 pm
by TechMediaPlugins2010
May 2 plugin done.

SuperWebBrowser is a VERY complex plugins and demands a LOT of code, specially if you need to deal with Windows 8, etc. I try to code my plugins to be useful for many years. WebBrowser have more than 300 functions.

You load a BMP or JPEG image, choose the language, and HIT recognize. Text or HTML text is returned. Of course, the reliablity of thee OCR is based on TESSERACT, but it is a hell of good OCR.

In my opinion, 20 bucks is a lot of CHEAP for that functionabily.

PostPosted: Sun Apr 29, 2012 4:51 am
by andybrock
I'm in.

Andy

PostPosted: Sun Apr 29, 2012 5:07 am
by TechMediaPlugins2010
3 receveid

PostPosted: Mon Apr 30, 2012 4:13 am
by TechMediaPlugins2010
I will wait 2 more days. Only 3 received.

PostPosted: Mon Apr 30, 2012 4:29 am
by domino
I will wait 2 more days.


Thanks... :D

PostPosted: Mon Apr 30, 2012 7:35 am
by datadon
I hate to plead (well not really), but this is a deal we don't want to miss out on. $20 for an OCR that you can stick in the middle of a Neobook app is just too good to miss out on. Alberto is already up to speed on it and is ready to go right away. Twenty bucks is a big box of popcorn at the movies...right. Your arteries certainly don't need that.

Just think, you can now create an app that will allow you to grab an image from the web that has just the text you want in it (capture plugin from GK) crop it up and feed in into the OCR. Bingo fresh real text. Or one of the PDF docs that you can't grab text from, just capture it and process the image.

Or open up a locked out Excel file and grab the data via an image of it.

All useful functions.

So, come on, preserve our future. Chip in and reap the benefits. The price might go up after the initial 6 so..... don't hesitate.

Your friendly encourager.

PostPosted: Mon Apr 30, 2012 8:07 am
by TechMediaPlugins2010
Don, and the Tesseract Engine is very good, with LOTS of dictionaries. Another useful thing is a way to capture the entire screen and make Neobook speech it, with Speech API. There's a lot of things to do with it.

Cheers


PS: i went to watch "Avengers, the Movie"yesterday and spent a lot more then US$ 20 on Popcorn. :)