João Machete

EnaOCR – Open Source

by admin on Mar.25, 2010, under Research and Development

This is my first contribution to the world of open source. I started developing this project at university, and I will continue to work on it until it is stable enough to run on any operating system. For now it only runs on Windows and Linux. OS X is the next step, and I hope it will be available soon.

What is EnaOCR?

EnaOCR is an Optical Character Recognition (OCR) desktop application that reads multiple image files and converts them to ASCII text or XML.

With this application it is easy to deploy data to any external database in a short period of time. A single click processes hundreds of images in seconds: it is very fast and detects words with strong accuracy.

This application was written in Java with the help of the Tesseract engine.
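To give an idea of what batch OCR from Java can look like, here is a minimal sketch. It is not the EnaOCR source: it assumes the `tesseract` command-line program is installed and on the PATH, and it simply runs `tesseract <image> <outputbase> -l <lang>` once per image in a folder. The class name `BatchOcrSketch` and its methods are hypothetical names chosen for this example, and `por` is the Tesseract language code for Portuguese.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not taken from EnaOCR: batch OCR by calling the tesseract CLI.
class BatchOcrSketch {

    // Builds the command line for one image: tesseract <image> <outputBase> -l <lang>
    static List<String> buildCommand(Path image, Path outputBase, String lang) {
        List<String> cmd = new ArrayList<>();
        cmd.add("tesseract");
        cmd.add(image.toString());
        cmd.add(outputBase.toString()); // tesseract appends ".txt" itself
        cmd.add("-l");
        cmd.add(lang);
        return cmd;
    }

    // Removes the last extension, e.g. "page.01.tif" becomes "page.01"
    static String baseName(String fileName) {
        int dot = fileName.lastIndexOf('.');
        return dot > 0 ? fileName.substring(0, dot) : fileName;
    }

    // Runs tesseract on every matching image in inputDir, writing text files to outputDir
    static void ocrDirectory(Path inputDir, Path outputDir, String lang)
            throws IOException, InterruptedException {
        Files.createDirectories(outputDir);
        try (DirectoryStream<Path> images =
                 Files.newDirectoryStream(inputDir, "*.{png,tif,tiff,jpg}")) {
            for (Path image : images) {
                Path outputBase = outputDir.resolve(baseName(image.getFileName().toString()));
                Process process = new ProcessBuilder(buildCommand(image, outputBase, lang))
                        .inheritIO()
                        .start();
                int exitCode = process.waitFor();
                if (exitCode != 0) {
                    System.err.println("tesseract failed on " + image + " (exit code " + exitCode + ")");
                }
            }
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.err.println("usage: BatchOcrSketch <inputDir> <outputDir>");
            return;
        }
        ocrDirectory(Paths.get(args[0]), Paths.get(args[1]), "por");
    }
}
```

Images are processed one after another here to keep the sketch short. This version only produces plain text files; XML output would need an extra conversion step that is not shown.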

The project was developed at the Polytechnic Institute of Setúbal, Superior School of Technology, Portugal.

At the moment this project is available only in Portuguese, but it will soon be translated into other languages such as English and Spanish; it is just a matter of free time.

All the information and source code are available at http://code.google.com/p/enaocr/. The source is protected by the GNU LGPL license. If you want to reuse any part of the code in another project, let me know first.

If you find this project interesting and want to join the team, send me an email. I will be very happy to answer.

Here are some screenshots:

[Screenshots: image003, image005, image010, image014, image016]

2 comments for this entry:
  1. Nina

    I haven’t tried this before. I learn a lot about new OCR software at http://www.ocrworld.com but most of them are quite expensive.

  2. admin

    Hi Nina, ABBYY and Irislink are the companies behind the OCRWorld website. They are famous for being expensive, as you said, and they have a big problem: their software is not as accurate as the Tesseract engine (search the web and you will see the benchmarks). They also do not let you recognize multiple images at once, or save the layout of different images inside the program for faster use. :)
