EnaOCR – Open Source
by admin on Mar.25, 2010, under Research and Development
This is my first contribution to the world of open source. I started developing this project in the university, and I will continue to work on it until it is stable enough to run on any operating system. For now only runs in Windows and Linux. OSX is the next step, I hope this can be available soon.
What is EnaOCR?
EnaOCR is a Optical Character Recognition desktop application, that allows you to read multiple images files and convert them to ASCII text or XML.
With this application is easy to deploy any data to any external database in a short period of time. You just need one click to process hundreds of images in seconds, is very fast and with strong accuracy in the detection of words
This application was written in Java with the help of Tesseract engine.
The project was developed in Polytechnic Institute of Setúbal, Superior School of Technology, Portugal.
At the moment this project are just in Portuguese, but shortly it will be translated to another languages like English and Spanish, is just a matter of free time.

All the information and source are available in http://code.google.com/p/enaocr/. The source is protected through the GNU LGP license. If you want to reuse any part of the code in other project, let me know first.
If you feel this project interesting and want to join the team, send me an email, I will be very happy to answer.
Here are some printscreens:
March 26th, 2010 on 08:49
I haven’t tried this before. I learn a lot about new OCR software at http://www.ocrworld.com but most of them are quiet expensive.
March 26th, 2010 on 12:09
I Nina, ABBYY and Irislink are the companies behind that website OCRWorld, they are famous for being expensive like you said, and they have a big problem, that software are not as accurate when compared with the Tesseract engine, search on the web and you will see the benchmarks, and they not allow you to recognize multiple images at once, neither save the layout of different images inside the program for a more fast usability.