Simpler Navigation for Servers and Operating Systems - Please Update Your Bookmarks
Completed: a much simpler Servers and Operating Systems section of the Community. We combined many of the older boards, so you won't have to click through so many levels to get at the information you need. Check the consolidated boards here as many sub-forums are now single boards.
If you have bookmarked forums or discussion boards in Servers and Operating Systems, we suggest you check and update them as needed.
General
cancel
Showing results for 
Search instead for 
Did you mean: 

Optical Characters recognition in image files

Ph Vouters
Valued Contributor

Optical Characters recognition in image files

Dear reader,

 

I did not find any suitable ITRC Linux forum, hence me posting this here.

 

If you are interested with text recognition in image files such as produced by scanners, here are some results I acheived on Linux and that I documented at http://vouters.dyndns.org/tima/Linux-Java-Tesseract-ocr-Porting_Tesseract_from_android_Graphics_to_Sun_Graphics.html

 

I am currently working on porting one of the software (Tesseract OCR) onto Windows. So the subject is being worked upon. This subject interests many national libraries to turn their richnesses currently paper printed into text documents accessible from anywhere over the world via Internet.

 

It happens the Tesseract OCR software which would currently be the most advanced software has initially been developed by HP Labs, then offered by HP to the Opensource world. Google is as of today sponsoring this Opensource code development. This Google sponsorship has to be connected with Google's project to digitalize all paper printed documents.

 

Best regards to everyone,

Philippe Vouters