Thread: extracting text from jpeg
- 10-05-2008, 11:40 PM #1
Has anyone seen any libraries for pulling text from either .jpeg files or standard office document files? My goal is strictly to rid the datastream of all formatting and get anything that is truly plain text, meaning something I could load with either of these (bytes is a placeholder for the raw byte[] read from the source):
import java.nio.charset.StandardCharsets;
String dataUtf8 = new String(bytes, StandardCharsets.UTF_8);
String dataUtf16 = new String(bytes, StandardCharsets.UTF_16);
The text can be Unihan, Unicode, 7- or 8-bit ASCII, or whatever, as long as it has none of the useless ancillary formatting codes that are endemic to office software. The goal here is to eliminate manual data entry from a datastore that is currently conveyed in a format that has proven unreliable, in an industry where reliability is hyper-critical further down the datastream.
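To make the goal concrete, here is a minimal sketch of the "keep only the text" step, assuming the bytes have already been pulled out of the source file somehow. PlainText, fromBytes and stripControls are names made up for illustration, not part of any library. It only decodes the bytes and drops control characters; it does not parse office formats or read text out of an image.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not from any library: decodes raw bytes and drops
// control characters so only printable text and basic whitespace remain.
final class PlainText {
    private PlainText() {}

    // Decode the bytes with the given charset, then strip control codes.
    static String fromBytes(byte[] bytes, Charset charset) {
        return stripControls(new String(bytes, charset));
    }

    // Keep every code point except ISO control characters; tab, line feed
    // and carriage return are kept because they are ordinary whitespace.
    static String stripControls(String text) {
        StringBuilder out = new StringBuilder(text.length());
        text.codePoints()
            .filter(cp -> cp == '\t' || cp == '\n' || cp == '\r'
                    || !Character.isISOControl(cp))
            .forEach(out::appendCodePoint);
        return out.toString();
    }
}
```

Calling PlainText.fromBytes(bytes, StandardCharsets.UTF_8) or the same with StandardCharsets.UTF_16 would cover the two encodings mentioned above. Anything beyond control characters, such as markup or binary formatting records, would still need a real parser for the format in question.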
One approach: I can take a JPEG of each 8½ x 11 inch page.
Other approach: call a driver for shrink-wrapped software.
A better solution would be established libraries with a proven track record of pulling the text from contemporary ..... I tried digging into OpenOffice, but that used up thirty or forty hours of research budget with nothing to show for it.
Another idea I had was to look into OpenGL, but I have never done any work with that tool and would need to know where to start.