- 03-04-2011, 07:12 AM #1
PDF extraction issue with Apache PDFBox 1.3.1
Hi All,
I am facing an issue while extracting text from PDFs using Apache PDFBox.
With PDFBox version 1.1, I was able to extract the data properly, but the same code gives different output with version 1.3.1. The issue occurs only for a few PDFs.
Code sample
------------
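import java.io.File;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.util.PDFTextStripper;  // package location in PDFBox 1.x
PDDocument document = PDDocument.load(new File("sample.pdf"));
PDFTextStripper stripper = new PDFTextStripper();
stripper.setSortByPosition(true);  // order text by its position on the page
System.out.println(stripper.getText(document));
document.close();  // release the underlying file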
I have attached the outputs from both versions. I would appreciate any help in resolving this.
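To narrow down where the two versions diverge, it may help to compare the extracted text line by line. The following is only a minimal sketch: the class ExtractionDiff and the method differingLines are hypothetical helpers (not part of PDFBox), and the sample strings stand in for the text returned by getText under each version.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of PDFBox: compares two extracted texts line by line.
class ExtractionDiff {

    // Returns the 1-based numbers of the lines that differ between the two texts.
    // Lines that exist in only one of the texts are reported as differences.
    static List<Integer> differingLines(String oldText, String newText) {
        String[] a = oldText.split("\\r?\\n", -1);
        String[] b = newText.split("\\r?\\n", -1);
        List<Integer> result = new ArrayList<>();
        int max = Math.max(a.length, b.length);
        for (int i = 0; i < max; i++) {
            String left = i < a.length ? a[i] : null;
            String right = i < b.length ? b[i] : null;
            if (left == null || !left.equals(right)) {
                result.add(i + 1);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Placeholder strings; in practice these would be the outputs of
        // stripper.getText(document) under PDFBox 1.1 and 1.3.1.
        String fromOldVersion = "Invoice 1001\nTotal: 42";
        String fromNewVersion = "Invoice 1001\nTotal:42";
        System.out.println(differingLines(fromOldVersion, fromNewVersion)); // prints [2]
    }
}
```

Knowing which lines change (for example, only spacing within a line versus a different ordering of lines) should make it easier to tell whether the sort-by-position setting or the word-spacing logic is involved.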
Thanks
Seby