Results 1 to 1 of 1
  1. #1
    Seby is offline Member
    Join Date
    Mar 2011
    Posts
    1
    Rep Power
    0

    Unhappy PDF extraction issue with apache PDFBox 1.3.1

    Hi All,

    I am facing some issue while extracting data from PDF using apache PDFBox.
    With PDFBox version 1.1, i was able to extract the data properly. But the same code is giving different output with version 1.3.1. Only for few PDFs,
    I am facing this issue.

    Code sample
    ------------

    PDDocument document = PDDocument.load(new File("sample.pdf"));
    PDFTextStripper stripper = new PDFTextStripper();
    stripper.setSortByPosition( true );
    System.out.println(stripper.getText(document));

    I have attached the outputs. Kindly request your help to resolve this.

    Thanks
    Seby
    Attached Files Attached Files

Similar Threads

  1. Data Extraction using JAVA
    By yap_1991 in forum Advanced Java
    Replies: 1
    Last Post: 06-01-2010, 08:02 AM
  2. File Extraction using Java
    By yap_1991 in forum Advanced Java
    Replies: 7
    Last Post: 05-14-2010, 08:06 AM
  3. JAVA Video Extraction
    By KSadeck in forum Advanced Java
    Replies: 3
    Last Post: 01-08-2009, 07:43 AM
  4. web extraction
    By murali in forum Networking
    Replies: 3
    Last Post: 12-13-2008, 07:10 AM
  5. Replies: 0
    Last Post: 11-15-2008, 07:29 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •