Thread: PDF@HTMLl5

  1. #1
    nj007 is offline Member
    Join Date
    Feb 2013
    Posts
    1
    Rep Power
    0

    Default PDF@HTMLl5

    I have a requirement to convert PDF documents to HTML5. I do not want to use any existing tool for this; I want to write my own code. Being a Java developer, I started with iText, but I saw that iText just extracts the text from the PDF and does not keep the PDF's formatting and layout.

    Can someone please advise which API I should use to achieve this? My high-level requirements are below.

    1. Extract the text from the PDF without losing its formatting and layout.

    2. Extract the images, if any.

    3. Retain the same formatting in the converted HTML5 page as in the PDF page.

    Thanks in advance.

  2. #2
    DarrylBurke's Avatar
    DarrylBurke is offline Member
    Join Date
    Sep 2008
    Location
    Madgaon, Goa, India
    Posts
    11,244
    Rep Power
    19

    Default Re: PDF@HTMLl5

    Moved from a staff-only section

    db
    If you're forever cleaning cobwebs, it's time to get rid of the spiders.

  3. #3
    Tolls is offline Moderator
    Join Date
    Apr 2009
    Posts
    12,015
    Rep Power
    20

    Default Re: PDF@HTMLl5

    Do you understand the structure of a PDF document and how it handles layout and the like?
    If not then you'll need to read up on that before you start.
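
    As a rough illustration of why the structure matters: a PDF page does not store flowing paragraphs. It stores runs of text placed at explicit coordinates, measured in points (1/72 inch) from the bottom-left corner of the page in the default coordinate system. HTML positions are measured from the top-left, so any converter has to flip the vertical axis. The sketch below assumes you have already obtained positioned runs from whichever PDF library you settle on; the TextRun class and every method name here are hypothetical and are not part of iText or any other library. Placing the top of the box at baseline plus font size is also only an approximation, since real font metrics differ.

```java
import java.util.List;
import java.util.Locale;

/** Hypothetical sketch: positioned PDF text runs to absolutely positioned HTML5. */
final class PdfLayoutSketch {

    /** One run of text with its position on the page (illustrative type, not a library class). */
    static final class TextRun {
        final String text;
        final double x;        // left edge, in points from the left of the page
        final double baseline; // baseline, in points from the BOTTOM of the page
        final double fontSize; // font size in points

        TextRun(String text, double x, double baseline, double fontSize) {
            this.text = text;
            this.x = x;
            this.baseline = baseline;
            this.fontSize = fontSize;
        }
    }

    /** Escapes the characters that would otherwise be read as HTML markup. */
    static String escapeHtml(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    /** Converts one run to a span, flipping the y axis from bottom-up to top-down. */
    static String toSpan(TextRun run, double pageHeight) {
        // Assumption: the top of the text box sits roughly one font size above the baseline.
        double top = pageHeight - run.baseline - run.fontSize;
        return String.format(Locale.ROOT,
                "<span style=\"position:absolute;left:%.2fpt;top:%.2fpt;"
                        + "font-size:%.2fpt;white-space:pre\">%s</span>",
                run.x, top, run.fontSize, escapeHtml(run.text));
    }

    /** Wraps all runs of one page in a container sized like the PDF page. */
    static String toPage(List<TextRun> runs, double pageWidth, double pageHeight) {
        StringBuilder sb = new StringBuilder();
        sb.append(String.format(Locale.ROOT,
                "<div style=\"position:relative;width:%.2fpt;height:%.2fpt\">%n",
                pageWidth, pageHeight));
        for (TextRun run : runs) {
            sb.append("  ").append(toSpan(run, pageHeight)).append(System.lineSeparator());
        }
        sb.append("</div>").append(System.lineSeparator());
        return sb.toString();
    }
}
```

    For a US Letter page (612 x 792 points), a 12 pt run whose baseline is 720 pt above the bottom edge ends up 60 pt below the top edge of the HTML container. Fonts, images, rotation and text that the PDF splits into separate fragments are all left out here, and those are the parts you will need to read up on.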
    Please do not ask for code as refusal often offends.

    ** This space for rent **
