You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community, you will:
have access to post topics
communicate privately with other members (PM)
not see advertisements between posts
have the possibility to earn one of our surprises if you are an active member
access many other special features that will be introduced later.
Hi You can try with the trial version of pd4ml which actually converts the HTML to PDF, you can get the logic of HTML to PDF in the open source. The same can be retreated for a pdf doc as well. The usuall result of pdf decomposing is mostly MSO HTML, so you can parse the same and get the desired content out.