PLEASE HELP: Reading words one by one (with determining font types) from pdf document
I have a pdf document(that is created from MS Word document, not image document). This pdf has texts in different font styles (bold, normal, and italic). I need to read these words of a text(from pdf) one by one and determine the type of font of the read word. for example: let we have such sentence in pdf:
Veli is writing to you.
I need to write a program that prints out:
word1: Veli ; font type:bold
word2: is ; font type: normal
word3: writing ; font type: italic
word4: to ; font type: bold
word5: you ; font type: bold+italic
It is important for me.. I have need to finish my lesson project. This is the first part of my project
Thanks in advance..
Re: PLEASE HELP: Reading words one by one (with determining font types) from pdf docu
What have you tried? Where are you stuck? Where is your SSCCE? Since it's so important for you, you might want to make it easier for us to help you by being more specific.
Re: PLEASE HELP: Reading words one by one (with determining font types) from pdf docu
I searched a lot about my problem in inernet. I found out a lot of information about writin to pdfs. A few of them are about reading from existing pdfs.. I found pdfBox library of Apache, which I think will be more helpful for me.. But I couldn't make inferences on it.. :(. I need to choose bold written words from pdf document. (Then I will write those words to a text document.))