i tried all kind of sources
You need to check on your sources then... allow me to give you some correct sources:
java html parser - Google Search
Connect to the webpage. Use one of those million parsers to strip all the html tags from the page, then count the words.