Search Engine , Web Crawler
Hi,
I am trying to build a search engine.[Java]
I have made a web crawler which on being given an initial [starting ]
URL,
1.will go to that URL
2.Retrieve all the URL's present on that web page.
3.Store them in URL_Database_file.
4.After finishing with that URL it retrieves the next URL automatically from
URL_Database_file.[Then repeats steps 1-4].
Now i want some guidance regarding how to retrieve words from a web pages HTML code, ie:- <p>Sahil Ansari needs Help</p> and then store them in a file.:confused::confused: