- 01-08-2010, 04:39 AM #1
Member
- Join Date
- Jan 2010
- Posts
- 2
- Rep Power
- 0
Developing a front end for a public website
I was asked to write a front end application that would pull data from the following website.
Atlas of Genetics and Cytogenetics in Oncology and Haematology
The specification is that the application should be able to send a search term and return details about the query from the website.
I have done some research on HTML parsers and feel that this would be the first step, but I am new to Java and need some input.
Can someone please point me to where to begin?
- 01-08-2010, 05:57 AM #2
I've done quite a bit of this -- some sites are easier than others. In the best case, you can create a BufferedReader and give it an InputStreamReader wrapped around a data stream opened to a URL you specify. The data that you get from the reader is the server's response to that URL -- so, if you send something simple like 'http://www.apple.com' you get the HTML for the Apple homepage.
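The best case described above can be sketched roughly as follows. This is a minimal illustration, not a finished client: the class name `PageFetcher` and the choice of UTF-8 are assumptions, and a real page may declare a different character set.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, named here only for illustration.
class PageFetcher {

    // Reads everything from the given reader into one string, line by line.
    static String readAll(Reader source) throws IOException {
        StringBuilder page = new StringBuilder();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                page.append(line).append('\n');
            }
        }
        return page.toString();
    }

    // Opens a stream to the URL and returns the body the server sends back.
    // UTF-8 is an assumption; the server may use another encoding.
    static String fetch(String address) throws IOException {
        URL url = new URL(address);
        return readAll(new InputStreamReader(url.openStream(), StandardCharsets.UTF_8));
    }

    public static void main(String[] args) throws IOException {
        // Prints the HTML of the page named in the post above.
        System.out.println(fetch("http://www.apple.com"));
    }
}
```

Keeping `readAll` separate from `fetch` means the reading logic can be tried on a plain string before pointing it at a live server.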
However, you can also put a query in the URL, for example the GET parameters that a search form or PHP script on the server expects (a POST has to be sent in the request body instead). In this case, the results you get are the results of the 'query' you constructed in the initial URL.
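As a rough sketch of constructing such a query URL: the endpoint `http://example.org/search` and the parameter name `q` below are placeholders, not the real site's search interface, which would have to be read off its search form.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper class, named here only for illustration.
class SearchUrlBuilder {

    // Placeholder endpoint and parameter name (assumptions, not the real site's).
    static final String SEARCH_ENDPOINT = "http://example.org/search";
    static final String TERM_PARAMETER = "q";

    // Appends the search term as a GET parameter, escaping characters
    // that are not allowed to appear literally in a query string.
    static String buildSearchUrl(String term) throws UnsupportedEncodingException {
        return SEARCH_ENDPOINT + "?" + TERM_PARAMETER + "=" + URLEncoder.encode(term, "UTF-8");
    }

    public static void main(String[] args) throws UnsupportedEncodingException {
        // prints http://example.org/search?q=BCR+ABL1
        System.out.println(buildSearchUrl("BCR ABL1"));
    }
}
```

The resulting string can be opened like any other URL, and the response is then the search result page rather than a static page.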
Now, things get tricky when the web server on the other end does not like the fact that you are not running a real web browser but a Java app. In these cases (IMDB.com for example), you have to create a custom HTTP request whose User-Agent header tells the server your app is really Mozilla or something similar.
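One way to build such a custom request is with `HttpURLConnection`, sketched below. The class name, the User-Agent string, and the ten-second timeouts are made-up illustrative values; whether a particular site accepts them is something only testing against that site can show.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;

// Hypothetical helper class, named here only for illustration.
class BrowserLikeRequest {

    // Illustrative User-Agent value (an assumption, not a required string).
    static final String USER_AGENT = "Mozilla/5.0 (compatible; ExampleFetcher/0.1)";

    // Prepares a GET request that identifies itself with a browser-style User-Agent.
    // No network traffic happens until the response is actually read.
    static HttpURLConnection open(String address) throws IOException {
        HttpURLConnection connection = (HttpURLConnection) new URL(address).openConnection();
        connection.setRequestMethod("GET");
        connection.setRequestProperty("User-Agent", USER_AGENT);
        connection.setConnectTimeout(10_000); // milliseconds
        connection.setReadTimeout(10_000);    // milliseconds
        return connection;
    }

    public static void main(String[] args) throws IOException {
        HttpURLConnection connection = open("http://www.apple.com");
        // Asking for the response code sends the request.
        System.out.println("HTTP status: " + connection.getResponseCode());
        connection.disconnect();
    }
}
```

The body can then be read from `connection.getInputStream()` with the same BufferedReader approach used for a plain URL stream.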
I've done it both ways and am happy to share. Why not tell us more about the progress you've made so far?
- 01-13-2010, 08:26 AM #3
OK, so all I've done so far is research on HTML parsers. I'm undecided whether to parse the whole website and dump it into a database or to try to figure out the POST/GET coding. Sending a simple query doesn't seem too hard to figure out, but can you post some examples?
The problem is that the website only lets you search by gene name, whereas I might need to search the content of each individual gene's HTML file for a gene position and construct complex queries later on. So it makes sense just to do a database dump to SQL. However, the site will be continuously updated and I can't tax their server; they are French and may get upset. How is this usually done - just automate the dump every so often?
Any suggestions?
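One possible shape for an automated, low-impact dump is sketched below: fetch the pages one at a time with a pause between requests, and repeat the whole pass on a long interval. Everything specific here is an assumption for illustration: the class name, the example addresses, the two-second pause, the weekly interval, and the stub fetcher that stands in for real download and SQL-insert code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Function;

// Hypothetical helper class, named here only for illustration.
class PoliteDump {

    // Fetches each address in turn, sleeping between requests so the
    // remote server never sees a burst of traffic.
    static List<String> fetchAll(List<String> addresses, Function<String, String> fetcher, long pauseMillis)
            throws InterruptedException {
        List<String> pages = new ArrayList<>();
        for (int i = 0; i < addresses.size(); i++) {
            if (i > 0) {
                Thread.sleep(pauseMillis);
            }
            pages.add(fetcher.apply(addresses.get(i)));
        }
        return pages;
    }

    public static void main(String[] args) {
        // Placeholder addresses; a real run would list the gene pages to mirror.
        List<String> genePages = Arrays.asList(
                "http://example.org/genes/A.html",
                "http://example.org/genes/B.html");

        // Stub fetcher; replace with real download code and a database insert.
        Function<String, String> stubFetcher = address -> "<html>stub for " + address + "</html>";

        // Repeats the pass once a week; the program keeps running until it is stopped.
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleAtFixedRate(() -> {
            try {
                List<String> pages = fetchAll(genePages, stubFetcher, 2_000);
                System.out.println("Refreshed " + pages.size() + " pages");
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }, 0, 7, TimeUnit.DAYS);
    }
}
```

Passing the fetcher in as a function keeps the throttling logic separate from the download code, so the pause and schedule can be tuned without touching the parser.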