Java Forums

  #1 (permalink)  
Old 01-03-2008, 04:44 PM
Member
 
Join Date: Jan 2008
Posts: 4
galford23 is on a distinguished road
Unclear Requirement from client ... Ring a bell?
Hi all,

I am new to Java, but was asked to do a project for a client.

The deliverable is a document archiving system running on Java + Tomcat + Lucene.

There is one requirement which does not ring a bell at all. I am wondering whether anyone out there understands this statement:

"A user of the system will be able to create a document archive on a machine running a Tomcat (or Jetty) web-server. A document may come from an RSS feed (its content will be saved on the server along with data characterizing the original feed details)."

I really need help. I tried reading about RSS on the net, but how can I archive an RSS feed?

  #2 (permalink)  
Old 01-03-2008, 05:21 PM
jelly's Avatar
Member
 
Join Date: Jan 2008
Location: Somerset, UK
Posts: 46
jelly is on a distinguished road
RSS feeds are XML-based, so you could simply save the XML document from the feed.
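Saving the feed XML as-is could look roughly like the sketch below. The class name FeedSaver, the method name, and the archive directory layout are made up for illustration and are not part of any library; the only real API used is java.nio.file.Files.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Hypothetical helper: stores the raw feed XML unchanged under an archive directory. */
final class FeedSaver {

    /**
     * Copies the feed bytes from feedXml into archiveDir/fileName
     * and returns the path of the saved file.
     */
    static Path save(InputStream feedXml, Path archiveDir, String fileName) throws IOException {
        Files.createDirectories(archiveDir); // make sure the archive folder exists
        Path target = archiveDir.resolve(fileName);
        // Overwrite any earlier snapshot that used the same file name.
        Files.copy(feedXml, target, StandardCopyOption.REPLACE_EXISTING);
        return target;
    }
}
```

In a real application the InputStream would probably come from the feed's address, for example via java.net.URL.openStream(), but any stream works, which also makes the method easy to try out without network access.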
  #3 (permalink)  
Old 01-03-2008, 05:32 PM
Member
 
Join Date: Jan 2008
Posts: 4
galford23 is on a distinguished road
Meaning to say, if a user clicks on the saved XML document, where does it take the user?
  #4 (permalink)  
Old 01-03-2008, 09:55 PM
roots's Avatar
Moderator
 
Join Date: Jan 2008
Location: Dallas
Posts: 263
roots is on a distinguished road
You should talk to the client. The more you talk with them, the more you will know how they want the bell to ring.
Keep a local copy of the RSS feed, perhaps in the Lucene index itself. Pull the document from the net using the general Java networking library or Apache HttpComponents. Extract the XML data into different fields such as title, body and so on. Prepare a Lucene document out of that feed and store it in the index.

Asking won't do any harm. Talk to the client once more and ask whether this is what they meant.
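The "extract the XML data into different fields" step could be sketched like this with the XML parser that ships with the JDK. The class RssFields and its Item type are hypothetical names, and the choice of the RSS 2.0 elements title, link, description and pubDate is an assumption about what the client wants archived. The Lucene call itself is left out because the field classes differ between Lucene versions; each Item member would become one field of the Lucene document.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper: reduces an RSS 2.0 feed to the fields we might index. */
final class RssFields {

    /** One feed item; each member would map to one field in the index. */
    static final class Item {
        final String title;
        final String link;
        final String description;
        final String pubDate;

        Item(String title, String link, String description, String pubDate) {
            this.title = title;
            this.link = link;
            this.description = description;
            this.pubDate = pubDate;
        }
    }

    static List<Item> parse(String rssXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Feeds come from outside, so refuse DOCTYPE declarations (guards against XXE).
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(rssXml)));

        List<Item> result = new ArrayList<>();
        NodeList items = doc.getElementsByTagName("item");
        for (int i = 0; i < items.getLength(); i++) {
            Element item = (Element) items.item(i);
            result.add(new Item(text(item, "title"), text(item, "link"),
                    text(item, "description"), text(item, "pubDate")));
        }
        return result;
    }

    /** Text of the first child element with the given tag, or "" if it is absent. */
    private static String text(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0 ? "" : nodes.item(0).getTextContent().trim();
    }
}
```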
__________________
dont worry newbie, we got you covered.
  #5 (permalink)  
Old 01-08-2008, 04:51 AM
Member
 
Join Date: Jan 2008
Posts: 4
galford23 is on a distinguished road
Hi roots,

I have looked into the Java networking library and understand that a file can be downloaded. However, it is this statement that I do not quite get: "its content will be saved on the server along with data characterizing the original feed details".

Another requirement is "the archive will contain only a URL for a document that exists on the Web". I have no idea how to design this part of the project.
If it is a physical document (doc, pdf, etc.), I understand that it can be uploaded manually via an upload function -> go through the Lucene index -> database storage.
But a URL?

They leave the design to us, and we will present the first proposal in a couple of weeks' time.

Maybe someone can provide some links where I can research this myself.
  #6 (permalink)  
Old 01-08-2008, 05:49 AM
roots's Avatar
Moderator
 
Join Date: Jan 2008
Location: Dallas
Posts: 263
roots is on a distinguished road
Quote:
Originally Posted by galford23 View Post
its content will be saved on the server along with data characterizing the original feed details
In the simplest terms, let's begin with an example: an RSS feed with attached content such as an image. We can then interpret the statement as follows: the image shall be saved on the server along with the data (such as URL, date, title ...) of the original RSS feed.

I could not follow this part:
Quote:
If it is a physical document (doc, pdf, etc.), I understand that it can be uploaded manually via an upload function -> go through the Lucene index -> database storage.
But a URL?
Quote:
Originally Posted by galford23 View Post
the archive will contain only a URL for a document that exists on the Web
My guess is that your feeds will come with documents linked or attached to the feed. In case the document is linked by a URI, you only need to save that link in the database.

Review the specifications of RSS and the other feed types, and talk to the client. If they are not available, then it's the client's fault.
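One way to picture the two cases (content saved on the server versus only a URL kept) is a single archive record where the stored copy is optional. This is only a sketch of a possible design; ArchiveEntry and its members are invented names, and the real set of fields depends on what the client confirms.

```java
import java.util.Objects;

/** Hypothetical archive record: every entry has a source URL, a stored copy is optional. */
final class ArchiveEntry {
    final String sourceUrl;  // where the document lives on the Web
    final String title;      // descriptive data taken from the feed
    final String storedPath; // local copy on the server, or null for URL-only entries

    private ArchiveEntry(String sourceUrl, String title, String storedPath) {
        this.sourceUrl = Objects.requireNonNull(sourceUrl, "sourceUrl");
        this.title = title;
        this.storedPath = storedPath;
    }

    /** The document stays on the Web; the archive keeps only the link. */
    static ArchiveEntry urlOnly(String sourceUrl, String title) {
        return new ArchiveEntry(sourceUrl, title, null);
    }

    /** The content was downloaded and saved on the server as well. */
    static ArchiveEntry withStoredCopy(String sourceUrl, String title, String storedPath) {
        return new ArchiveEntry(sourceUrl, title, Objects.requireNonNull(storedPath, "storedPath"));
    }

    boolean hasStoredCopy() {
        return storedPath != null;
    }
}
```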
  #7 (permalink)  
Old 01-08-2008, 06:40 AM
Member
 
Join Date: Jan 2008
Posts: 4
galford23 is on a distinguished road
Quote:
Originally Posted by roots View Post
In the simplest terms, let's begin with an example: an RSS feed with attached content such as an image. We can then interpret the statement as follows: the image shall be saved on the server along with the data (such as URL, date, title ...) of the original RSS feed.

From my newbie understanding, I guess only the XML file will go through the indexing.
Does that mean the image file that the feed points to will be saved as well? If yes, how can that be achieved?
  #8 (permalink)  
Old 01-08-2008, 07:31 AM
roots's Avatar
Moderator
 
Join Date: Jan 2008
Location: Dallas
Posts: 263
roots is on a distinguished road
That's a good point. Lucene is meant for indexing text content, so things like images should be saved either in the file system or in the database. With a solution made of just Lucene and the file system, you can store non-text documents in the file system and include their path information in Lucene.
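A rough sketch of that "file on disk, path in the index" idea follows. BinaryStore, the field names url, contentType and path, and the choice to name files after a SHA-256 hash of the source URL are all assumptions made for this example. The returned map stands in for the fields that would be added to the Lucene document.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper: keeps non-text content on disk and returns the fields to index. */
final class BinaryStore {

    static Map<String, String> store(Path storeDir, String sourceUrl, String contentType,
                                     byte[] content) throws IOException {
        Files.createDirectories(storeDir);
        // Hashing the URL gives a file name that is safe on any file system.
        Path target = storeDir.resolve(sha256Hex(sourceUrl));
        Files.write(target, content);

        // Only these small text values go into the index, not the bytes themselves.
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("url", sourceUrl);
        fields.put("contentType", contentType);
        fields.put("path", target.toString());
        return fields;
    }

    private static String sha256Hex(String value) {
        try {
            byte[] digest = MessageDigest.getInstance("SHA-256")
                    .digest(value.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // Every Java platform is required to provide SHA-256.
            throw new IllegalStateException(e);
        }
    }
}
```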

Not ringing a bell is again the problem here. What are the possible document formats? Is it just the text in the feed and the text in the content the feed points to, or something else as well? Designing from our own assumptions will increase the scope of the application without necessarily addressing what the customer wants. Once again, you should talk to the client.
All times are GMT +3.