Results 1 to 6 of 6
  1. #1
    TomBoy13 is offline Member
    Join Date
    Mar 2012
    Posts
    3
    Rep Power
    0

    Post About Vector Search Model

    I new for IR Techniques;
    i need to implement Vector Space Model,
    remove stopwords,
    calculate TF-IDF weights.
    The output: the rank, the document's ID and the similarity score.
    Please Help me...
    Million Thanks in Advance

  2. #2
    Norm's Avatar
    Norm is online now Moderator
    Join Date
    Jun 2008
    Location
    Eastern Florida
    Posts
    17,883
    Rep Power
    25

    Default Re: About Vector Search Model

    Do you have any specific java programming questions? Your problem description doesn't have anything to about a java program in it.

  3. #3
    TomBoy13 is offline Member
    Join Date
    Mar 2012
    Posts
    3
    Rep Power
    0

    Post Re: About Vector Search Model

    Quote Originally Posted by TomBoy13 View Post
    I new for IR Techniques;
    i need to implement Vector Space Model,
    remove stopwords,
    calculate TF-IDF weights.
    The output: the rank, the document's ID and the similarity score.
    Please Help me...
    Million Thanks in Advance
    1. Perform the common stopword removal pre-processing step (you are not required to perform stemming). A list of stopwords to use is contained in the stopwords.txt file.

    2. Create an appropriate index so that IR using the Vector Space Model may be performed. This will require you to calculate the appropriate TF-IDF weights. This may be stored in memory or in an external file. You may NOT use database systems such as MySQL, SQL Server, Oracle or similar.

    3. Accept a query on the command line (no GUI is required) and return a list of the 100 most relevant documents, according to the Vector Space IR Model sorted beginning with the highest similarity score. The output should have three columns: the rank, the document's ID and the similarity score.

    i need to develop in Java.
    Please Help me...
    i have 28000 thousands word file..in that remove the stopwords. no stemming performs
    How would i get?
    Thanks in advance.
    Last edited by TomBoy13; 03-15-2012 at 03:26 PM.

  4. #4
    Norm's Avatar
    Norm is online now Moderator
    Join Date
    Jun 2008
    Location
    Eastern Florida
    Posts
    17,883
    Rep Power
    25

    Default Re: About Vector Search Model

    Read the file line by line.
    Remove the undesired words.
    Then ????

  5. #5
    TomBoy13 is offline Member
    Join Date
    Mar 2012
    Posts
    3
    Rep Power
    0

    Default Re: About Vector Search Model

    How to calculate Tf-Idf of given string.?
    If you have example just post it.
    Thanks

  6. #6
    Norm's Avatar
    Norm is online now Moderator
    Join Date
    Jun 2008
    Location
    Eastern Florida
    Posts
    17,883
    Rep Power
    25

    Default Re: About Vector Search Model

    Find the class with the TfIdf() method and call that method. There is nothing like that in the Java SE classes and methods. You will have to find a third party package with the classes and methods that you need.

Similar Threads

  1. Replies: 0
    Last Post: 02-24-2012, 09:39 AM
  2. Replies: 4
    Last Post: 03-25-2011, 12:50 AM
  3. Replies: 5
    Last Post: 08-26-2008, 04:43 PM
  4. vector list search
    By hezfast2 in forum New To Java
    Replies: 2
    Last Post: 06-14-2008, 07:48 PM
  5. Search a object in a vector
    By TalhaS in forum New To Java
    Replies: 2
    Last Post: 04-30-2008, 03:05 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •