Java Forums

#1
01-30-2008, 08:03 PM
joe_2110 (Member, joined Jan 2008, 1 post)
Feature extraction from a text file in Java, used for scoring sentences
I have separated the individual sentences of a text document read from a file, using several heuristics. Now I need to index the sentences by order of appearance (e.g. the 1st sentence as 1, then 2, 3, etc.). Then I need to extract features from these indexed sentences. The features are position (1 if the sentence is at the top of the document, 0 if it is at the bottom), length (the number of words in the sentence), and thematic words (the most frequent words). Based on these features I will score the sentences. Please help me out, friends.
#2
02-04-2008, 10:26 PM
erhart (Member, joined Jan 2008, 7 posts)
There are a few ways you could do this. The most efficient is to keep track of each sentence's position while you extract the sentences from the file. You could define a class Sentence with fields such as String sentence, int position, int appearanceOrder, etc. The Sentence class could also contain methods that count the number of words in the sentence and find the most frequent words. Is this helpful, or were you looking for something more specific?
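A minimal sketch of that Sentence class idea, in case it helps. Everything here is an assumption made for illustration, not something from the original question: the helper class SentenceFeatures, the method names, the linear 1-to-0 position formula (which is why position is a double here rather than an int), the rule that a word is a run of letters or digits, and the minCount threshold for deciding which words count as "thematic".

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical class following the suggestion above; names are illustrative.
final class Sentence {
    final String text;
    final int appearanceOrder; // 1-based order of appearance in the document
    final double position;     // 1.0 for the first sentence, 0.0 for the last

    Sentence(String text, int appearanceOrder, int totalSentences) {
        this.text = text;
        this.appearanceOrder = appearanceOrder;
        // Assumption: position falls linearly from 1 to 0 through the document.
        this.position = totalSentences <= 1
                ? 1.0
                : 1.0 - (double) (appearanceOrder - 1) / (totalSentences - 1);
    }

    // Lower-cased word tokens; assumption: a word is a run of letters or digits.
    List<String> words() {
        List<String> result = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                result.add(token);
            }
        }
        return result;
    }

    // Length feature: number of words in the sentence.
    int length() {
        return words().size();
    }
}

// Hypothetical helper holding the document-level steps.
final class SentenceFeatures {
    // Wraps already-split sentence strings, numbering them 1, 2, 3, ...
    static List<Sentence> index(List<String> rawSentences) {
        List<Sentence> indexed = new ArrayList<>();
        for (int i = 0; i < rawSentences.size(); i++) {
            indexed.add(new Sentence(rawSentences.get(i), i + 1, rawSentences.size()));
        }
        return indexed;
    }

    // Counts how often each word occurs across the whole document.
    static Map<String, Integer> wordFrequencies(List<Sentence> sentences) {
        Map<String, Integer> counts = new HashMap<>();
        for (Sentence s : sentences) {
            for (String w : s.words()) {
                counts.merge(w, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Thematic-word feature: how many words of the sentence occur at least
    // minCount times in the document (minCount is an assumed threshold).
    static int thematicWordCount(Sentence s, Map<String, Integer> counts, int minCount) {
        int hits = 0;
        for (String w : s.words()) {
            if (counts.getOrDefault(w, 0) >= minCount) {
                hits++;
            }
        }
        return hits;
    }
}
```

You would call SentenceFeatures.index(...) on the list of sentences you already split out, then wordFrequencies(...) once for the document, and thematicWordCount(...) per sentence. Note that without a stop-word list the most frequent words will mostly be things like "the" and "a", so you would probably want to filter those out before counting. How you combine the three features into a score is up to you.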
All times are GMT +3.


Copyright ©2006 - 2007, www.java-forums.org