  #1 (permalink)  
Old 03-14-2009, 09:15 AM
Senior Member
 
Join Date: Dec 2008
Location: Chennai
Posts: 253
Rep Power: 2
makpandian is on a distinguished road
Site Grabber
Hi all,
I want to grab all the files that belong to a website.
I think this is possible with the help of the URL and URI classes.
I can download a file from a site using the URL class,
but I couldn't find out which files exist under the site.

If anyone knows how to extract the file names under a web directory, please share it with me.


Thank you.
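
For readers following along, here is a minimal sketch of the single-file download mentioned in this post, assuming it relies on java.net.URL and its openStream() method. The class name SingleFileDownload, the download helper, and the command-line arguments are illustrative and do not come from the original post.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

class SingleFileDownload {

    // Copies the bytes served at `source` into `target`, replacing any
    // existing file, and returns the number of bytes written.
    public static long download(URL source, Path target) throws IOException {
        try (InputStream in = source.openStream()) {
            return Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.out.println("usage: java SingleFileDownload <url> <target-file>");
            return;
        }
        // The URL is supplied by the caller; only fetch files you are allowed to copy.
        URL source = URI.create(args[0]).toURL();
        long bytes = download(source, Paths.get(args[1]));
        System.out.println("Saved " + bytes + " bytes to " + args[1]);
    }
}
```

This only fetches a file whose address is already known; it does not discover which other files exist on the server.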
__________________
Mak
(Living @ Virtual World)
  #2 (permalink)  
Old 03-14-2009, 10:37 AM
Senior Member
 
Join Date: Aug 2008
Posts: 368
Rep Power: 2
Supamagier is on a distinguished road
Every page is a file, for example xx.html.
__________________
I die a little on the inside...
Every time I get shot.
  #3 (permalink)  
Old 03-14-2009, 11:33 AM
Senior Member
 
Join Date: Dec 2008
Location: Chennai
Posts: 253
Rep Power: 2
makpandian is on a distinguished road
I know the site name, but I don't know which files are on that site. If I knew all the file paths, I could download the entire site.
__________________
Mak
(Living @ Virtual World)
  #4 (permalink)  
Old 03-16-2009, 07:35 AM
Member
 
Join Date: Mar 2009
Posts: 6
Rep Power: 0
tommosimmo is on a distinguished road
For security reasons, the files on a web server are not all accessible to the general public unless the webmaster makes them so. An easy way for a webmaster to display all the files in a directory is simply not to have an index page there, provided the server has directory listing enabled.
  #5 (permalink)  
Old 03-16-2009, 10:59 AM
Senior Member
 
Join Date: Dec 2008
Location: Chennai
Posts: 253
Rep Power: 2
makpandian is on a distinguished road
Thanks a lot, tommosimmo.
Earlier, I tried to copy the site by reading the file names from the server, but from your comment I accept that this is not possible.

Now I am trying to find all the file names via index.html.

Do you know how to parse HTML to find the links available on the site?
__________________
Mak
(Living @ Virtual World)
  #6 (permalink)  
Old 03-16-2009, 11:39 AM
Member
 
Join Date: Mar 2009
Posts: 72
Rep Power: 0
bubbless is on a distinguished road
Search for regular expressions.
You can look for "http://" or "www" to find the links.
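
A rough sketch of this suggestion, using java.util.regex to pick out anything that starts with "http://", "https://" or "www.". The class name AbsoluteLinkFinder, the exact pattern, and the sample HTML are assumptions made for illustration, not something posted in this thread.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class AbsoluteLinkFinder {

    // Matches text that starts with "http://", "https://" or "www." and runs
    // until whitespace, a quote, or an angle bracket.
    private static final Pattern ABSOLUTE_LINK = Pattern.compile(
            "(?:https?://|www\\.)[^\\s\"'<>]+", Pattern.CASE_INSENSITIVE);

    public static List<String> findAbsoluteLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = ABSOLUTE_LINK.matcher(html);
        while (m.find()) {
            links.add(m.group());
        }
        return links;
    }

    public static void main(String[] args) {
        // Made-up sample page: one absolute link, one relative link, one bare "www." address.
        String html = "<a href=\"http://www.example.com/a.html\">A</a> "
                + "<a href=\"docs/b.html\">B</a> see www.example.org/c.html";
        for (String link : findAbsoluteLinks(html)) {
            System.out.println(link);
        }
    }
}
```

With the sample input, the relative link docs/b.html is not matched, because it contains neither prefix.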
  #7 (permalink)  
Old 03-16-2009, 01:07 PM
Senior Member
 
Join Date: Dec 2008
Location: Chennai
Posts: 253
Rep Power: 2
makpandian is on a distinguished road
bubbless,
one thing to note: index files contain links as relative paths, so it is difficult to find them with a regular expression that looks for "www" and "http:".
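
Once a relative link has been found, it still has to be turned into a full address before it can be downloaded. One possible way, sketched here under the assumption that the address of the page containing the link is known, is java.net.URI.resolve. The class name RelativeLinkResolver and the example.com addresses are hypothetical.

```java
import java.net.URI;

class RelativeLinkResolver {

    // Resolves `href` against the address of the page it was found on.
    // URI.create throws IllegalArgumentException for strings that are not
    // valid URIs (for example, links containing unescaped spaces).
    public static String resolve(String pageUrl, String href) {
        return URI.create(pageUrl).resolve(href).toString();
    }

    public static void main(String[] args) {
        String page = "http://www.example.com/docs/index.html"; // hypothetical page

        // Relative to the page's directory: http://www.example.com/docs/images/logo.png
        System.out.println(resolve(page, "images/logo.png"));
        // Up one directory: http://www.example.com/about.html
        System.out.println(resolve(page, "../about.html"));
        // Relative to the site root: http://www.example.com/css/site.css
        System.out.println(resolve(page, "/css/site.css"));
        // Already absolute, returned unchanged: http://other.example.org/x.html
        System.out.println(resolve(page, "http://other.example.org/x.html"));
    }
}
```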
__________________
Mak
(Living @ Virtual World)
  #8 (permalink)  
Old 03-16-2009, 01:27 PM
Member
 
Join Date: Mar 2009
Posts: 72
Rep Power: 0
bubbless is on a distinguished road
You can also match on <a href=", which will work.
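
A minimal sketch of that idea, assuming the links are written as double-quoted href attributes on <a> tags. The class name HrefExtractor, the pattern, and the sample HTML are illustrative only; a regular expression like this is a rough tool and will miss single-quoted or unquoted attributes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class HrefExtractor {

    // Captures the double-quoted href value of an <a> tag, ignoring case.
    // Single-quoted or unquoted attribute values are deliberately not handled.
    private static final Pattern HREF = Pattern.compile(
            "<a\\b[^>]*?\\bhref\\s*=\\s*\"([^\"]*)\"", Pattern.CASE_INSENSITIVE);

    public static List<String> extractHrefs(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1)); // group 1 is the text between the quotes
        }
        return links;
    }

    public static void main(String[] args) {
        // Made-up sample: a relative link, an absolute link, and an image that is skipped.
        String html = "<a href=\"docs/b.html\">B</a> "
                + "<A class=\"x\" HREF=\"http://www.example.com/a.html\">A</A> "
                + "<img src=\"p.png\">";
        System.out.println(extractHrefs(html));
    }
}
```

Because the pattern keys on the attribute rather than on "http://" or "www", it returns relative and absolute links alike.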
  #9 (permalink)  
Old 03-16-2009, 01:46 PM
Senior Member
 
Join Date: Dec 2008
Location: Chennai
Posts: 253
Rep Power: 2
makpandian is on a distinguished road
bubbless,
I am now trying the approach you suggested.
__________________
Mak
(Living @ Virtual World)
All times are GMT +2. The time now is 05:52 PM.



Copyright ©2006 - 2007, www.java-forums.org