Thread: Files Indexer

  1. #1
    carlneto is offline Member
    Join Date
    Feb 2011
    Posts
    3
    Rep Power
    0

    Files Indexer

    Hi,

    I'm developing an indexing structure (like an AVL tree) in Java, but I can't find a way to make it properly read pdf, docx, xlsx... files. :confused:

    Could someone give me an idea?

    Thanks!
    carlneto
    :)

  2. #2
    doWhile is offline Moderator
    Join Date
    Jul 2010
    Location
    California
    Posts
    1,641
    Rep Power
    7

    I'm not sure what you are looking for, so I suggest you start with this:
    Lesson: Basic I/O (The Java™ Tutorials > Essential Classes)
    Each file type has its own format, so if you want to extract the text you will need a parser for each type. The Apache open source projects include several libraries that might be useful to you.
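One way to organise "a parser for each type" is a small registry keyed by file extension. The following is only a sketch under assumptions: `TextExtractor`, `ExtractorRegistry` and `ExtractorDemo` are hypothetical names, and only a plain-text extractor is registered. Extractors for pdf, docx or xlsx would wrap whichever third-party library you choose.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

/** Hypothetical interface: one implementation per file format. */
interface TextExtractor {
    String extract(Path file) throws IOException;
}

/** Hypothetical registry that picks an extractor by file extension. */
class ExtractorRegistry {
    private final Map<String, TextExtractor> byExtension = new HashMap<>();

    void register(String extension, TextExtractor extractor) {
        byExtension.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    /** Returns the lower-cased extension without the dot, or "" if there is none. */
    static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        return dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Returns the extracted text, or empty if no extractor handles this file type. */
    Optional<String> extract(Path file) throws IOException {
        TextExtractor extractor = byExtension.get(extensionOf(file.getFileName().toString()));
        return extractor == null ? Optional.empty() : Optional.of(extractor.extract(file));
    }
}

class ExtractorDemo {
    public static void main(String[] args) throws IOException {
        ExtractorRegistry registry = new ExtractorRegistry();
        // Plain text needs no library; other formats would register their own extractor.
        registry.register("txt", file -> new String(Files.readAllBytes(file), StandardCharsets.UTF_8));

        Path sample = Files.createTempFile("sample", ".txt");
        Files.write(sample, "hello index".getBytes(StandardCharsets.UTF_8));
        System.out.println(registry.extract(sample).orElse("(no extractor registered)"));
        Files.delete(sample);
    }
}
```

The indexer then only deals with the returned text, so supporting a new format means registering one more extractor rather than changing the tree code.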

  3. #3
    carlneto

    I have already found Apache PDFBox, but I can't seem to incorporate it into the NetBeans IDE and use its methods. Hypothetically, it would work like this:

    My code would call that method:

    if (objFile.isFile()) {
        // case-insensitive check of the file extension
        if (objFile.getName().toLowerCase().endsWith(".pdf")) {

            // how do I call the text parser?

        }
    }

    - How do I install PDFBox in the NetBeans IDE?

    - How do I call its methods?

  4. #4
    doWhile

    You need to add the jar(s) to your classpath to be able to access the library. I don't use NetBeans, so I can't lead you through it, but I bet a quick Google search will point you in the right direction.
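To confirm that the jar really ended up on the classpath, a small probe like the one below can help. It is a sketch, not part of PDFBox: `ClasspathCheck` is a hypothetical name, and it only assumes that the PDFBox jar contains the class `org.apache.pdfbox.pdmodel.PDDocument`.

```java
class ClasspathCheck {
    // Fully qualified name of a core PDFBox class, used only as a probe.
    static final String PDFBOX_PROBE = "org.apache.pdfbox.pdmodel.PDDocument";

    /** Returns true if the named class can be located by this class's class loader. */
    static boolean isOnClasspath(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("java.class.path = " + System.getProperty("java.class.path"));
        if (isOnClasspath(PDFBOX_PROBE)) {
            System.out.println("PDFBox found on the classpath");
        } else {
            System.out.println("PDFBox not found: add the jar(s) to the project's libraries");
        }
    }
}
```

In Ant-based NetBeans projects the jar is usually added through the project's Libraries node (Add JAR/Folder), though the exact menu may differ between versions. Once the probe succeeds, and assuming PDFBox 2.x, the text is typically obtained by loading the file with `PDDocument.load(file)`, passing the document to `new PDFTextStripper().getText(document)`, and closing the document afterwards. Other PDFBox versions use different package names or loading calls, so check the documentation for the version you downloaded.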
