  1. #1
    Join Date
    Apr 2009

    How to index special characters in Lucene


    I'm new to Lucene. I downloaded Lucene 2.4.1.
    I have an XML file that contains a few special characters like 'æ', 'ø', 'å', etc. (these are Danish letters).
    How can I search for them?

    When indexing my documents, I passed an instance of DutchAnalyzer as an argument to the IndexWriter class.

    After this, when I search for content that contains the Danish characters, the search still does not find it.

    Please tell me how to use DutchAnalyzer in my application. A sample or a series of steps would help.

  2. #2
    serjant, Senior Member
    Join Date
    Jun 2008

    Try converting those characters in your XML file to Unicode escapes; the Java tool native2ascii will do the conversion.
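
    To make the suggestion concrete, here is a rough sketch of the kind of conversion native2ascii performs: every character outside 7-bit ASCII is rewritten as a backslash-u escape with four hex digits. The class and method names (UnicodeEscapeSketch, escapeNonAscii) are made up for illustration and are not part of the JDK tool or of Lucene. Keep in mind that escapes of this form are understood by the Java compiler and by .properties files; an XML parser does not interpret them, so this may not help for XML input.

```java
public class UnicodeEscapeSketch {

    // Hypothetical helper: rewrites each char above 7-bit ASCII as a
    // backslash-u escape (four lowercase hex digits); ASCII is copied as-is.
    static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c > 127) {
                out.append(String.format("\\u%04x", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The literal below uses escapes so that this source file stays
        // pure ASCII; at runtime the string contains the Danish letter itself.
        String danish = "r\u00f8d gr\u00f8d med fl\u00f8de";
        System.out.println(escapeNonAscii(danish));
    }
}
```

    Running main prints the phrase with each ø replaced by the six ASCII characters \u00f8.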

  3. #3
    Join Date
    Apr 2009

    Thanks for your reply. There is no way to change the content. We need to index it using the different analyzers that are available in Lucene.

    I found one analyzer that should work for Danish characters, but I don't know how to use it.
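
    The thread never establishes why the search fails. One cause worth ruling out, which is an assumption here and not something confirmed above, is that the file's bytes are decoded with the wrong charset before the text ever reaches an analyzer. Checking this does not require changing the file. The sketch below uses only the JDK (no Lucene calls) and assumes the XML is stored as UTF-8; the class and method names are made up for illustration. Separately, note that DutchAnalyzer is built for Dutch, not Danish, so it may not be the analyzer wanted here.

```java
import java.nio.charset.StandardCharsets;

public class CharsetDecodeSketch {

    // Decode raw file bytes as UTF-8 (the assumed encoding of the XML file).
    static String decodeUtf8(byte[] raw) {
        return new String(raw, StandardCharsets.UTF_8);
    }

    // Decode the same bytes as ISO-8859-1 to show what a mismatch does.
    static String decodeLatin1(byte[] raw) {
        return new String(raw, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // Stand-in for bytes read from disk: a Danish word encoded as UTF-8.
        String original = "bl\u00e5b\u00e6r";
        byte[] raw = original.getBytes(StandardCharsets.UTF_8);

        String right = decodeUtf8(raw);
        String wrong = decodeLatin1(raw);

        // With the matching charset the text round-trips unchanged.
        System.out.println("UTF-8 decode matches original: " + right.equals(original));
        // With the wrong charset each two-byte letter turns into two chars,
        // so an indexed term would no longer equal the term being searched.
        System.out.println("ISO-8859-1 decode matches original: " + wrong.equals(original));
        System.out.println("lengths: " + right.length() + " vs " + wrong.length());
    }
}
```

    If the mismatched decode looks like what ends up in the index, wrapping the file stream in an InputStreamReader with an explicit charset before handing the text to the indexer would be the thing to try.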
