Hi all,
Lucene has many analyzers, but there is no specific analyzer for Indian languages (Hindi, Tamil, Marathi, etc.).
I want to know how to index and search Unicode documents.
If anybody knows, please reply.
Regards,
goms
Solr (which is built on Lucene) has components called "tokenizers" that split text into individual terms for indexing. You can see a good talk on the subject from Lucid Imagination: "Analyze THIS! Free webinar on getting the Lucene/Solr analyzer to index and search your content right".
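To give a rough idea of what a tokenizer does with Indic text, here is a minimal sketch in plain Java. It does not use the Lucene or Solr API at all; the class name `UnicodeTokenizerSketch` and its methods are made up for illustration. It assumes a "word" is a run of letters, digits, and combining marks, which is a simplification (for example, zero-width joiners would split a word here). The point it shows is that vowel signs in Hindi, Tamil, or Marathi are combining marks rather than letters, so a tokenizer has to keep them attached to the word.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Lucene or Solr.
class UnicodeTokenizerSketch {

    // True for code points kept inside a token: letters, digits,
    // and combining marks (Indic vowel signs such as U+093F are marks).
    static boolean isWordCodePoint(int cp) {
        if (Character.isLetterOrDigit(cp)) {
            return true;
        }
        int type = Character.getType(cp);
        return type == Character.NON_SPACING_MARK
            || type == Character.COMBINING_SPACING_MARK
            || type == Character.ENCLOSING_MARK;
    }

    // Splits text into tokens at every code point that is not part of a word
    // (spaces, punctuation, and so on).
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (isWordCodePoint(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Two Hindi words (written as escapes to avoid source-encoding issues),
        // followed by punctuation and some Latin text.
        String text = "\u0939\u093F\u0902\u0926\u0940 \u092D\u093E\u0937\u093E, Lucene 3";
        for (String token : tokenize(text)) {
            System.out.println(token);
        }
    }
}
```

In a real setup the analyzer or tokenizer shipped with Lucene/Solr would do this job; the sketch is only meant to show why Unicode-aware splitting matters for these scripts.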