Help creating an IndexWriter for character n-grams
I want to create an IndexWriter that indexes character n-grams. For example, given the text "Lucene is a great language", I want n-grams with n=3, so the text becomes: Luc uce cen ene is a gre rea eat....
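To make the expected output concrete, here is a minimal plain-Java sketch of the per-word character n-grams described above. It does not use Lucene at all; the class name `WordNGrams` and the method `charNGrams` are hypothetical names chosen for illustration, and keeping words shorter than n whole (such as "is" and "a") is an assumption taken from the example output.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: builds character n-grams word by word.
final class WordNGrams {

    // Splits the text on whitespace and slides a window of n characters over each word.
    // Assumption: a word shorter than n is emitted unchanged, as in the example above.
    static List<String> charNGrams(String text, int n) {
        List<String> grams = new ArrayList<String>();
        for (String word : text.trim().split("\\s+")) {
            if (word.isEmpty()) {
                continue; // happens only when the input is blank
            }
            if (word.length() <= n) {
                grams.add(word);
                continue;
            }
            for (int i = 0; i + n <= word.length(); i++) {
                grams.add(word.substring(i, i + n));
            }
        }
        return grams;
    }
}
```

Calling `WordNGrams.charNGrams("Lucene is a great language", 3)` gives Luc, uce, cen, ene, is, a, gre, rea, eat, lan, ang, ngu, gua, uag, age. The n-grams never cross word boundaries, which is the behaviour I am hoping to get from the index as well.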
IndexWriter writer = new IndexWriter(INDEX_DIR, new PositionalPorterStopAnalyzer(), true, IndexWriter.MaxFieldLength.UNLIMITED);
Reader reader = new FileReader(f);
Document doc = new Document();
NGramTokenizer token = new NGramTokenizer(reader, 3, 3);
doc.add(new Field("contents", new FileReader(f)));
With the code above, the index only contains tokens extracted 3 characters at a time; they are not n-grams.
Can anyone help me with this issue? Why does the NGramTokenizer above only extract 3 characters instead of producing 3-character n-grams?
Thanks very much in advance for your help.