View RSS Feed


  1. Cassandra Build the index

    by , 02-23-2012 at 07:27 PM
    When the data is ready, next step is to store it into column family. All the tags that are created in tokenizer can be processed in this step. Tokenizer has provided us a list of tags with document IDs. With the help of this information, we can do the following:
    Check the tags for duplication.
    Write data to column family in Cassandra.

    Java Code: This is the code to explain index buildning
    private void tokenize(String doc, String docID) {
            //remove all none alpha numeric vals