Need ideas on best practices of Lucene - Indexing & Reindexing!
We would like to implement a Lucene based full text search for a 80 GB sized database. We are novice to Lucene, and the current plan is to achieve this by storing the index terms in a flat file.
1) For Indexing, is it wise to choose Zend Lucene (in terms of performance, stability & usage)? Research shows that Zend/PHP Lucene is much slower! So, would it better to use Java Lucene (or SOLR) for the indexing and Zend Framework for querying the search results?
2) In case of insert/update/delete in the records, how to handle the re-indexing? Please advise how shall we update/delete only the relevant portions of the file.