variogr.am home | notes | writings | gallery

variogr.am latest

SolrIndexer for Nutch

[This post might be inane if you don’t use Solr or Nutch, I am sorry, more iPhone and Music of the Future posts soon]

Back in early 07 Sami Siren posted an excellent Nutch->Solr Indexer. Anyway, it hasn’t been maintained much due to the forthcoming NUTCH-442. But for simplicity’s sake, I like the SolrIndexer, and here’s a zip file with the updated source to get it compiled on Nutch trunk and using Solr trunk solrj. You’ll also need to do this:

+++ src/java/org/apache/nutch/indexer/Indexer.java      (working copy)
-  private static class LuceneDocumentWrapper implements Writable {
+  public static class LuceneDocumentWrapper implements Writable {

And follow the rest of Sami’s post.

Leave a Reply