For the past year I’ve been talking with Christopher Ball and Erik Hatcher about how frustrating it is to have to carve time out of our regular day jobs to work on Solr and Lucene. Thanks to their enthusiasm for the idea of spending a couple of days hacking on Solr and Lucene, last weekend OSC hosted folks who spent two days of writing code and learning from each other. And since hacking all day Friday and Saturday burns you out, some of us went tubing on Sunday!
“by the numbers” what happened:
Attendees: 12, Eric, Scott, Matt, John, Kasey, David, Joesph, Erik, Anthony, Jessica, Christopher, Jake
Visitors who stopped in for some search chat: 2
Wiki Pages Updated: 2
Motto for the weekend: “It’s literally going to take at most an hour. Make that 18 minutes.” – David Dodge
We had Show and Tell each day, and here is what we saw:
- A “single page” search app using EmberJS from Matt Overstreet that demonstrated the awesome data binding aspects of EmberJS. Instant Search results were as simple as binding the results pane to the query box and issuing queries to Solr.
- Anthony Burton walked us through some challenges he was having in using XPath with DataImportHandler to index xml documents. We all debugged the issues as a group, with David Dodge having the key insight into the pattern of XPath that was required to make it all work. Lots of discussion on the pro’s and con’s of DIH!
- I attempted to demo using Apache UIMA with Solr. Jessica Bonnie and I hacked on the example app documented in the wiki at http://wiki.apache.org/solr/SolrUIMA, and at the end of it I had the WhitespaceTokenizer working through the UIMA framework. However, no luck in getting the more interesting AlchemyAPI and OpenCalais integrations to work.
- John used node.js to index the public data from StackOverflow directly into Solr via the JSON updater in Solr.