Build Your Own Lucene Codec!
A Lucene Codec, a bit twiddler's dream! I've been having a lot of fun hacking on a Lucene Codec lately. My hope is to create a Lucene storage layer…
In our recent and continuing effort to make the world a better place, we have been working with the illustrious Waldo Jaquith on a project called StateDecoded. Basically, we're…
Over the past few weeks I've been working with Elasticsearch. However, what I have really been missing is a good deployment system. Elasticsearch has a Chef provisioning tutorial on…
A pretty subtle change happened in the transition from Lucene/Solr 3 to 4. The abstract method for CharFilterFactory changed from public CharStream create(CharStream input); to public abstract Reader create(Reader…
In our daily adventures on our computers and the Internet, we often find or create notes and documents that we would like to keep for later and that we attempt to organize…
Little did you know, Lucene stores its index on hundreds of Post-It Notes. A new feature of Lucene 4 – pluggable codecs – allows for the modification of Lucene’s…
I’m Jonathan Thompson, a rising 4th year CS major at UVA. This is my first blog post as an intern at OSC; I hope you enjoy it! While…
The team recently spent time at Lucene/Solr Revolution. Much of the crew just got back from Lucene Revolution. It was an incredible experience to hang out with the cream-of-the-crop…
On a recent engagement, we were faced with the problem of sorting through 6.5 million foreign patent documents and indexing them into Solr. This totaled about 1 TB of…
I ran into an interesting problem today. I was working on the first project where we legitimately needed Solr soft commits, and in testing my configuration I wanted to…
By this point everyone is well acquainted with the power of Hadoop's MapReduce. But what you're also probably well acquainted with is the pain that must be suffered when…
Let’s recall from Anatomy of a Dismax Query some key components of the dismax query parser: OK, now we’ve had plenty of time to study John’s post (and hey…