Beginner Tips for Elastic MapReduce
By this point everyone is well acquainted with the power of Hadoop’s MapReduce. But what you’re also probably well acquainted with is the pain that must be suffered when…
Let’s recall from Anatomy of a Dismax Query some key components of the dismax query parser: OK, now we’ve had plenty of time to study John’s post (and hey…
Next Stop Debuggersville! Recently I was puzzled by some behavior Solr was showing me. I scratched my head and called over a colleague. We couldn’t quite figure out what…
English is a confusing language. I mean, does it really make sense that you can park in a driveway or drive in a parkway? Also, I’ve always been amused…
Learn you some Mahout for great good! Yesterday, John and I gave a talk to the DC Hadoop Users Group about using Mahout with Solr to perform Latent Semantic…
I just spent some time putting together some basic Java code to read some data from HDFS. Pretty basic stuff. No MapReduce involved. Pretty boilerplate code like the…
Yesterday I attended an event hosted by Booz Allen/Amazon around Big Data and Cloud Computing for life sciences. It was a fascinating event that brought together folks from data…
OSC came to me with an interesting task: To do a quick refresh of SmallBizContracting.com. The first thing that I did was to visit the site so that I…
This past Wednesday and Thursday (March 13th and 14th) OpenSource Connections held an on-site 2-day Solr training course called Solr Unleashed. We covered a broad range of Solr topics…
Last month we found the best time to ask a question on StackOverflow using the oft-missed ‘join’ feature in Solr. Numbers will get you far in isolating the top-performing…
A while ago we created a sample index of US patent grants roughly 700k documents big. Adjacently we pulled down the corresponding multi-page TIFFs of those grants and made…
So here’s the problem: You’ve finally found a block of time to sit down and get your head around Hadoop and MapReduce. You do a quick Google search for…