…right, as shown in this screenshot: At first blush, these Solr results for Johnny Depp aren’t so great. That’s ok, you’re just getting started with Solr. Now it’s up to…
…that magic time is? Simple, you index the entire StackOverflow data dump into Solr and treat Solr as a StackOverflow analytics engine. (Hey, you may not know the difference between…
Crawling with Nutch
…good enough that I could pull data into Solr. However, users of a non-LWS Solr may also need to add a version field. In addition, if you need…
…in his post, when we come out with a second version of the Solr book, updating it for changes between when we published it and the final release of Solr…
…which Solr is built. I will then demonstrate how Lucene may be used to build a basic search engine. After this, I will introduce the Solr search engine. Solr can be…
…documents. The only additional setup required is to enable the MoreLikeThis requestHandler within solrconfig.xml. Using Solr's MoreLikeThis Handler Now (after having restarted Solr) you can issue MoreLikeThis queries like this:…
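A rough sketch of what such a request might look like: the Python below only builds a MoreLikeThis URL and does not contact Solr. The core name `movies`, the `/mlt` handler path, the document id `1234`, and the field `overview` are assumptions for illustration and are not taken from the original post; the handler path in particular depends on how the requestHandler is registered in solrconfig.xml.

```python
from urllib.parse import urlencode


def build_mlt_url(base_url, doc_query, fields, rows=5, min_tf=1, min_df=1):
    """Build a URL for a Solr MoreLikeThis request (sketch, nothing is sent).

    base_url  -- core URL, e.g. http://localhost:8983/solr/movies (assumed name)
    doc_query -- query selecting the source document, e.g. "id:1234"
    fields    -- fields used to compute similarity (mlt.fl)
    """
    params = {
        "q": doc_query,
        "mlt.fl": ",".join(fields),  # fields to mine for interesting terms
        "mlt.mintf": min_tf,         # minimum term frequency in the source doc
        "mlt.mindf": min_df,         # minimum document frequency in the index
        "rows": rows,                # number of similar documents to return
        "wt": "json",
    }
    # "/mlt" is an assumed handler name; use whatever solrconfig.xml defines.
    return f"{base_url}/mlt?{urlencode(params)}"


url = build_mlt_url("http://localhost:8983/solr/movies", "id:1234", ["overview"])
print(url)
```

The resulting URL can be pasted into a browser or passed to any HTTP client once the handler is enabled and Solr has been restarted.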
…to send documents to Solr, adding Luwak to it would be a simple way to implement query-based classification or alerting independent of Solr. You’ll still have the issue of query…
…thing about this strategy is we’re directly telling Solr exactly what we want in an awesome candidate. It’s a bit like using Solr for a fuzzy sorter, explicitly feeding it…
Recently I saw this post on the solr-user mailing list asking about running Tika for text extraction in Solr, which, if you follow the thread, led to a chorus of people saying:…
…Apache OpenNLP. We will use the model to perform sentiment analysis on movie descriptions. Using the Model in OpenNLP with Solr We can use the model from Apache Solr by…
…<fieldType name="text_opennlp" class="solr.TextField" positionIncrementGap="100" multiValued="true" > <analyzer> <tokenizer class="solr.OpenNLPTokenizerFactory" sentenceModel="en-sent.bin" tokenizerModel="en-token.bin"/> <filter class="solr.OpenNLPPOSFilterFactory" posTaggerModel="en-pos-maxent.bin"/> <filter class="solr.OpenNLPChunkerFilterFactory" chunkerModel="en-chunker.bin"/> <filter class="solr.TypeAsPayloadFilterFactory"/> </analyzer> </fieldType> <field type="text_opennlp" name="title_opennlp" multiValued="true" /> <field type="text_opennlp" name="overview_opennlp" multiValued="true"…
One thing I really like about Solr is that it's super easy to get started. You just download Solr, fire it up, and then after following the 10 minute tutorial…