Lucene Archives - Page 2 of 3 - OpenSource Connections

November 21, 2017 Doug Turnbull

Solr Synonyms and Taxonomies: Mea Culpa

I screwed up! Mea Culpa. Let he who is without synonyms throw the first rock! In my talk Taxonomical Semantical Magical Search, I presented an index-time hypernym/hyponym expansion solution….

October 6, 2017 Matt Overstreet

Vespa vs Lucene: First Impressions

As we learn more about Vespa, we wanted to give our initial impressions when comparing to Lucene-based search (Solr/Elasticsearch). This is based on initial passes with Vespa and our…

July 10, 2017 Elizabeth Haubert

Caching In Elasticsearch

Responding to queries takes CPU time, memory, and in unfortunate cases, wall time as well. Increasing the power of a cluster helps, over-provisioning can be very expensive. Caching is…

February 20, 2017 Elizabeth Haubert

Solr UTF-8 Character Handling

Searching for non-ASCII characters can be a challenge. There are a number of reasons for doing so, even in a primarily English corpus: Accented characters in names and words…

October 19, 2016 Doug Turnbull

BM25F in Lucene with BlendedTermQuery

As part of the London hack days Diego Ceccarelli started a BM25F implementation. I began to continue it at Lucene Revolution’s Lucene hackathon. I realized though that when you…

August 1, 2016 Doug Turnbull

Search for Lunch: this search expert will come speak for free at your company’s lunch and learn

I want to share what I know at your company’s lunch and learn! For free! The problem with lunch and learn’s is that everyone’s busy. It’s challenging to find…

June 1, 2016 Doug Turnbull

Thoughts on Algolia (vs Solr & Elasticsearch)

After getting cranky on one Algolia blog post, and having a Search Disco episode with Julien Lemoine CTO of Algolia, I’m left fascinated by the solution. Algolia, so…

February 5, 2016 Scott Stults

How to use Luwak to run preset queries against incoming documents

Overview Quite a while ago Flax released Luwak as a document monitoring and alerting library. It was designed to solve the problem of running a lot of predetermined queries…

October 29, 2015 Scott Stults

Recap of Lucene Revolution 2015 in Austin

Every one of us at OSC looks forward to Lucene Revolution. It’s one of the few conferences we attend where everyone understands search at a deep level. That means…

October 16, 2015 Doug Turnbull

BM25 The Next Generation of Lucene Relevance

There’s something new cooking in how Lucene scores text. Instead of the traditional “TF*IDF,” Lucene just switched to something called BM25 in trunk. That means a new scoring formula…

July 7, 2015 Eric Pugh

Recap of VLDS Insights Conference

VLDS Insights Conference Recap Yesterday I had the privilege of attending the VLDS Insights conference. VLDS is the Virginia Longitudinal Data System, which provides researchers and policy makers anonymized…

July 6, 2015 Doug Turnbull

“Relevant Search” Chapters 4 and 5 Now Available!

We’re pleased to announce that Chapters 4 and 5 are available for early access for Relevant Search! Please read and give us feedback. This is early access for a…

Category: Lucene